More code data for you to use!!
#2, opened by rombodawg
If you want additional coding datasets to use, I made a mega compilation that doesn't include Code Alpaca, so you could add it to your dataset to train the models for coding.
Link:
https://huggingface.co/datasets/rombodawg/MegaCodeTraining112k
This is amazing 🫡
Great work!
@TokenBender I have made an updated version (V3) of the coding dataset, if you are interested in using it:
https://huggingface.co/datasets/rombodawg/LosslessMegaCodeTrainingV3_2.2m_Evol