Automatic data mixture method for large language model pre-training
Sea AI Lab
company
Verified
AI & ML interests
None defined yet.
models
38
sail/Sailor-14B-Chat
Text Generation
•
Updated
•
43
•
11
sail/scaling-with-vocab-trained-tokenizers
Updated
•
2
sail/scaling-vocab-3b-32k-overtrain
Text Generation
•
Updated
•
11
sail/scaling-vocab-3b-43k-overtrain
Text Generation
•
Updated
•
11
sail/Zephyr-7B-DICE-Iter2
Text Generation
•
Updated
•
52
sail/Zephyr-7B-DICE-Iter1
Text Generation
•
Updated
•
68
sail/Llama-3-Base-8B-DICE-Iter2
Text Generation
•
Updated
•
43
•
2
sail/Llama-3-Base-8B-DICE-Iter1
Text Generation
•
Updated
•
45
•
1
sail/Sailor-0.5B-Chat-gguf
Updated
•
405
•
4
sail/Sailor-1.8B-Chat-gguf
Updated
•
409
•
2