3mei/llama_3.1_instruct_4bit_reflection_405_v1_gsm8k_3e_qkvogud Text Generation • Updated 6 days ago • 6
3mei/llama_3.1_instruct_4bit_evolutionary_405_v1_gsm8k_3e_qkvogud Text Generation • Updated 6 days ago • 5
SongTonyLi/Phi-3.5-mini-instruct-SFT-D1_chosen-dpo-mix-shuffled5 Text Generation • Updated 6 days ago • 63
SongTonyLi/Phi-3.5-mini-instruct-SFT-D1_chosen-then-D2_chosen-dpo-mix-shuffled5 Text Generation • Updated 6 days ago • 36