self-correct_mistral-small-it_mMQA_dpo_iter1_beta.05 / model-00007-of-00009.safetensors

Commit History

Training in progress, step 36
2dfe419
verified

RyanYr commited on