Locutusque/llama-3-neural-chat-v2.2-8B


Locutusque committed on May 2

Commit 77102a7 • 1 Parent(s): a272980

Update README.md

Files changed (1)

README.md +2 -1

README.md CHANGED

@@ -21,7 +21,8 @@ pipeline_tag: text-generation
 <!-- Provide a longer summary of what this model is. -->
-I fine-tuned llama-3 8B on an approach similar to Intel's neural chat language model. I have slightly modified the data sources so it is stronger in coding, math, and writing. I use both SFT and DPO.
 - **Developed by:** Locutusque
 - **Model type:** Built with Meta Llama 3

 <!-- Provide a longer summary of what this model is. -->
+I fine-tuned llama-3 8B on an approach similar to Intel's neural chat language model. I have slightly modified the data sources so it is stronger in coding, math, and writing. I use both SFT and DPO-Positive.
+DPO-Positive dramatically improves performance over DPO.
 - **Developed by:** Locutusque
 - **Model type:** Built with Meta Llama 3
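
The README change above does not say how DPO-Positive was implemented or measured. As a rough illustration only, the sketch below assumes "DPO-Positive" refers to the commonly described DPOP objective, which adds a penalty to the standard DPO loss whenever the policy assigns the chosen response a lower log-probability than the reference model does. The function names, the example log-probabilities, and the `beta` and `lam` values are hypothetical and are not taken from this repository.

```python
import math


def log_sigmoid(x: float) -> float:
    """Numerically stable log(sigmoid(x))."""
    if x >= 0:
        return -math.log1p(math.exp(-x))
    return x - math.log1p(math.exp(x))


def dpo_loss(pol_w: float, pol_l: float, ref_w: float, ref_l: float,
             beta: float = 0.1) -> float:
    """Standard DPO loss for one preference pair.

    Arguments are summed sequence log-probabilities of the chosen (w) and
    rejected (l) responses under the policy (pol) and reference (ref) models.
    """
    margin = (pol_w - ref_w) - (pol_l - ref_l)
    return -log_sigmoid(beta * margin)


def dpop_loss(pol_w: float, pol_l: float, ref_w: float, ref_l: float,
              beta: float = 0.1, lam: float = 5.0) -> float:
    """Assumed DPO-Positive (DPOP) loss for one preference pair.

    Same as DPO, minus a penalty that is active only when the policy's
    log-probability of the chosen response drops below the reference's.
    `lam` (hypothetical value) scales that penalty.
    """
    margin = (pol_w - ref_w) - (pol_l - ref_l)
    penalty = max(0.0, ref_w - pol_w)
    return -log_sigmoid(beta * (margin - lam * penalty))


# Hypothetical pair: the policy widened the margin, but only by lowering the
# chosen response's log-probability less than the rejected one's.
plain = dpo_loss(pol_w=-12.0, pol_l=-20.0, ref_w=-10.0, ref_l=-15.0)
positive = dpop_loss(pol_w=-12.0, pol_l=-20.0, ref_w=-10.0, ref_l=-15.0)
print(f"DPO loss:  {plain:.4f}")
print(f"DPOP loss: {positive:.4f}")
```

In this made-up pair, plain DPO sees a positive margin and assigns a low loss, while the DPOP variant assigns a higher loss because the chosen response became less likely than under the reference. When the policy keeps the chosen response at or above the reference log-probability, the two losses coincide.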