Locutusque commited on
Commit
77102a7
1 Parent(s): a272980

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -1
README.md CHANGED
@@ -21,7 +21,8 @@ pipeline_tag: text-generation
21
 
22
  <!-- Provide a longer summary of what this model is. -->
23
 
24
- I fine-tuned llama-3 8B on an approach similar to Intel's neural chat language model. I have slightly modified the data sources so it is stronger in coding, math, and writing. I use both SFT and DPO.
 
25
 
26
  - **Developed by:** Locutusque
27
  - **Model type:** Built with Meta Llama 3
 
21
 
22
  <!-- Provide a longer summary of what this model is. -->
23
 
24
+ I fine-tuned llama-3 8B on an approach similar to Intel's neural chat language model. I have slightly modified the data sources so it is stronger in coding, math, and writing. I use both SFT and DPO-Positive.
25
+ DPO-Positive dramatically improves performance over DPO.
26
 
27
  - **Developed by:** Locutusque
28
  - **Model type:** Built with Meta Llama 3