Locutusque
commited on
Commit
•
77102a7
1
Parent(s):
a272980
Update README.md
Browse files
README.md
CHANGED
@@ -21,7 +21,8 @@ pipeline_tag: text-generation
|
|
21 |
|
22 |
<!-- Provide a longer summary of what this model is. -->
|
23 |
|
24 |
-
I fine-tuned llama-3 8B on an approach similar to Intel's neural chat language model. I have slightly modified the data sources so it is stronger in coding, math, and writing. I use both SFT and DPO.
|
|
|
25 |
|
26 |
- **Developed by:** Locutusque
|
27 |
- **Model type:** Built with Meta Llama 3
|
|
|
21 |
|
22 |
<!-- Provide a longer summary of what this model is. -->
|
23 |
|
24 |
+
I fine-tuned llama-3 8B on an approach similar to Intel's neural chat language model. I have slightly modified the data sources so it is stronger in coding, math, and writing. I use both SFT and DPO-Positive.
|
25 |
+
DPO-Positive dramatically improves performance over DPO.
|
26 |
|
27 |
- **Developed by:** Locutusque
|
28 |
- **Model type:** Built with Meta Llama 3
|