---
license: apache-2.0
datasets:
- Intel/orca_dpo_pairs
---

traversaal-2.5-Mistral-7B is trained via Direct Preference Optimization (DPO) from teknium/OpenHermes-2.5-Mistral-7B as its base model, with several optimizations in hyperparameters.
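
The DPO step mentioned above optimizes a policy directly on preference pairs (such as those in Intel/orca_dpo_pairs) against a frozen reference model. As a hedged illustration only (the `beta` value and log-probabilities below are made up, not this model's actual hyperparameters), the per-pair loss can be sketched as:

```python
import math

def dpo_loss(policy_chosen_logp, policy_rejected_logp,
             ref_chosen_logp, ref_rejected_logp, beta=0.1):
    """DPO loss for a single preference pair.

    Each argument is the summed log-probability of the chosen/rejected
    response under the trainable policy or the frozen reference model.
    beta controls how far the policy may drift from the reference.
    """
    # Log-ratio of policy vs. reference for each response
    chosen_logratio = policy_chosen_logp - ref_chosen_logp
    rejected_logratio = policy_rejected_logp - ref_rejected_logp
    # DPO objective: -log sigmoid(beta * (chosen_ratio - rejected_ratio))
    margin = beta * (chosen_logratio - rejected_logratio)
    return math.log(1.0 + math.exp(-margin))

# If the policy favors the chosen response more than the reference does,
# the margin is positive and the loss drops below log(2) (the zero-margin value).
loss = dpo_loss(-10.0, -14.0, -11.0, -13.0, beta=0.1)
```

In practice a library such as `trl` performs this over batches of tokenized pairs; the sketch only shows the shape of the objective.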

teknium/OpenHermes-2.5-Mistral-7B is trained via Supervised Fine-Tuning (SFT) using LoRA, with Mistral-7B as its base model.
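
A LoRA adapter leaves the base weight frozen and learns a low-rank correction that is added to the layer's output. As a minimal, dependency-free sketch (the shapes and scaling here are illustrative, not this model's actual adapter configuration):

```python
def lora_forward(x, W, A, B, alpha=16.0):
    """Forward pass of a LoRA-adapted linear layer (no bias).

    W is the frozen d_out x d_in base weight; B (d_out x r) and
    A (r x d_in) are the trainable low-rank update, scaled by alpha / r.
    Plain-Python matrices (lists of lists) keep the sketch self-contained.
    """
    r = len(A)
    scale = alpha / r

    def matvec(M, v):
        return [sum(m_ij * v_j for m_ij, v_j in zip(row, v)) for row in M]

    base = matvec(W, x)                 # frozen path: W @ x
    low_rank = matvec(B, matvec(A, x))  # adapter path: B @ (A @ x)
    return [b + scale * l for b, l in zip(base, low_rank)]

# Rank-1 adapter on a 2x2 identity layer; with A and B zeroed
# the adapter is inactive and the output equals W @ x.
x = [1.0, 2.0]
W = [[1.0, 0.0], [0.0, 1.0]]
A = [[0.0, 0.0]]        # r x d_in
B = [[0.0], [0.0]]      # d_out x r
out = lora_forward(x, W, A, B)  # [1.0, 2.0]
```

Because only A and B are trained, the update can later be folded into W or kept as a separate adapter; this card states that no weight merging was used.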

Note that we did not use any form of weight merging.

For the leaderboard submission, the trained weights were realigned for compatibility with Mistral-7B.