Triangle104 commited on
Commit
e613915
1 Parent(s): ab82257

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +33 -0
README.md CHANGED
@@ -111,6 +111,39 @@ model-index:
111
  This model was converted to GGUF format from [`nbeerbower/Mistral-Small-Drummer-22B`](https://huggingface.co/nbeerbower/Mistral-Small-Drummer-22B) using llama.cpp via the ggml.ai's [GGUF-my-repo](https://huggingface.co/spaces/ggml-org/gguf-my-repo) space.
112
  Refer to the [original model card](https://huggingface.co/nbeerbower/Mistral-Small-Drummer-22B) for more details on the model.
113
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
114
  ## Use with llama.cpp
115
  Install llama.cpp through brew (works on Mac and Linux)
116
 
 
111
  This model was converted to GGUF format from [`nbeerbower/Mistral-Small-Drummer-22B`](https://huggingface.co/nbeerbower/Mistral-Small-Drummer-22B) using llama.cpp via the ggml.ai's [GGUF-my-repo](https://huggingface.co/spaces/ggml-org/gguf-my-repo) space.
112
  Refer to the [original model card](https://huggingface.co/nbeerbower/Mistral-Small-Drummer-22B) for more details on the model.
113
 
114
+ ---
115
+ Model details:
116
+ -
117
+ mistralai/Mistral-Small-Instruct-2409 finetuned on jondurbin/gutenberg-dpo-v0.1 and nbeerbower/gutenberg2-dpo.
118
+ Method
119
+
120
+ ORPO tuned with 2xA40 on RunPod for 1 epoch.
121
+
122
+ learning_rate=4e-6,
123
+ lr_scheduler_type="linear",
124
+ beta=0.1,
125
+ per_device_train_batch_size=4,
126
+ per_device_eval_batch_size=4,
127
+ gradient_accumulation_steps=8,
128
+ optim="paged_adamw_8bit",
129
+ num_train_epochs=1,
130
+
131
+ Dataset was prepared using Mistral-Small Instruct format.
132
+
133
+ Fine-tune Llama 3 with ORPO
134
+ Open LLM Leaderboard Evaluation Results
135
+
136
+ Detailed results can be found here
137
+ Metric Value
138
+ Avg. 29.45
139
+ IFEval (0-Shot) 63.31
140
+ BBH (3-Shot) 40.12
141
+ MATH Lvl 5 (4-Shot) 16.69
142
+ GPQA (0-shot) 12.42
143
+ MuSR (0-shot) 9.80
144
+ MMLU-PRO (5-shot) 34.39
145
+
146
+ ---
147
  ## Use with llama.cpp
148
  Install llama.cpp through brew (works on Mac and Linux)
149