Update README.md
README.md
@@ -63,7 +63,9 @@ parameters:
 t:
 - value: 0.0001
 dtype: bfloat16
-
+
+# Then, DPO Finetune
+
 ```
 
 *Resultant merge finetuned* on [jondurbin/gutenberg-dpo-v0.1](https://huggingface.co/datasets/jondurbin/gutenberg-dpo-v0.1) for 1 epoch, 1.5e-5 learning rate, on Nvidia A100.