akaistormherald committed
Commit 3c2b1ab
1 Parent(s): 5fd0ae7
Update README.md
README.md
CHANGED
@@ -10,6 +10,8 @@ tags:
 - trl
 - dpo
 base_model: unsloth/zephyr-sft-bnb-4bit
+datasets:
+- unalignment/toxic-dpo-v0.2
 ---
 
 # Uploaded model
@@ -18,6 +20,4 @@ base_model: unsloth/zephyr-sft-bnb-4bit
 - **License:** apache-2.0
 - **Finetuned from model :** unsloth/zephyr-sft-bnb-4bit
 
-
-
-[<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)
+Mistral7b + SFT + 4bit DPO training with unalignment/toxic-dpo-v0.2 == ToxicMist? ☣🌫
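
The added README line describes DPO training on top of an SFT checkpoint. As a rough illustration of what that objective computes for a single preference pair, here is a minimal sketch of the standard DPO loss in plain Python. It is not the training code behind this commit: the function name `dpo_loss`, the `beta=0.1` default, and the log-probability values are assumptions for illustration, and 4-bit quantization and the Unsloth/TRL tooling are left out entirely.

```python
import math


def dpo_loss(policy_chosen_logp, policy_rejected_logp,
             ref_chosen_logp, ref_rejected_logp, beta=0.1):
    """Per-pair DPO loss: -log(sigmoid(beta * (chosen_ratio - rejected_ratio))).

    Inputs are summed token log-probabilities of the chosen and rejected
    responses under the trained policy and under the frozen reference
    (here, that would be the SFT model). beta=0.1 is an assumed value.
    """
    # How much more (in log space) the policy likes each response than the reference does.
    chosen_ratio = policy_chosen_logp - ref_chosen_logp
    rejected_ratio = policy_rejected_logp - ref_rejected_logp
    margin = beta * (chosen_ratio - rejected_ratio)

    # -log(sigmoid(m)) == log(1 + exp(-m)); branch to avoid overflow for large |m|.
    if margin >= 0:
        return math.log1p(math.exp(-margin))
    return -margin + math.log1p(math.exp(margin))


if __name__ == "__main__":
    # Hypothetical log-probabilities, not measured from any model.
    print(dpo_loss(-1.0, -3.0, -2.0, -2.0))  # policy prefers the chosen answer
    print(dpo_loss(-3.0, -1.0, -2.0, -2.0))  # policy prefers the rejected answer
```

The loss falls below log 2 when the policy favors the chosen response more than the reference does, and rises above it in the opposite case, which is the signal a preference dataset such as the one listed under `datasets:` supplies.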