bullerwins
commited on
Commit
•
6c02a42
1
Parent(s):
3c7df1a
Update README.md
Browse files
README.md
CHANGED
@@ -6,6 +6,8 @@ language:
|
|
6 |
- en
|
7 |
pipeline_tag: text-generation
|
8 |
---
|
|
|
|
|
9 |
Self-Play Preference Optimization for Language Model Alignment (https://arxiv.org/abs/2405.00675)
|
10 |
|
11 |
# Llama-3-Instruct-8B-SPPO-Iter3
|
|
|
6 |
- en
|
7 |
pipeline_tag: text-generation
|
8 |
---
|
9 |
+
Quantized to exl2 using [Exllamav2 0.1.6](https://github.com/turboderp/exllamav2)
|
10 |
+
|
11 |
Self-Play Preference Optimization for Language Model Alignment (https://arxiv.org/abs/2405.00675)
|
12 |
|
13 |
# Llama-3-Instruct-8B-SPPO-Iter3
|