Update README.md
README.md
CHANGED
@@ -19,6 +19,7 @@ license_link: LICENSE
 
 # Quant Infos
 
+- NOT Updated for new pre-tokenizer fixes (yet), I recommend using bartowski's quants. https://huggingface.co/bartowski/Meta-Llama-3-70B-Instruct-GGUF
 - quants done with an importance matrix for improved quantization loss
 - K & IQ quants in basically all variants
 - fixed end token for instruct mode (<|eot_id|>[128009])