qwp4w3hyb/Meta-Llama-3-70B-Instruct-iMat-GGUF

qwp4w3hyb committed on May 8

Commit 5c49cf7 • 1 Parent(s): 8994266

Update README.md

Files changed (1)

README.md +1 -0

@@ -19,6 +19,7 @@ license_link: LICENSE
 # Quant Infos
+- NOT Updated for new pre-tokenizer fixes (yet), I recommend using bartowski's quants. https://huggingface.co/bartowski/Meta-Llama-3-70B-Instruct-GGUF
 - quants done with an importance matrix for improved quantization loss
 - K & IQ quants in basically all variants
 - fixed end token for instruct mode (<|eot_id|>[128009])