InferenceIllusionist committed • e1591f2 • Parent(s): 36796e5
Update README.md
README.md
CHANGED
@@ -26,10 +26,12 @@ PROUDLY PRESENTS
 
 ## Neophanis-8x7B-iMat-GGUF
 
+<b>The Good, The Bad, And The Ugly iMats edition</b>
+
 Quantized from fp16 with love.
 * Quantizations made possible using mixtral-8x7b-instruct-v0.1.imatrix file from [this](https://huggingface.co/datasets/ikawrakow/imatrix-from-wiki-train) repo (special thanks to [ikawrakow](https://huggingface.co/ikawrakow) again)
-
-For a brief rundown of iMatrix quant performance please see this [PR](https://github.com/ggerganov/llama.cpp/pull/5747)
+* An analysis was run on mixtral-8x7b.imatrix that showed worse KL-Divergence than mixtral-8x7b-instruct-v0.1, hence the latter was used for the imatrixes instead
+* For a brief rundown of iMatrix quant performance please see this [PR](https://github.com/ggerganov/llama.cpp/pull/5747)
 
 <i>All quants are verified working prior to uploading to repo for your safety and convenience. </i>
 
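The README above justifies its imatrix choice by comparing KL-Divergence, which measures how far a quantized model's next-token probabilities drift from the fp16 baseline (lower is better). As a minimal sketch of the metric itself — not the author's actual analysis, and using made-up toy distributions rather than real model outputs:

```python
import math

def kl_divergence(p, q, eps=1e-10):
    """KL(P || Q) in nats for two discrete distributions over the same vocabulary.

    eps guards against log(0) when a distribution assigns zero probability.
    """
    return sum(pi * math.log((pi + eps) / (qi + eps)) for pi, qi in zip(p, q))

# Toy next-token distributions over a 4-token vocabulary (hypothetical values):
baseline = [0.70, 0.20, 0.08, 0.02]   # stand-in for the fp16 model's output
quant_a  = [0.68, 0.21, 0.09, 0.02]   # stays close to baseline -> small KL
quant_b  = [0.40, 0.35, 0.15, 0.10]   # drifts further away   -> larger KL

# A quant whose distributions track the fp16 baseline more closely wins:
assert kl_divergence(baseline, quant_a) < kl_divergence(baseline, quant_b)
```

In practice this comparison is run per token position over a evaluation corpus and averaged; llama.cpp's perplexity tooling reports such statistics directly, which is what the linked PR discusses.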