Qwen2.5-32B-Instruct-GGUF / perplexity.md
ThomasBaruzier's picture
Upload perplexity.md
1fe2290 verified

Qwen2.5-32B-Instruct Quant Size (MB) PPL Size (%) Accuracy (%) PPL error rate IQ1_S 6938 12.2991 0.08384 IQ1_M 7565 10.2638 0.06990 IQ2_XS 9497 7.3601 0.04846 IQ2_S 9907 7.2397 0.04762 IQ2_M 10743 6.7268 0.04354 Q2_K_S 10956 6.9981 0.04644 Q2_K 11743 6.6603 0.04324 IQ3_XXS 12245 6.1570 0.03929 IQ3_XS 13071 6.0366 0.03833 Q3_K_S 13726 6.0878 0.03872 IQ3_S 13769 5.9886 0.03816 IQ3_M 14125 5.9942 0.03802 Q3_K_M 15197 5.8008 0.03677 Q3_K_L 16449 5.7812 0.03667 IQ4_XS 16874 5.6502 0.03586 IQ4_NL 17817 5.6408 0.03575 Q4_0 17845 5.6946 0.03599 Q4_K_S 17915 5.6367 0.03561 Q4_K_M 18932 5.6224 0.03554 IQ2_XXS 8611 8.0187 0.05388 Q4_1 19684 5.6586 0.03587 Q5_K_S 21590 5.5680 0.03515 Q5_0 21658 5.5880 0.03538 Q5_K_M 22185 5.5670 0.03515 Q5_1 23496 5.5734 0.03520 Q6_K 25641 5.5305 0.03483 Q8_0 33208 5.5221 0.03478 F16 62500 5.5191 0.03474