ThomasBaruzier committed
Commit 4045143
1 Parent(s): 76b9b85

Update README.md

Files changed (1)
  1. README.md +31 -0
README.md CHANGED
@@ -25,6 +25,37 @@ All quants were made using the imatrix option and Bartowski's [calibration file]
  # Perplexity table (the lower the better)
 
+ | Quant | Size (MB) | PPL | Size (%) | Accuracy (%) | PPL error rate |
+ | ------- | --------- | ------- | -------- | ------------ | -------------- |
+ | IQ1_S | 6938 | 12.2991 | 11.1 | 44.87 | 0.08384 |
+ | IQ1_M | 7565 | 10.2638 | 12.1 | 53.77 | 0.0699 |
+ | IQ2_XS | 9497 | 7.3601 | 15.2 | 74.99 | 0.04846 |
+ | IQ2_S | 9907 | 7.2397 | 15.85 | 76.23 | 0.04762 |
+ | IQ2_M | 10743 | 6.7268 | 17.19 | 82.05 | 0.04354 |
+ | Q2_K_S | 10956 | 6.9981 | 17.53 | 78.87 | 0.04644 |
+ | Q2_K | 11743 | 6.6603 | 18.79 | 82.87 | 0.04324 |
+ | IQ3_XXS | 12245 | 6.157 | 19.59 | 89.64 | 0.03929 |
+ | IQ3_XS | 13071 | 6.0366 | 20.91 | 91.43 | 0.03833 |
+ | Q3_K_S | 13726 | 6.0878 | 21.96 | 90.66 | 0.03872 |
+ | IQ3_S | 13769 | 5.9886 | 22.03 | 92.16 | 0.03816 |
+ | IQ3_M | 14125 | 5.9942 | 22.6 | 92.07 | 0.03802 |
+ | Q3_K_M | 15197 | 5.8008 | 24.32 | 95.14 | 0.03677 |
+ | Q3_K_L | 16449 | 5.7812 | 26.32 | 95.47 | 0.03667 |
+ | IQ4_XS | 16874 | 5.6502 | 27 | 97.68 | 0.03586 |
+ | IQ4_NL | 17817 | 5.6408 | 28.51 | 97.84 | 0.03575 |
+ | Q4_0 | 17845 | 5.6946 | 28.55 | 96.92 | 0.03599 |
+ | Q4_K_S | 17915 | 5.6367 | 28.66 | 97.91 | 0.03561 |
+ | Q4_K_M | 18932 | 5.6224 | 30.29 | 98.16 | 0.03554 |
+ | IQ2_XXS | 8611 | 8.0187 | 13.78 | 68.83 | 0.05388 |
+ | Q4_1 | 19684 | 5.6586 | 31.49 | 97.53 | 0.03587 |
+ | Q5_K_S | 21590 | 5.568 | 34.54 | 99.12 | 0.03515 |
+ | Q5_0 | 21658 | 5.588 | 34.65 | 98.77 | 0.03538 |
+ | Q5_K_M | 22185 | 5.567 | 35.5 | 99.14 | 0.03515 |
+ | Q5_1 | 23496 | 5.5734 | 37.59 | 99.03 | 0.0352 |
+ | Q6_K | 25641 | 5.5305 | 41.03 | 99.79 | 0.03483 |
+ | Q8_0 | 33208 | 5.5221 | 53.13 | 99.95 | 0.03478 |
+ | F16 | 62500 | 5.5191 | 100 | 100 | 0.03474 |
+
  <hr>
 
  # Qwen2.5-32B-Instruct
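
The commit does not say how the `Size (%)` and `Accuracy (%)` columns are derived. The values in the table are consistent with both being computed relative to the F16 row: size as a fraction of the F16 size, and accuracy as the F16 perplexity divided by the quant's perplexity. The sketch below is an inference from the numbers, not the author's script; the function names are hypothetical.

```python
# Assumption: F16 is the baseline for both derived columns.
# These formulas are inferred from the table values, not taken from the README.
F16_SIZE_MB = 62500
F16_PPL = 5.5191


def size_percent(size_mb: float) -> float:
    """Quant size as a percentage of the F16 size, rounded to 2 decimals."""
    return round(size_mb / F16_SIZE_MB * 100, 2)


def accuracy_percent(ppl: float) -> float:
    """F16 perplexity divided by the quant's perplexity, as a percentage."""
    return round(F16_PPL / ppl * 100, 2)


# (size in MB, PPL) copied from three rows of the table above.
rows = {
    "IQ1_S": (6938, 12.2991),
    "Q4_K_M": (18932, 5.6224),
    "Q8_0": (33208, 5.5221),
}

for name, (size_mb, ppl) in rows.items():
    print(name, size_percent(size_mb), accuracy_percent(ppl))
```

For these three rows the computed values match the table's `Size (%)` and `Accuracy (%)` entries (for example 11.1 and 44.87 for IQ1_S).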