latimar commited on
Commit
487b963
1 Parent(s): 883600b

Add PPL data

Browse files
Files changed (1) hide show
  1. README.md +14 -0
README.md CHANGED
@@ -19,3 +19,17 @@ There are the following branches:
19
  2_75-bpw-h6
20
  2_55-bpw-h6
21
  ```
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
19
  2_75-bpw-h6
20
  2_55-bpw-h6
21
  ```
22
+
23
+ * Calibration dataset used for conversion: [wikitext-v2](https://huggingface.co/datasets/wikitext/blob/refs%2Fconvert%2Fparquet/wikitext-2-v1/test/0000.parquet)
24
+ * Evaluation dataset used to calculate perplexity: [wikitext-v2](https://huggingface.co/datasets/wikitext/blob/refs%2Fconvert%2Fparquet/wikitext-2-v1/validation/0000.parquet)
25
+ * PPL max seq. length used: 1792 (2048 with 5.0-bpw-h8 causes OOM on RTX 4090 when evaluating ppl, so had to go down a bit)
26
+
27
+ ----------------------------------------------
28
+ | BPW | Perplexity | File Size (Gb) |
29
+ ---------------------------------------------|
30
+ | 2.55-h6 | 15.0901 | 10.56 |
31
+ | 2.75-h6 | 13.6153 | 11.33 |
32
+ | 4.125-h6 | 6.8095 | 16.65 |
33
+ | 4.625-h6 | 6.7992 | 18.58 |
34
+ | 5.0-h8 | 6.7785 | 20.09 |
35
+ ----------------------------------------------