R136a1 committed on
Commit
fd8c2ba
1 Parent(s): 16da046

Update README.md

  1. README.md +2 -2
README.md CHANGED
@@ -12,13 +12,13 @@ language:
 
 ## Model details
 
-Quantized at 3.23bpw with hb 8
+Quantized at 3.18bpw with hb 6. Can run full 4K context on 16GB VRAM
 
 Perplexity:
 
 Base = 6.5820
 
-3.23 h8 = 6.6758
+3.18 h6 = 6.6928
 
 Dataset = [wikitext](https://huggingface.co/datasets/wikitext/resolve/refs%2Fconvert%2Fparquet/wikitext-2-v1/test/0000.parquet)
 
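
To put the perplexity change in context, the figures quoted in this diff can be compared against the base value. The snippet below is a minimal sketch, not part of the README: the function name `relative_increase` is hypothetical, and the only inputs are the three perplexity numbers that appear above.

```python
# Perplexity values copied from the diff above.
BASE_PPL = 6.5820
OLD_QUANT_PPL = 6.6758  # 3.23bpw, hb 8 (removed line)
NEW_QUANT_PPL = 6.6928  # 3.18bpw, hb 6 (added line)


def relative_increase(base: float, quantized: float) -> float:
    """Return the fractional perplexity increase of `quantized` over `base`."""
    return (quantized - base) / base


if __name__ == "__main__":
    for label, ppl in [("3.23 h8", OLD_QUANT_PPL), ("3.18 h6", NEW_QUANT_PPL)]:
        pct = relative_increase(BASE_PPL, ppl) * 100
        print(f"{label}: {ppl:.4f} ({pct:.2f}% above base)")
```

By this measure the new quantization sits roughly 1.7% above the base perplexity, compared with roughly 1.4% for the previous one, so the smaller footprint costs only a small additional loss on this dataset.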