EricB HF staff commited on
Commit
3cb8fe7
โ€ข
1 Parent(s): d1a2882

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +30 -0
README.md ADDED
@@ -0,0 +1,30 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ tags:
3
+ - uqff
4
+ - mistral.rs
5
+ base_model: google/gemma-1.1-2b-it
6
+ base_model_relation: quantized
7
+ ---
8
+
9
+ <!-- Autogenerated from user input. -->
10
+
11
+ # `google/gemma-1.1-2b-it`, UQFF quantization
12
+
13
+
14
+ Run with [mistral.rs](https://github.com/EricLBuehler/mistral.rs). Documentation: [UQFF docs](https://github.com/EricLBuehler/mistral.rs/blob/master/docs/UQFF.md).
15
+
16
+ 1) **Flexible** ๐ŸŒ€: Multiple quantization formats in *one* file format with *one* framework to run them all.
17
+ 2) **Reliable** ๐Ÿ”’: Compatibility ensured with *embedded* and *checked* semantic versioning information from day 1.
18
+ 3) **Easy** ๐Ÿค—: Download UQFF models *easily* and *quickly* from Hugging Face, or use a local file.
19
+ 3) **Customizable** ๐Ÿ› ๏ธ: Make and publish your own UQFF files in minutes.
20
+ ## Files
21
+
22
+ |Name|Quantization type(s)|Example|
23
+ |--|--|--|
24
+ |gemma1.1-2b-instruct-f8e4m3.uqff|FP8|`./mistralrs-server -i plain -m EricB/gemma-1.1-2b-it-UQFF --from-uqff EricB/gemma-1.1-2b-it-UQFF/gemma1.1-2b-instruct-f8e4m3.uqff`|
25
+ |gemma1.1-2b-instruct-hqq4.uqff|HQQ4|`./mistralrs-server -i plain -m EricB/gemma-1.1-2b-it-UQFF --from-uqff EricB/gemma-1.1-2b-it-UQFF/gemma1.1-2b-instruct-hqq4.uqff`|
26
+ |gemma1.1-2b-instruct-hqq8.uqff|HQQ8|`./mistralrs-server -i plain -m EricB/gemma-1.1-2b-it-UQFF --from-uqff EricB/gemma-1.1-2b-it-UQFF/gemma1.1-2b-instruct-hqq8.uqff`|
27
+ |gemma1.1-2b-instruct-q3k.uqff|Q3K|`./mistralrs-server -i plain -m EricB/gemma-1.1-2b-it-UQFF --from-uqff EricB/gemma-1.1-2b-it-UQFF/gemma1.1-2b-instruct-q3k.uqff`|
28
+ |gemma1.1-2b-instruct-q4k.uqff|Q4K|`./mistralrs-server -i plain -m EricB/gemma-1.1-2b-it-UQFF --from-uqff EricB/gemma-1.1-2b-it-UQFF/gemma1.1-2b-instruct-q4k.uqff`|
29
+ |gemma1.1-2b-instruct-q5k.uqff|Q5K|`./mistralrs-server -i plain -m EricB/gemma-1.1-2b-it-UQFF --from-uqff EricB/gemma-1.1-2b-it-UQFF/gemma1.1-2b-instruct-q5k.uqff`|
30
+ |gemma1.1-2b-instruct-q8_0.uqff|Q8_0|`./mistralrs-server -i plain -m EricB/gemma-1.1-2b-it-UQFF --from-uqff EricB/gemma-1.1-2b-it-UQFF/gemma1.1-2b-instruct-q8_0.uqff`|