---
tags:
- uqff
- mistral.rs
base_model: google/gemma-1.1-2b-it
base_model_relation: quantized
---

# google/gemma-1.1-2b-it, UQFF quantization
Run with [mistral.rs](https://github.com/EricLBuehler/mistral.rs). Documentation: UQFF docs.
- Flexible 🌀: Multiple quantization formats in one file format with one framework to run them all.
- Reliable 🔒: Compatibility ensured with embedded and checked semantic versioning information from day 1.
- Easy 🤗: Download UQFF models easily and quickly from Hugging Face, or use a local file.
- Customizable 🛠️: Make and publish your own UQFF files in minutes (a sketch follows this list).
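
For reference, a UQFF file like the ones below can be produced with mistral.rs itself by loading the original model with in-situ quantization (ISQ) and writing the result out. This is a minimal sketch based on the UQFF docs; treat the `--isq` and `--write-uqff` flags and the output filename as assumptions to verify against your mistral.rs version.

```bash
# Sketch: quantize google/gemma-1.1-2b-it with ISQ (here Q4K) and save it as a
# UQFF file. --isq and --write-uqff are described in the UQFF docs; check your
# mistral.rs version for the exact flag names.
./mistralrs-server --isq Q4K -i plain -m google/gemma-1.1-2b-it \
  --write-uqff gemma1.1-2b-instruct-q4k.uqff
```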
## Files
| Name | Quantization type(s) | Example |
|---|---|---|
| `gemma1.1-2b-instruct-f8e4m3.uqff` | FP8 | `./mistralrs-server -i plain -m EricB/gemma-1.1-2b-it-UQFF --from-uqff EricB/gemma-1.1-2b-it-UQFF/gemma1.1-2b-instruct-f8e4m3.uqff` |
| `gemma1.1-2b-instruct-hqq4.uqff` | HQQ4 | `./mistralrs-server -i plain -m EricB/gemma-1.1-2b-it-UQFF --from-uqff EricB/gemma-1.1-2b-it-UQFF/gemma1.1-2b-instruct-hqq4.uqff` |
| `gemma1.1-2b-instruct-hqq8.uqff` | HQQ8 | `./mistralrs-server -i plain -m EricB/gemma-1.1-2b-it-UQFF --from-uqff EricB/gemma-1.1-2b-it-UQFF/gemma1.1-2b-instruct-hqq8.uqff` |
| `gemma1.1-2b-instruct-q3k.uqff` | Q3K | `./mistralrs-server -i plain -m EricB/gemma-1.1-2b-it-UQFF --from-uqff EricB/gemma-1.1-2b-it-UQFF/gemma1.1-2b-instruct-q3k.uqff` |
| `gemma1.1-2b-instruct-q4k.uqff` | Q4K | `./mistralrs-server -i plain -m EricB/gemma-1.1-2b-it-UQFF --from-uqff EricB/gemma-1.1-2b-it-UQFF/gemma1.1-2b-instruct-q4k.uqff` |
| `gemma1.1-2b-instruct-q5k.uqff` | Q5K | `./mistralrs-server -i plain -m EricB/gemma-1.1-2b-it-UQFF --from-uqff EricB/gemma-1.1-2b-it-UQFF/gemma1.1-2b-instruct-q5k.uqff` |
| `gemma1.1-2b-instruct-q8_0.uqff` | Q8_0 | `./mistralrs-server -i plain -m EricB/gemma-1.1-2b-it-UQFF --from-uqff EricB/gemma-1.1-2b-it-UQFF/gemma1.1-2b-instruct-q8_0.uqff` |
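
The commands above start an interactive chat session (`-i`). The same model can also be served over mistral.rs's OpenAI-compatible HTTP API; the sketch below assumes the server is started with `--port 1234` and uses the Q4K file (any of the files above works the same way), and the `model` value in the request is illustrative.

```bash
# Serve the Q4K UQFF file over the OpenAI-compatible HTTP API
# (port 1234 is an arbitrary choice for this example).
./mistralrs-server --port 1234 plain -m EricB/gemma-1.1-2b-it-UQFF \
  --from-uqff EricB/gemma-1.1-2b-it-UQFF/gemma1.1-2b-instruct-q4k.uqff

# Query it with any OpenAI-compatible client, e.g. curl:
curl http://localhost:1234/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
        "model": "EricB/gemma-1.1-2b-it-UQFF",
        "messages": [{"role": "user", "content": "What is UQFF?"}]
      }'
```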