EricB's picture
EricB HF staff
Create README.md
9d03501 verified
|
raw
history blame
1.68 kB
metadata
tags:
  - uqff
  - mistral.rs
base_model: mistralai/Mistral-7B-Instruct-v0.3
base_model_relation: quantized

mistralai/Mistral-7B-Instruct-v0.3, UQFF quantization

Run with mistral.rs. Documentation: UQFF docs.

  1. Flexible ๐ŸŒ€: Multiple quantization formats in one file format with one framework to run them all.
  2. Reliable ๐Ÿ”’: Compatibility ensured with embedded and checked semantic versioning information from day 1.
  3. Easy ๐Ÿค—: Download UQFF models easily and quickly from Hugging Face, or use a local file.
  4. Customizable ๐Ÿ› ๏ธ: Make and publish your own UQFF files in minutes.

Files

Quantization type(s) Example
FP8 ./mistralrs-server -i plain -m EricB/Mistral-7B-Instruct-v0.3-UQFF --from-uqff mistral0.3-7b-instruct-f8e4m3.uqff
HQQ4 ./mistralrs-server -i plain -m EricB/Mistral-7B-Instruct-v0.3-UQFF --from-uqff mistral0.3-7b-instruct-hqq4.uqff
HQQ8 ./mistralrs-server -i plain -m EricB/Mistral-7B-Instruct-v0.3-UQFF --from-uqff mistral0.3-7b-instruct-hqq8.uqff
Q3K ./mistralrs-server -i plain -m EricB/Mistral-7B-Instruct-v0.3-UQFF --from-uqff mistral0.3-7b-instruct-q3k.uqff
Q4K ./mistralrs-server -i plain -m EricB/Mistral-7B-Instruct-v0.3-UQFF --from-uqff mistral0.3-7b-instruct-q4k.uqff
Q5K ./mistralrs-server -i plain -m EricB/Mistral-7B-Instruct-v0.3-UQFF --from-uqff mistral0.3-7b-instruct-q5k.uqff
Q8_0 ./mistralrs-server -i plain -m EricB/Mistral-7B-Instruct-v0.3-UQFF --from-uqff mistral0.3-7b-instruct-q8_0.uqff