Update README.md
Browse files
README.md
CHANGED
@@ -17,14 +17,15 @@ Run with [mistral.rs](https://github.com/EricLBuehler/mistral.rs). Documentation
|
|
17 |
2) **Reliable** π: Compatibility ensured with *embedded* and *checked* semantic versioning information from day 1.
|
18 |
3) **Easy** π€: Download UQFF models *easily* and *quickly* from Hugging Face, or use a local file.
|
19 |
3) **Customizable** π οΈ: Make and publish your own UQFF files in minutes.
|
20 |
-
|
|
|
21 |
|
22 |
|Quantization type(s)|Example|
|
23 |
|--|--|
|
24 |
-
|FP8|`./mistralrs-server -i plain -m EricB/Llama-3.2-11B-Vision-Instruct-UQFF --from-uqff llam3.2-vision-instruct-f8e4m3.uqff`|
|
25 |
-
|HQQ4|`./mistralrs-server -i plain -m EricB/Llama-3.2-11B-Vision-Instruct-UQFF --from-uqff llam3.2-vision-instruct-hqq4.uqff`|
|
26 |
-
|HQQ8|`./mistralrs-server -i plain -m EricB/Llama-3.2-11B-Vision-Instruct-UQFF --from-uqff llam3.2-vision-instruct-hqq8.uqff`|
|
27 |
-
|Q3K|`./mistralrs-server -i plain -m EricB/Llama-3.2-11B-Vision-Instruct-UQFF --from-uqff llam3.2-vision-instruct-q3k.uqff`|
|
28 |
-
|Q4K|`./mistralrs-server -i plain -m EricB/Llama-3.2-11B-Vision-Instruct-UQFF --from-uqff llam3.2-vision-instruct-q4k.uqff`|
|
29 |
-
|Q5K|`./mistralrs-server -i plain -m EricB/Llama-3.2-11B-Vision-Instruct-UQFF --from-uqff llam3.2-vision-instruct-q5k.uqff`|
|
30 |
-
|Q8_0|`./mistralrs-server -i plain -m EricB/Llama-3.2-11B-Vision-Instruct-UQFF --from-uqff llam3.2-vision-instruct-q8_0.uqff`|
|
|
|
17 |
2) **Reliable** π: Compatibility ensured with *embedded* and *checked* semantic versioning information from day 1.
|
18 |
3) **Easy** π€: Download UQFF models *easily* and *quickly* from Hugging Face, or use a local file.
|
19 |
3) **Customizable** π οΈ: Make and publish your own UQFF files in minutes.
|
20 |
+
|
21 |
+
## Examples
|
22 |
|
23 |
|Quantization type(s)|Example|
|
24 |
|--|--|
|
25 |
+
|FP8|`./mistralrs-server -i vision-plain -m EricB/Llama-3.2-11B-Vision-Instruct-UQFF -a vllama --from-uqff llam3.2-vision-instruct-f8e4m3.uqff`|
|
26 |
+
|HQQ4|`./mistralrs-server -i vision-plain -m EricB/Llama-3.2-11B-Vision-Instruct-UQFF -a vllama --from-uqff llam3.2-vision-instruct-hqq4.uqff`|
|
27 |
+
|HQQ8|`./mistralrs-server -i vision-plain -m EricB/Llama-3.2-11B-Vision-Instruct-UQFF -a vllama --from-uqff llam3.2-vision-instruct-hqq8.uqff`|
|
28 |
+
|Q3K|`./mistralrs-server -i vision-plain -m EricB/Llama-3.2-11B-Vision-Instruct-UQFF -a vllama --from-uqff llam3.2-vision-instruct-q3k.uqff`|
|
29 |
+
|Q4K|`./mistralrs-server -i vision-plain -m EricB/Llama-3.2-11B-Vision-Instruct-UQFF -a vllama --from-uqff llam3.2-vision-instruct-q4k.uqff`|
|
30 |
+
|Q5K|`./mistralrs-server -i vision-plain -m EricB/Llama-3.2-11B-Vision-Instruct-UQFF -a vllama --from-uqff llam3.2-vision-instruct-q5k.uqff`|
|
31 |
+
|Q8_0|`./mistralrs-server -i vision-plain -m EricB/Llama-3.2-11B-Vision-Instruct-UQFF -a vllama --from-uqff llam3.2-vision-instruct-q8_0.uqff`|
|