bunnycore
/

Phigments12-Q6_K-GGUF

liminerity/merge6

liminerity/merge3

Inference Endpoints

Model card Files Files and versions Community

bunnycore commited on Apr 6

Commit

5742757

•

1 Parent(s): 8a069dc

Update README.md

Files changed (1) hide show

README.md +16 -2

README.md CHANGED Viewed

@@ -9,8 +9,19 @@ tags:
 ---
 # bunnycore/Phigments12-Q6_K-GGUF
-This model was converted to GGUF format from [`liminerity/Phigments12`](https://huggingface.co/liminerity/Phigments12) using llama.cpp via the ggml.ai's [GGUF-my-repo](https://huggingface.co/spaces/ggml-org/gguf-my-repo) space.
-Refer to the [original model card](https://huggingface.co/liminerity/Phigments12) for more details on the model.
 ## Use with llama.cpp
 Install llama.cpp through brew.
@@ -37,3 +48,6 @@ Note: You can also use this checkpoint directly through the [usage steps](https:
 ```
 git clone https://github.com/ggerganov/llama.cpp &&             cd llama.cpp &&             make &&             ./main -m phigments12.Q6_K.gguf -n 128
 ```

 ---
 # bunnycore/Phigments12-Q6_K-GGUF
+Phigments12-Q6_K-GGUF is a quantized version of the liminerity/Phigments12: https://huggingface.co/liminerity/Phigments12 model. Quantization is a technique that reduces the size and memory footprint of a model, making it efficient to run on devices with limited resources. Phigments12-Q6_K-GGUF packs 2.78 billion parameters, making it a compact model that delivers high performance and decent benchmark results. This efficiency allows you to run the model on low-end laptops, phones, and even PCs without a dedicated GPU.
+Several platforms support running Phigments12-Q6_K-GGUF, including:
+```
+Jan.ai
+LM Studio
+Text Generation Web UI
+```
 ## Use with llama.cpp
 Install llama.cpp through brew.
 ```
 git clone https://github.com/ggerganov/llama.cpp &&             cd llama.cpp &&             make &&             ./main -m phigments12.Q6_K.gguf -n 128
 ```
+This model was converted to GGUF format from [`liminerity/Phigments12`](https://huggingface.co/liminerity/Phigments12) using llama.cpp via the ggml.ai's [GGUF-my-repo](https://huggingface.co/spaces/ggml-org/gguf-my-repo) space.
+Refer to the [original model card](https://huggingface.co/liminerity/Phigments12) for more details on the model.