Update README.md
Browse files
README.md
CHANGED
@@ -83,7 +83,7 @@ Currently, the HuggingFace's Inference Tool UI doesn't properly load the model.
|
|
83 |
|
84 |
## CPU
|
85 |
|
86 |
-
Best performance can be achieved downloading the [GGML 4 bits](https://huggingface.co/webpolis/zenos-gpt-j-6B-instruct-4bit/resolve/main/ggml-f16-q4_0.bin) model and doing inference
|
87 |
|
88 |
### Requirements
|
89 |
|
|
|
83 |
|
84 |
## CPU
|
85 |
|
86 |
+
Best performance can be achieved downloading the [GGML 4 bits](https://huggingface.co/webpolis/zenos-gpt-j-6B-instruct-4bit/resolve/main/ggml-f16-q4_0.bin) model and doing inference using the [rustformers' llm](https://github.com/rustformers/llm) tool.
|
87 |
|
88 |
### Requirements
|
89 |
|