webpolis
/

zenos-gpt-j-6B-instruct-4bit

Text Generation

Inference Endpoints

4-bit precision

Model card Files Files and versions Community

webpolis commited on Oct 19, 2023

Commit

b6cdc0d

•

1 Parent(s): 884f45b

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -83,7 +83,7 @@ Currently, the HuggingFace's Inference Tool UI doesn't properly load the model.
 ## CPU
-Best performance can be achieved downloading the [GGML 4 bits](https://huggingface.co/webpolis/zenos-gpt-j-6B-instruct-4bit/resolve/main/ggml-f16-q4_0.bin) model and doing inference with the [rustformers' llm](https://github.com/rustformers/llm) tool.
 ### Requirements

 ## CPU
+Best performance can be achieved downloading the [GGML 4 bits](https://huggingface.co/webpolis/zenos-gpt-j-6B-instruct-4bit/resolve/main/ggml-f16-q4_0.bin) model and doing inference using the [rustformers' llm](https://github.com/rustformers/llm) tool.
 ### Requirements