Update README.md
README.md
CHANGED
@@ -41,14 +41,6 @@ I have the following Vicuna 1.1 repositories available:
 * [GPTQ quantized 4bit 7B 1.1 for GPU - `safetensors` and `pt` formats](https://huggingface.co/TheBloke/vicuna-7B-1.1-GPTQ-4bit-128g)
 * [2, 3, 4, 5, 6 and 8-bit GGML models for CPU inference](https://huggingface.co/TheBloke/vicuna-7B-1.1-GGML)
 
-**GGMLs for CPU inference**
-
-I removed the GGMLs I originally made for Vicuna 1.1 because they were directly converted GPTQ -> GGML and this seemed to give poor results
-
-Instead I recommend you use eachadea's GGMLs:
-* [eachadea's Vicuna 13B 1.1 GGML format for `llama.cpp`](https://huggingface.co/eachadea/ggml-vicuna-13b-1.1)
-* [eachadea's Vicuna 7B 1.1 GGML format for `llama.cpp`](https://huggingface.co/eachadea/ggml-vicuna-7b-1.1)
-
 ## How to easily download and use this model in text-generation-webui
 
 Open the text-generation-webui UI as normal.
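The GGML links kept in this README are intended for CPU inference with `llama.cpp`. As a minimal sketch of pulling one of those files and running it through the llama-cpp-python bindings: the exact filename and the prompt template below are assumptions (check the model repo's file list), and legacy GGML files require a llama-cpp-python release from the same era that still reads that format.

```python
from huggingface_hub import hf_hub_download
from llama_cpp import Llama

# Fetch one quantized GGML file from the repo linked in the README.
# The filename here is hypothetical -- check the repo's file list.
model_path = hf_hub_download(
    repo_id="TheBloke/vicuna-7B-1.1-GGML",
    filename="vicuna-7b-1.1.ggmlv3.q4_0.bin",  # hypothetical filename
)

# Load the model on CPU and run a short completion.
llm = Llama(model_path=model_path)
output = llm(
    "USER: Write a haiku about llamas. ASSISTANT:",  # assumed Vicuna 1.1-style prompt
    max_tokens=64,
)
print(output["choices"][0]["text"])
```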