TheBloke/Guanaco-33B-SuperHOT-8K-GPTQ

TheBloke committed on Jun 26, 2023

Commit 5d3c877 • 1 Parent(s): 9cf7dc4

Update README.md

Files changed (1)

README.md +3 -1

@@ -34,7 +34,9 @@ Please read carefully below to see how to use it.
 ## Repositories available
 * [4-bit GPTQ models for GPU inference](https://huggingface.co/TheBloke/Guanaco-33B-SuperHOT-8K-GPTQ)
-* [Unquantised fp16 model in pytorch format, for GPU inference and for further conversions](https://huggingface.co/Panchovix/Guanaco-33B-SuperHOT-8k)
+* [Panchovix's unquantised fp16 model in pytorch format, for GPU inference and for further conversions](https://huggingface.co/Panchovix/Guanaco-33B-SuperHOT-8k)
+GGML quants are not yet provided, as there is not yet support for SuperHOT in llama.cpp. This is being investigated and will hopefully come soon.
 ## How to easily download and use this model in text-generation-webui