zatochu
/

Alpacino-13b-ggml

Model card Files Files and versions Community

verymuchawful commited on Apr 16, 2023

Commit

3bb4765

•

1 Parent(s): 48e8b7a

Update README.md

Files changed (1) hide show

README.md +3 -1

README.md CHANGED Viewed

@@ -1,4 +1,6 @@
 ---
 inference: false
 ---
-GGML conversion of https://huggingface.co/digitous/Alpacino13b using https://github.com/ggerganov/llama.cpp/pull/896. (Edited to write the model with ftype 2 so it won't be incorrectly identified as 4 - mostly q4_1 some f16.)

 ---
 inference: false
 ---
+GGML conversion of https://huggingface.co/digitous/Alpacino13b using https://github.com/ggerganov/llama.cpp/pull/896. (Edited to write the model with ftype 2 so it won't be incorrectly identified as 4 - mostly q4_1 some f16.)
+GPTQ(cuda) quantization available here: https://huggingface.co/gozfarb/alpacino-13b-4bit-128g