verymuchawful committed
Commit 3bb4765 · 1 Parent(s): 48e8b7a
Update README.md
README.md CHANGED
@@ -1,4 +1,6 @@
 ---
 inference: false
 ---
-GGML conversion of https://huggingface.co/digitous/Alpacino13b using https://github.com/ggerganov/llama.cpp/pull/896. (Edited to write the model with ftype 2 so it won't be incorrectly identified as 4 - mostly q4_1 some f16.)
+GGML conversion of https://huggingface.co/digitous/Alpacino13b using https://github.com/ggerganov/llama.cpp/pull/896. (Edited to write the model with ftype 2 so it won't be incorrectly identified as 4 - mostly q4_1 some f16.)
+
+GPTQ(cuda) quantization available here: https://huggingface.co/gozfarb/alpacino-13b-4bit-128g