Virt-io
/

Google-Colab-Imatrix-GGUF

Model card Files Files and versions Community

Virt-io commited on Apr 18

Commit

cb116cc

•

1 Parent(s): a1e0c07

Update README.md

Files changed (1) hide show

README.md +4 -5

README.md CHANGED Viewed

@@ -4,9 +4,10 @@ tags:
 - GGUF
 ---
-# HELP WANTED
-Does anyone know how to build and host wheels for llama.cpp, but specifically of colab, to avoid wasted time.
 # Details
 [Thanks to mlabonne for the initial code](https://huggingface.co/mlabonne)
@@ -17,6 +18,4 @@ RP Imatrix is from [Lewdiculous](https://huggingface.co/Lewdiculous)
 Host files for a google colab notebook, hoping to make it easier to GGUF models with Imatrix.
-There are two imatrix datasets, one for general use and one for RP.
-After some testing, making the actual quants is really slow, recommended to only use it for the intial FP16 GGUF and imatrix.dat generation.

 - GGUF
 ---
+# Free Tier Colab
+This is only for making the intial FP16 gguf file and computing an imatrix.dat
+Quantizing is too slow on colab due to only having two available cores.
 # Details
 [Thanks to mlabonne for the initial code](https://huggingface.co/mlabonne)
 Host files for a google colab notebook, hoping to make it easier to GGUF models with Imatrix.
+There are two imatrix datasets, one for general use and one for RP.