Quantise to support llama.cpp
#1 opened by TusharRay
Can this be quantised to support https://github.com/ggerganov/llama.cpp? llama.cpp is really performant, and this model could then be widely used across multiple platforms!
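For reference, a minimal sketch of the usual llama.cpp quantisation workflow: convert the Hugging Face checkpoint to GGUF, then quantise it with llama.cpp's quantize tool. The script and binary names (`convert_hf_to_gguf.py`, `llama-quantize`) and all paths below are assumptions and vary between llama.cpp versions (older releases used `convert.py` and `quantize`):

```python
# Sketch only: convert a Hugging Face checkpoint to GGUF and quantise it
# for llama.cpp. Script/binary names and paths are assumptions and may
# differ between llama.cpp versions.
import subprocess

MODEL_DIR = "path/to/this-model"    # local clone of the HF checkpoint (placeholder)
F16_GGUF = "model-f16.gguf"         # full-precision GGUF produced by the converter
Q4_GGUF = "model-q4_k_m.gguf"       # quantised output usable by llama.cpp

# 1. Convert the HF weights to a GGUF file (run from a llama.cpp checkout).
subprocess.run(
    ["python", "convert_hf_to_gguf.py", MODEL_DIR,
     "--outfile", F16_GGUF, "--outtype", "f16"],
    check=True,
)

# 2. Quantise the GGUF file, e.g. to Q4_K_M, with llama.cpp's quantize tool.
subprocess.run(["./llama-quantize", F16_GGUF, Q4_GGUF, "Q4_K_M"], check=True)
```

The resulting `.gguf` file could then be loaded directly by llama.cpp (or bindings built on it) on the various platforms it supports.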