Quants
Thanks @TheBloke for doing his thing :)
I'll keep this list updated if GGUFs come along (see https://huggingface.co/augmxnt/shisa-7b-v1/discussions/1 to follow that story; llama.cpp is currently bugged for most BPE tokenizers, so there's no point in quantizing to GGUF yet).
- AWQ: https://huggingface.co/TheBloke/shisa-7B-v1-AWQ
- GPTQ: https://huggingface.co/TheBloke/shisa-7B-v1-GPTQ
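If you want to kick the tires on one of these, here's a minimal sanity check of the AWQ repo via transformers (assumes the autoawq package is installed; the prompt and generation settings are generic placeholders, nothing model-specific):

```python
# Minimal smoke test of the AWQ quant via transformers; requires autoawq.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "TheBloke/shisa-7B-v1-AWQ"
tok = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto")

# Plain completion as a quick sanity check (placeholder prompt)
inputs = tok("こんにちは、", return_tensors="pt").to(model.device)
print(tok.decode(model.generate(**inputs, max_new_tokens=40)[0], skip_special_tokens=True))
```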
It looks like @mmnga was able to get GGUF conversion working with a custom_shisa.py conversion script that merges the extra BPE tokens into the SPM tokenizer. Seems to run great, thanks!
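For anyone curious, the core idea is something like the sketch below. This is my rough reading of the approach, not @mmnga's actual script; the paths and token list are placeholders. The extra vocab entries get appended to the base SentencePiece model as user-defined pieces, so the conversion can go through llama.cpp's (working) SPM path instead of the buggy BPE path:

```python
# Rough sketch (not @mmnga's actual script): append the extended vocab
# entries to the base SentencePiece model as USER_DEFINED pieces.
from sentencepiece import sentencepiece_model_pb2 as sp_pb2

m = sp_pb2.ModelProto()
with open("tokenizer.model", "rb") as f:       # base SPM model (assumed path)
    m.ParseFromString(f.read())

extra_tokens = ["<EXTRA_1>", "<EXTRA_2>"]      # placeholder for the extended vocab entries
for tok in extra_tokens:
    piece = m.pieces.add()
    piece.piece = tok
    piece.score = 0.0
    piece.type = sp_pb2.ModelProto.SentencePiece.USER_DEFINED

with open("tokenizer_merged.model", "wb") as f:  # merged model for GGUF conversion
    f.write(m.SerializeToString())
```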
If anyone does their own quants (EXL2, etc.), feel free to post them here.
While clicking around, I noticed that @LoneStriker made some EXL2 quants (thanks!):
- https://huggingface.co/LoneStriker/shisa-7b-v1-8.0bpw-h8-exl2
- https://huggingface.co/LoneStriker/shisa-7b-v1-6.0bpw-h6-exl2
- https://huggingface.co/LoneStriker/shisa-7b-v1-5.0bpw-h6-exl2
- https://huggingface.co/LoneStriker/shisa-7b-v1-4.0bpw-h6-exl2
- https://huggingface.co/LoneStriker/shisa-7b-v1-3.0bpw-h6-exl2
Also of interest: while doing inference benchmarking, I created an H6 4.63bpw EXL2 quant to match the BPW of @mmnga's q4_K_M GGUF: https://huggingface.co/augmxnt/shisa-7b-v1-exl2-h6-4.63bpw
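In case it's useful for anyone making their own matched quants, here's the back-of-the-envelope way to pick the target: effective BPW = file size in bits / parameter count. The path and parameter count below are illustrative assumptions, not measured values:

```python
# Estimate the effective bits-per-weight of a quantized file so you can
# pick a matching EXL2 target bitrate. Values below are illustrative.
import os

N_PARAMS = 7_960_000_000              # assumed param count (7B base + extended vocab), not measured
GGUF_PATH = "shisa-7b-v1.q4_K_M.gguf" # hypothetical local file

bpw = os.path.getsize(GGUF_PATH) * 8 / N_PARAMS
print(f"effective BPW: {bpw:.2f}")    # use this as the target bitrate for the EXL2 conversion
```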