
8bit version of the model

#8
by varun500 - opened
No description provided.
varun500 changed pull request title from *bit version of the model to 8bit version of the model

An 8bit version of the model would be helpful, as it could be loaded in 16GB of GPU VRAM
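As a rough sanity check on that VRAM figure (an approximation added here, not from the thread, and ignoring activations and overhead), a 13B-parameter model stored at 8-bit precision needs about one byte per weight:

```python
# Back-of-envelope VRAM estimate for a 13B model at 8-bit (int8) precision.
# Overhead (activations, KV cache, CUDA context) is ignored in this sketch.
params = 13_000_000_000          # ~13B parameters
bytes_per_param = 1              # int8 = 1 byte per weight
weights_gib = params * bytes_per_param / 2**30
print(f"~{weights_gib:.1f} GiB of weights")  # ≈ 12.1 GiB, so 16 GB VRAM is plausible
```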

  1. This is a 4bit GPTQ model. I could make an 8bit GPTQ, but there's no point because HF models can already be loaded in 8bit using bitsandbytes
  2. If you want 8bit, please use https://huggingface.co/TheBloke/stable-vicuna-13B-HF and specify load_in_8bit=True, as I told you on GitHub
TheBloke changed pull request status to closed

Sure, will do that
