#4: demo space (opened over 1 year ago by matthoffner, 2 comments)
#3: Looks like the starchat-alpha-ggml-q4_1.bin is broken (opened over 1 year ago by xhyi, 8 comments)
#2: Which inference repo is this quantized for? (opened over 1 year ago by xhyi, 3 comments)
#1: Can the quantized model be loaded on GPU to have faster inference? (opened over 1 year ago by MohamedRashad, 6 comments)