Can you help upload ollama download, currently directly merge to ollama is not available

#9
by windkkk - opened

Can you help upload ollama download, currently directly merge to ollama is not available

CausalLM org

PLEASE READ THE MODEL CARD FIRST!

GGUF (Text-Only, not recommended): There is a significant degradation, even with the F16.

GGUF for text-only should be working after PR #9194 was merged.

As GGUF implementation is not recommended at the moment, there won't be an ollama download. Please do use transformers or ChatGLM.cpp, and any quantization lower than 8-bit is not recommended.

JosephusCheung changed discussion status to closed

Sign up or log in to comment