latest llama.cpp using q5_0, q4_1 error: "is this really a GGML file?"

by Free-Radical - opened May 26, 2023

Discussion

Free-Radical

May 26, 2023

I am using latest llama.cpp (master-66874d4) and it only works for:

ggml-vicuna-13b-4bit.bin

but NOT for:

stable-vicuna-13B.ggmlv3.q5_0.bin
stable-vicuna-13B.ggmlv3.q4_1.bin

TheBloke

Owner May 26, 2023

Please check the sha256sum for the stable-vicuna-13B.ggmlv3.q5_0 and q4_1 files, or if in doubt download them again.

Those two files definitely work with the latest llama.cpp, so you likely have incomplete/corrupted downloads. (In fact the q5_0 file would work with older llama.cpp as well - only q4_0, q4_1 and q8_0 require the latest llama.cpp)

I just re-downloaded and tested q5_0 with llama.cpp compiled today and confirmed it works OK.

Free-Radical

May 26, 2023

Ok , MY BAD, 🫢 very embarrassed, i forgot to recompile. YES THEY WORK!

Free-Radical changed discussion status to closed May 26, 2023

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment