How to fine-tune and quantize the Qwen1.5 model with GGUF

by huntz47 - opened Apr 7

Apr 7

I am new here. I tried fine-tuning the Qwen model and quantized it using LLaMA-Factory and llama.cpp. But when I try to run the GGUF file after quantizing, I get an error about a missing output.weight tensor.

jklj077

Qwen org Apr 18

This only happens with the 0.5B models, which use tied word embeddings.
A fix has been merged: https://github.com/ggerganov/llama.cpp/pull/6738
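
For context, with tied word embeddings the output projection reuses the token embedding matrix, so a converted file may contain no separate output.weight tensor at all. Below is a minimal Python sketch of the kind of fallback a loader can apply in that case. It is an illustration only, not the actual llama.cpp code; the function name resolve_output_tensor is hypothetical, and the tensor names are assumed to follow the usual GGUF naming (token_embd.weight, output.weight).

```python
def resolve_output_tensor(tensor_names):
    """Pick the tensor to use as the output projection (hypothetical helper).

    Assumes GGUF-style names: "output.weight" for an untied output layer,
    "token_embd.weight" for the token embedding matrix.
    """
    names = set(tensor_names)
    if "output.weight" in names:
        # Untied model: a dedicated output projection is stored in the file.
        return "output.weight"
    if "token_embd.weight" in names:
        # Tied model: fall back to the embedding matrix.
        return "token_embd.weight"
    raise KeyError("neither output.weight nor token_embd.weight is present")


if __name__ == "__main__":
    # Tensor list as it might look for a tied-embedding model.
    tied = ["token_embd.weight", "blk.0.attn_q.weight", "output_norm.weight"]
    print(resolve_output_tensor(tied))
```

A loader without such a fallback would fail on the tied list above, which matches the missing output.weight error reported in this thread.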

jklj077 changed discussion status to closed Apr 18
