[llama.cpp PR#6844] Custom Quantizations
I think it might be worth exploring to find a good balance between quality and speed.
I am currently experimenting with the config below:
```
# Used for everything not specified below.
ftype=IQ4_NL
token_embd.weight=Q8_0
output.weight=Q8_0
# These are quite small, keeping them in a higher quantization to help with context.
blk.*.attn_output.weight=F16
blk.*.attn_?.weight=F16
```
Edit: It seems the config above comes out to 6.95 BPW, so I will try reducing it. It is pretty fast, though.
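For anyone wondering how such a config resolves per tensor, here is a minimal sketch (not llama.cpp code; the helper names and the exact matching semantics are assumptions based on the key=value patterns above) that maps a tensor name to a quant type using glob rules, with ftype as the fallback:

```python
# Minimal sketch: resolve per-tensor quantization overrides from a
# key=value config like the one above. Not llama.cpp code; helper
# names and matching semantics are assumptions for illustration.
import fnmatch

def load_overrides(path):
    """Parse 'pattern=TYPE' lines, skipping comments and blanks."""
    default, rules = None, []
    with open(path) as f:
        for line in f:
            line = line.strip()
            if not line or line.startswith("#"):
                continue
            key, _, value = line.partition("=")
            if key == "ftype":
                default = value             # fallback type for everything else
            else:
                rules.append((key, value))  # glob pattern -> quant type
    return default, rules

def pick_type(tensor_name, default, rules):
    """First matching pattern wins; otherwise fall back to ftype."""
    for pattern, qtype in rules:
        if fnmatch.fnmatch(tensor_name, pattern):
            return qtype
    return default

# With the config above: attention tensors stay F16, embeddings and
# output stay Q8_0, everything else falls back to IQ4_NL.
default, rules = load_overrides("quant_overrides.txt")
print(pick_type("blk.0.attn_q.weight", default, rules))    # F16
print(pick_type("blk.0.ffn_down.weight", default, rules))  # IQ4_NL
```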
I never played with customizing layers as such.
I updated llama.cpp but still only get degraded quants. Are there any tutorials or something similar for using llama.cpp with Llama 3 models? I only know the convert.py method (python convert.py ./models/myllama3merge --vocab-type bpe).
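Written out, that flow looks roughly like the sketch below (a sketch only, with placeholder paths; flag and binary names may differ between llama.cpp versions, so double-check against your build):

```python
# Sketch of the two-step flow: convert the HF model to a GGUF file,
# then quantize it. Paths and the quantize binary name are
# placeholders; adjust for your own llama.cpp checkout.
import subprocess

model_dir = "./models/myllama3merge"                   # HF-format merge
f16_gguf = "./models/myllama3merge-f16.gguf"
out_gguf = "./models/myllama3merge-Q4_K_M.gguf"

# Step 1: convert to GGUF (BPE vocab for Llama 3).
subprocess.run(
    ["python", "convert.py", model_dir, "--vocab-type", "bpe",
     "--outfile", f16_gguf],
    check=True,
)

# Step 2: quantize the converted file (positional args: in, out, type).
subprocess.run(["./quantize", f16_gguf, out_gguf, "Q4_K_M"], check=True)
```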
@WesPro I added a notice about that in the GGUF script page.
https://huggingface.co/FantasiaFoundry/GGUF-Quantization-Script
More context:
https://huggingface.co/FantasiaFoundry/GGUF-Quantization-Script/discussions/27#66361fceccadfaaeacb0cdb5
Related Discussion:
https://huggingface.co/FantasiaFoundry/GGUF-Quantization-Script/discussions/26#66317fb30a84b77d96b0c4e6
Thanks, I figured it out now... this helped me more than reading the whole issue thread on GitHub ;)
That's why we're here <3