Bitwidth for LM Head

by denru - opened Aug 8

denru

Aug 8

What is the bitwidth used to quantize the LM head? Thanks!

Owner Aug 9

It's the default, 6 bpw (bits per weight). If in doubt, check the "quantization_config" key in config.json, specifically "head_bits".
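
For anyone who wants to script that check, here is a minimal sketch in Python. It assumes the model's config.json is already downloaded locally. The function name `read_head_bits` and the file path are illustrative only and not part of any library. The two key names are the ones mentioned above.

```python
import json
from pathlib import Path
from typing import Optional


def read_head_bits(config_path: str) -> Optional[int]:
    """Return quantization_config.head_bits from a config.json, or None if absent."""
    config = json.loads(Path(config_path).read_text(encoding="utf-8"))
    # "quantization_config" may be missing or null on an unquantized model,
    # so fall back to an empty dict before looking up "head_bits".
    quant_config = config.get("quantization_config") or {}
    return quant_config.get("head_bits")


if __name__ == "__main__":
    # Hypothetical path: point this at the config.json of the downloaded model.
    path = Path("config.json")
    if path.exists():
        print("head_bits:", read_head_bits(str(path)))
```

A return value of `None` means the key was not present in that file, not that the head is unquantized.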
