Qwen2.5-72b-BNK-FP8 / recipe.yaml
Imran1's picture
Upload folder using huggingface_hub
030c3a4 verified
raw
history blame
134 Bytes
DEFAULT_stage:
DEFAULT_modifiers:
QuantizationModifier:
ignore: [lm_head]
targets: Linear
scheme: FP8_DYNAMIC