stan-hua's picture
Push folder to HuggingFace Hub
ee83c1d verified
raw
history blame
179 Bytes
DEFAULT_stage:
DEFAULT_modifiers:
SmoothQuantModifier: {smoothing_strength: 0.8}
QuantizationModifier:
ignore: [lm_head]
targets: Linear
scheme: W8A16