Text Generation
Transformers
Safetensors
starcoder2
code
Eval Results
text-generation-inference
Inference Endpoints
compressed-tensors
File size: 174 Bytes
7ad49b1
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
quant_stage:
  quant_modifiers:
    GPTQModifier:
      sequential_update: false
      dampening_frac: 0.01
      ignore: [lm_head]
      scheme: W8A16
      targets: Linear