Text Generation
Transformers
Safetensors
llama
Not-For-All-Audiences
nsfw
text-generation-inference
4-bit precision
awq
{ | |
"zero_point": true, | |
"q_group_size": 128, | |
"w_bit": 4, | |
"version": "GEMM" | |
} |