JustinLin610 commited on
Commit
3ade55f
1 Parent(s): ef28d43

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -75,7 +75,7 @@ generated_ids = [
75
  response = tokenizer.batch_decode(generated_ids, skip_special_tokens=True)[0]
76
  ```
77
 
78
- For quantized models, we advise you to use the GPTQ, AWQ, and GGUF correspondents, namely `Qwen1.5-72B-Chat-GPTQ`, `Qwen1.5-72B-Chat-AWQ`, and `Qwen1.5-72B-Chat-GGUF`.
79
 
80
 
81
  ## Tips
 
75
  response = tokenizer.batch_decode(generated_ids, skip_special_tokens=True)[0]
76
  ```
77
 
78
+ For quantized models, we advise you to use the GPTQ, AWQ, and GGUF correspondents, namely `Qwen1.5-72B-Chat-GPTQ-Int8`, `Qwen1.5-72B-Chat-AWQ`, and `Qwen1.5-72B-Chat-GGUF`.
79
 
80
 
81
  ## Tips