keyfan/Qwen-72B-Chat-2bit
Text Generation
Transformers
PyTorch
qwen
custom_code
QUiP
License: qianwen
Discussions
Quantization device
3
#3 opened 9 months ago by tiantian7777
Is there a big performance difference between 2-bit and 4-bit quantization in conversations?
1
#2 opened 11 months ago by xldistance