Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
Qwen
/
Qwen1.5-72B-Chat-AWQ
like
23
Follow
Qwen
1,857
Text Generation
Transformers
Safetensors
English
qwen2
chat
conversational
text-generation-inference
Inference Endpoints
4-bit precision
awq
arxiv:
2309.16609
License:
tongyi-qianwen
Model card
Files
Files and versions
Community
2
Train
Deploy
Use this model
New discussion
New pull request
Resources
PR & discussions documentation
Code of Conduct
Hub documentation
All
Discussions
Pull requests
View closed (0)
Incredibly slow inference speed
1
#2 opened 6 months ago by
famunir
What dataset was used for AWQ quanting?
#1 opened 9 months ago by
RonanMcGovern