Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
Qwen
/
Qwen-1_8B-Chat-Int8
like
4
Follow
Qwen
1,848
Text Generation
Transformers
Safetensors
Chinese
English
qwen
custom_code
8-bit precision
gptq
arxiv:
2309.16609
arxiv:
2305.08322
arxiv:
2009.03300
Model card
Files
Files and versions
Community
2
Train
Use this model
New discussion
New pull request
Resources
PR & discussions documentation
Code of Conduct
Hub documentation
All
Discussions
Pull requests
View closed (0)
Model doesnot run in Google Colab Free Tier
#2 opened 6 months ago by
sanjeev-bhandari01
ValueError: QWenLMHeadModel does not support Flash Attention 2.0 yet.
2
#1 opened 8 months ago by
sanjeev-bhandari01