Qwen/Qwen-1_8B-Chat-Int8

Text Generation

8-bit precision


Model does not run in Google Colab Free Tier

#2 opened 6 months ago by sanjeev-bhandari01

ValueError: QWenLMHeadModel does not support Flash Attention 2.0 yet.

#1 opened 8 months ago by sanjeev-bhandari01