Qwen
/

Text Generation
Transformers
Safetensors
Chinese
English
qwen
custom_code

Training loss?

#2
by borgr - opened

Hey, do you have the training checkpoints/loss/evaluations/other-ablations-tried to share?
Small note, you don't state which languages the model is expected to perform well at which people would look for.

Oh god, HF? I knew about arXiv... Guess I need to devise a new way...

Hey, do you have the training checkpoints/loss/evaluations/other-ablations-tried to share?

Currently, no we don't. But we have published a technical-ish report: https://arxiv.org/abs/2309.16609. I wish there is something useful to you.

Small note, you don't state which languages the model is expected to perform well at which people would look for.

Please look at the tags: Chinese and English.
image.png

It can work for other languages, but not that well (from GitHub README):
image.png

Likely won't be answered by the authors

Just haven't got the time to check HF. GitHub should be more responsive.

jklj077 changed discussion status to closed

Sign up or log in to comment