metadata
license: apache-2.0
metrics:
- cer
Belle-whisper-large-v2-zh
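Belle-whisper-large-v2-zh is fine-tuned from whisper-large-v2 to strengthen Chinese speech recognition. Compared with whisper-large-v2, it achieves a relative CER reduction of roughly 30-70% on Chinese ASR benchmarks.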
Fine-tuning
Model | Resample rate | Training datasets | Fine-tuning (full or PEFT) |
---|---|---|---|
Belle-whisper-large-v2-zh | 16 kHz | AISHELL-1, AISHELL-2, WenetSpeech, HKUST | full fine-tuning |
CER (character error rate, lower is better)
Model | Language Tag | aishell_1_test | aishell_2_test | wenetspeech_net | wenetspeech_meeting | HKUST_dev |
---|---|---|---|---|---|---|
whisper-large-v2 | Chinese | 0.08818 | 0.06183 | 0.12343 | 0.26413 | 0.31917 |
Belle-whisper-large-v2-zh | Chinese | 0.02549 | 0.03746 | 0.08503 | 0.14598 | 0.16289 |
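The relative improvement can be checked directly from the CER values in the table. The sketch below is illustrative only: `cer` uses the standard definition (character-level Levenshtein distance divided by reference length), which is an assumption, since the card does not state what text normalization or scoring tool was used. The function names and the `BASELINE` / `FINETUNED` dictionaries are hypothetical; the numbers are copied from the table.

```python
def cer(reference: str, hypothesis: str) -> float:
    """Character error rate: edit distance / number of reference characters.

    Assumed standard definition; the card's own scoring script may
    normalize text differently (punctuation, digits, spacing).
    """
    ref, hyp = list(reference), list(hypothesis)
    if not ref:
        raise ValueError("reference must not be empty")
    # prev[j] is the edit distance between ref[:i-1] and hyp[:j]
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1] / len(ref)


def relative_reduction(baseline: float, improved: float) -> float:
    """Fraction by which the error rate dropped relative to the baseline."""
    return (baseline - improved) / baseline


# CER values copied from the table (Language Tag: Chinese).
BASELINE = {   # whisper-large-v2
    "aishell_1_test": 0.08818,
    "aishell_2_test": 0.06183,
    "wenetspeech_net": 0.12343,
    "wenetspeech_meeting": 0.26413,
    "HKUST_dev": 0.31917,
}
FINETUNED = {  # Belle-whisper-large-v2-zh
    "aishell_1_test": 0.02549,
    "aishell_2_test": 0.03746,
    "wenetspeech_net": 0.08503,
    "wenetspeech_meeting": 0.14598,
    "HKUST_dev": 0.16289,
}

if __name__ == "__main__":
    # One wrong character out of six reference characters.
    print(f"toy CER: {cer('今天天气很好', '今天天汽很好'):.4f}")
    for name in BASELINE:
        gain = relative_reduction(BASELINE[name], FINETUNED[name])
        print(f"{name}: {gain:.1%} relative CER reduction")
```

On these numbers the reduction ranges from about 31% (wenetspeech_net) to about 71% (aishell_1_test), consistent with the 30-70% figure quoted at the top of the card.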