shuaijiang commited on
Commit
6121288
1 Parent(s): 882cc29

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +22 -1
README.md CHANGED
@@ -4,7 +4,28 @@ metrics:
4
  - cer
5
  ---
6
  # Belle-whisper-large-v2-zh
7
- 基于whisper-large-v2增强中文语音识别能力,相比于whisper-large-v2,Belle-whisper-large-v2-zh在中文ASR benchmark上相对提升30~70%
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
8
 
9
  ## Fine-tuning
10
  | Model | (Re)Sample Rate | Train Datasets | Fine-tuning (full or peft) |
 
4
  - cer
5
  ---
6
  # Belle-whisper-large-v2-zh
7
+ Fine tune whisper-large-v2 to improve Chinese speech recognition, Belle-whisper-large-v2-zh has 30-70% relative improvements on Chinese ASR benchmark(AISHELL1, AISHELL2, WENETSPEECH, HKUST).
8
+
9
+ ## Usage
10
+ ```python
11
+
12
+ from transformers import pipeline
13
+
14
+ transcriber = pipeline(
15
+ "automatic-speech-recognition",
16
+ model="BELLE-2/Belle-whisper-large-v2-zh"
17
+ )
18
+
19
+ transcriber.model.config.forced_decoder_ids = (
20
+ transcriber.tokenizer.get_decoder_prompt_ids(
21
+ language="zh",
22
+ task="transcribe"
23
+ )
24
+ )
25
+
26
+ transcription = transcriber("my_audio.wav")
27
+
28
+ ```
29
 
30
  ## Fine-tuning
31
  | Model | (Re)Sample Rate | Train Datasets | Fine-tuning (full or peft) |