Update README.md
README.md CHANGED
@@ -56,11 +56,18 @@ model-index:
 whisper tiny fine-tuned on a very big collection of vietnamese speech datasets
 
 TODO:
-- [x] training then publish checkpoint
-- [x] evaluate WER on Common Voice & FLEURS
+- [x] training then publish checkpoint
+- [x] evaluate WER on Common Voice & FLEURS & VIVOS
 - [ ] convert to `openai-whisper`, `whisper.cpp`, `faster-whisper`
-- [ ] convert to ONNX: to try
+- [ ] convert to ONNX: to try https://github.com/k2-fsa/sherpa-onnx & https://github.com/zhuzilin/whisper-openvino
+- [ ] convert to TensorRT: https://github.com/openai/whisper/discussions/169
 
 21k steps, warm-up 5%, batch size 16×2 (kaggle free T4×2)
 
-
+manually evaluate WER on test set - vietnamese part:
+| @ `float16` | `CommonVoice v16.1` | `FLEURS` | `VIVOS` |
+|---|---|---|---|
+| original `whisper-tiny` | >100% | 88.6% | 62.5% |
+| this model | 26.6% | 37.1% | 18.7% |
+
+all training + evaluation scripts are on my repo: https://github.com/phineas-pta/fine-tune-whisper-vi
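
for the `faster-whisper` item in the TODO above, a rough sketch of what the conversion and usage could look like once it is done — assuming CTranslate2's `TransformersConverter` and a placeholder checkpoint id, so not something this commit ships:

```python
# sketch only: convert the fine-tuned checkpoint to CTranslate2 format,
# then load it with faster-whisper — ids/paths below are placeholders
import ctranslate2
from faster_whisper import WhisperModel

MODEL_ID = "<this-repo-id-on-the-hub>"  # placeholder, replace with the real repo id

# export the Transformers checkpoint to a CTranslate2 directory with float16 weights
converter = ctranslate2.converters.TransformersConverter(MODEL_ID)
converter.convert("whisper-tiny-vi-ct2", quantization="float16")

# run inference with faster-whisper on the converted model
model = WhisperModel("whisper-tiny-vi-ct2", device="cuda", compute_type="float16")
segments, info = model.transcribe("sample.wav", language="vi")
for segment in segments:
    print(segment.text)
```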
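
the schedule mentioned above (21k steps, 5% warm-up, batch 16 per device on 2× T4) maps roughly onto `Seq2SeqTrainingArguments` like below — a sketch with illustrative values, the actual training script lives in the repo linked above:

```python
from transformers import Seq2SeqTrainingArguments

# illustrative sketch of the schedule described above, not the exact script:
# 21k optimizer steps, 5% warm-up, batch 16 per GPU on 2× T4
training_args = Seq2SeqTrainingArguments(
    output_dir="./whisper-tiny-vi",     # placeholder output path
    max_steps=21_000,
    warmup_ratio=0.05,                  # 5% of 21k ≈ 1050 warm-up steps
    per_device_train_batch_size=16,     # ×2 GPUs → 32 samples per optimizer step
    per_device_eval_batch_size=16,
    fp16=True,                          # T4 supports float16 training
    predict_with_generate=True,         # needed to compute WER during eval
)
```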
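
a minimal sketch of the manual WER check behind the table above — assumes the `transformers` ASR pipeline at `float16` plus `datasets`/`evaluate`, with a placeholder model id and only naive lower-casing as normalisation; the real evaluation scripts are in the repo linked above:

```python
import torch, evaluate
from datasets import Audio, load_dataset
from transformers import pipeline

MODEL_ID = "<this-repo-id-on-the-hub>"  # placeholder, replace with the real repo id

# ASR pipeline at float16, as in the table above
asr = pipeline(
    "automatic-speech-recognition",
    model=MODEL_ID,
    torch_dtype=torch.float16,
    device="cuda:0",
)
wer = evaluate.load("wer")

# vietnamese part of FLEURS; Common Voice / VIVOS would follow the same pattern
ds = load_dataset("google/fleurs", "vi_vn", split="test")
ds = ds.cast_column("audio", Audio(sampling_rate=16_000))

preds, refs = [], []
for sample in ds:
    out = asr(
        {"raw": sample["audio"]["array"], "sampling_rate": 16_000},
        generate_kwargs={"language": "vi", "task": "transcribe"},
    )
    preds.append(out["text"].lower())              # naive normalisation only
    refs.append(sample["transcription"].lower())

print(f"WER: {100 * wer.compute(predictions=preds, references=refs):.1f}%")
```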