This model is a version of openai/whisper-large-v3 fine-tuned with the train_all and other_with_excluded custom splits from techiaith/commonvoice_18_0_cy.
It achieves the following results on the Common Voice for Welsh release 18's standard test set:
N.B. This model performs considerably worse on English-language speech than a bilingual model, but better on Welsh.
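from transformers import pipeline
# Load the fine-tuned Welsh model into a speech recognition pipeline.
transcriber = pipeline("automatic-speech-recognition", model="techiaith/whisper-large-v3-ft-cv-cy")
# Replace the placeholder with a local path or URL to a sound file.
result = transcriber("<path or url to soundfile>")
print(result)
# Example output:
# {'text': 'Mae hen wlad fy nhadau yn annwyl i mi.'}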
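To check a transcription against a known reference sentence, a word error rate (WER) can be computed from the pipeline's result dictionary. The following is a minimal sketch, not the evaluation procedure used for this model: it assumes WER is the metric of interest, and the `word_error_rate` helper and its simple lowercase and whitespace normalisation are hypothetical choices made here for illustration.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the number of reference words.

    Hypothetical helper: normalisation is only lowercasing and whitespace
    splitting, which is an assumption and may differ from published scores.
    """
    ref_words = reference.lower().split()
    hyp_words = hypothesis.lower().split()
    if not ref_words:
        raise ValueError("reference must contain at least one word")

    # Levenshtein distance over words, keeping only the previous row.
    previous = list(range(len(hyp_words) + 1))
    for i, ref_word in enumerate(ref_words, start=1):
        current = [i]
        for j, hyp_word in enumerate(hyp_words, start=1):
            substitution = previous[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = previous[j] + 1
            insertion = current[j - 1] + 1
            current.append(min(substitution, deletion, insertion))
        previous = current
    return previous[-1] / len(ref_words)


# `result` has the same shape as the pipeline output shown above.
result = {"text": "Mae hen wlad fy nhadau yn annwyl i mi."}
reference = "Mae hen wlad fy nhadau yn annwyl i mi."
print(word_error_rate(reference, result["text"]))
```

A score of 0.0 means the transcription matches the reference word for word. Punctuation is left attached to words in this sketch, so a stricter or looser normalisation will change the score.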