andromeda0302 commited on
Commit
c770b49
1 Parent(s): f514292

Spanish fine-tune (pre-alpha)

Browse files

This is a really early version of an Spanish fine-tune. I am training it on around 150 hours of spanish audio databases with permissive licenses.
It has only been trained for 8 hours on one 3090. It's still bad but now it has spanish understanding.
I am sending it in case you are interested in adding it as an optional checkpoint. Also if you are interested I may send the next versions as it continues to train.
This may serve as a placeholder while no other options are available.

This are the URLs of the datasets I used:

https://www.kaggle.com/datasets/carlfm01/120h-spanish-speech
https://www.kaggle.com/datasets/bryanpark/spanish-single-speaker-speech-dataset?resource=download
http://openslr.org/61/
http://openslr.org/71/
https://www.openslr.org/72/
http://openslr.org/73/
http://openslr.org/74/
http://openslr.org/75/

Files changed (1) hide show
  1. spanish_model_pre_alpha_.pt +3 -0
spanish_model_pre_alpha_.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:c59340a36420280d5f913ea1aaf3462805f2fbc7db26c6faa4c71a258852c98f
3
+ size 5394190092