colerobertson
/

wav2vec2-xls-r-phoneme-300m-tr-ogma-phoneme

Automatic Speech Recognition

Generated from Trainer

Inference Endpoints

Model card Files Files and versions Metrics Training metrics Community

wav2vec2-xls-r-phoneme-300m-tr-ogma-phoneme / README.md

colerobertson's picture

End of training

2718510 verified 9 months ago

|

3.26 kB

	---
	base_model: patrickvonplaten/wav2vec2-xls-r-phoneme-300m-tr
	tags:
	- generated_from_trainer
	model-index:
	- name: wav2vec2-xls-r-phoneme-300m-tr-ogma-phoneme
	results: []
	---

	<!-- This model card has been generated automatically according to the information the Trainer had access to. You
	should probably proofread and complete it, then remove this comment. -->

	# wav2vec2-xls-r-phoneme-300m-tr-ogma-phoneme

	This model is a fine-tuned version of [patrickvonplaten/wav2vec2-xls-r-phoneme-300m-tr](https://huggingface.co/patrickvonplaten/wav2vec2-xls-r-phoneme-300m-tr) on the None dataset.
	It achieves the following results on the evaluation set:
	- Loss: 2.6886
	- Cer: 0.7624

	## Model description

	More information needed

	## Intended uses & limitations

	More information needed

	## Training and evaluation data

	More information needed

	## Training procedure

	### Training hyperparameters

	The following hyperparameters were used during training:
	- learning_rate: 0.0001
	- train_batch_size: 8
	- eval_batch_size: 8
	- seed: 42
	- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
	- lr_scheduler_type: linear
	- lr_scheduler_warmup_steps: 100
	- num_epochs: 30
	- mixed_precision_training: Native AMP

	### Training results

	\| Training Loss \| Epoch \| Step \| Validation Loss \| Cer \|
	\|:-------------:\|:-----:\|:----:\|:---------------:\|:------:\|
	\| 9.8242 \| 1.0 \| 17 \| 10.2075 \| 1.7574 \|
	\| 7.5139 \| 2.0 \| 34 \| 6.2677 \| 1.8317 \|
	\| 4.93 \| 3.0 \| 51 \| 4.4229 \| 0.9950 \|
	\| 3.787 \| 4.0 \| 68 \| 3.6279 \| 0.9604 \|
	\| 3.2056 \| 5.0 \| 85 \| 3.3155 \| 0.9109 \|
	\| 2.8302 \| 6.0 \| 102 \| 3.0498 \| 0.8515 \|
	\| 2.6059 \| 7.0 \| 119 \| 2.9567 \| 0.8366 \|
	\| 2.3369 \| 8.0 \| 136 \| 2.8454 \| 0.8465 \|
	\| 2.0347 \| 9.0 \| 153 \| 2.7595 \| 0.8663 \|
	\| 1.787 \| 10.0 \| 170 \| 2.8327 \| 0.8416 \|
	\| 1.5493 \| 11.0 \| 187 \| 2.7142 \| 0.8465 \|
	\| 1.3992 \| 12.0 \| 204 \| 2.7668 \| 0.8713 \|
	\| 1.3539 \| 13.0 \| 221 \| 2.7595 \| 0.8465 \|
	\| 1.1791 \| 14.0 \| 238 \| 2.6278 \| 0.8366 \|
	\| 1.1649 \| 15.0 \| 255 \| 2.8350 \| 0.8564 \|
	\| 1.0361 \| 16.0 \| 272 \| 2.7286 \| 0.7921 \|
	\| 0.9179 \| 17.0 \| 289 \| 2.6409 \| 0.7772 \|
	\| 0.8338 \| 18.0 \| 306 \| 2.6040 \| 0.7574 \|
	\| 0.7847 \| 19.0 \| 323 \| 2.7403 \| 0.8564 \|
	\| 0.82 \| 20.0 \| 340 \| 2.6313 \| 0.8168 \|
	\| 0.753 \| 21.0 \| 357 \| 2.5469 \| 0.8168 \|
	\| 0.6124 \| 22.0 \| 374 \| 2.5799 \| 0.7822 \|
	\| 0.6236 \| 23.0 \| 391 \| 2.6548 \| 0.8069 \|
	\| 0.5955 \| 24.0 \| 408 \| 2.6331 \| 0.8317 \|
	\| 0.592 \| 25.0 \| 425 \| 2.6168 \| 0.8366 \|
	\| 0.5169 \| 26.0 \| 442 \| 2.6168 \| 0.8069 \|
	\| 0.5012 \| 27.0 \| 459 \| 2.5482 \| 0.7723 \|
	\| 0.44 \| 28.0 \| 476 \| 2.6088 \| 0.8020 \|
	\| 0.4243 \| 29.0 \| 493 \| 2.6753 \| 0.7871 \|
	\| 0.4824 \| 30.0 \| 510 \| 2.6886 \| 0.7624 \|


	### Framework versions

	- Transformers 4.38.2
	- Pytorch 2.1.0+cu121
	- Datasets 2.18.0
	- Tokenizers 0.15.2