colerobertson's picture
Upload tokenizer
0614c1c verified
|
raw
history blame
3.26 kB
metadata
tags:
  - generated_from_trainer
base_model: patrickvonplaten/wav2vec2-xls-r-phoneme-300m-tr
model-index:
  - name: wav2vec2-xls-r-phoneme-300m-tr-ogma-phoneme
    results: []

wav2vec2-xls-r-phoneme-300m-tr-ogma-phoneme

This model is a fine-tuned version of patrickvonplaten/wav2vec2-xls-r-phoneme-300m-tr on the None dataset. It achieves the following results on the evaluation set:

  • Loss: 2.6886
  • Cer: 0.7624

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 0.0001
  • train_batch_size: 8
  • eval_batch_size: 8
  • seed: 42
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: linear
  • lr_scheduler_warmup_steps: 100
  • num_epochs: 30
  • mixed_precision_training: Native AMP

Training results

Training Loss Epoch Step Validation Loss Cer
9.8242 1.0 17 10.2075 1.7574
7.5139 2.0 34 6.2677 1.8317
4.93 3.0 51 4.4229 0.9950
3.787 4.0 68 3.6279 0.9604
3.2056 5.0 85 3.3155 0.9109
2.8302 6.0 102 3.0498 0.8515
2.6059 7.0 119 2.9567 0.8366
2.3369 8.0 136 2.8454 0.8465
2.0347 9.0 153 2.7595 0.8663
1.787 10.0 170 2.8327 0.8416
1.5493 11.0 187 2.7142 0.8465
1.3992 12.0 204 2.7668 0.8713
1.3539 13.0 221 2.7595 0.8465
1.1791 14.0 238 2.6278 0.8366
1.1649 15.0 255 2.8350 0.8564
1.0361 16.0 272 2.7286 0.7921
0.9179 17.0 289 2.6409 0.7772
0.8338 18.0 306 2.6040 0.7574
0.7847 19.0 323 2.7403 0.8564
0.82 20.0 340 2.6313 0.8168
0.753 21.0 357 2.5469 0.8168
0.6124 22.0 374 2.5799 0.7822
0.6236 23.0 391 2.6548 0.8069
0.5955 24.0 408 2.6331 0.8317
0.592 25.0 425 2.6168 0.8366
0.5169 26.0 442 2.6168 0.8069
0.5012 27.0 459 2.5482 0.7723
0.44 28.0 476 2.6088 0.8020
0.4243 29.0 493 2.6753 0.7871
0.4824 30.0 510 2.6886 0.7624

Framework versions

  • Transformers 4.38.2
  • Pytorch 2.1.0+cu121
  • Datasets 2.18.0
  • Tokenizers 0.15.2