End of training

- README.md +16 -13
- model.safetensors +1 -1

README.md CHANGED
@@ -22,7 +22,7 @@ model-index:
     metrics:
     - name: Wer
       type: wer
-      value: 0.
+      value: 0.7091714338438826
 ---
 
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -32,8 +32,8 @@ should probably proofread and complete it, then remove this comment. -->
 
 This model is a fine-tuned version of [facebook/wav2vec2-xls-r-300m](https://huggingface.co/facebook/wav2vec2-xls-r-300m) on the common_voice_13_0 dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.
-- Wer: 0.
+- Loss: 0.5247
+- Wer: 0.7092
 
 ## Model description
 
@@ -53,28 +53,31 @@ More information needed
 
 The following hyperparameters were used during training:
 - learning_rate: 0.0003
-- train_batch_size:
-- eval_batch_size:
+- train_batch_size: 64
+- eval_batch_size: 16
 - seed: 42
 - gradient_accumulation_steps: 2
-- total_train_batch_size:
+- total_train_batch_size: 128
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- lr_scheduler_warmup_steps:
-- num_epochs:
+- lr_scheduler_warmup_steps: 300
+- num_epochs: 20
 - mixed_precision_training: Native AMP
 
 ### Training results
 
-| Training Loss | Epoch | Step | Validation Loss | Wer
-|
-|
-| 0.
+| Training Loss | Epoch | Step | Validation Loss | Wer    |
+|:-------------:|:-----:|:----:|:---------------:|:------:|
+| 0.1953        | 3.86  | 400  | 0.5740          | 0.7963 |
+| 0.1959        | 7.73  | 800  | 0.5169          | 0.7743 |
+| 0.1486        | 11.59 | 1200 | 0.5334          | 0.7501 |
+| 0.1146        | 15.46 | 1600 | 0.5186          | 0.7226 |
+| 0.0885        | 19.32 | 2000 | 0.5247          | 0.7092 |
 
 
 ### Framework versions
 
 - Transformers 4.36.1
-- Pytorch
+- Pytorch 1.10.0+cu113
 - Datasets 2.15.0
 - Tokenizers 0.15.0
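The hyperparameter list above maps almost one-to-one onto `transformers.TrainingArguments`. The sketch below only illustrates that mapping and is not the original training script: the output directory and the evaluation/save schedule (every 400 steps, inferred from the results table) are assumptions. The reported total_train_batch_size of 128 is train_batch_size 64 × gradient_accumulation_steps 2, "Native AMP" corresponds to `fp16=True`, and the Adam settings shown in the card are the Trainer defaults, so they need no explicit argument.

```python
# Illustrative sketch only: the card's hyperparameters expressed as
# transformers.TrainingArguments. The output_dir and the eval/save schedule
# are assumptions, not values taken from the original run.
from transformers import TrainingArguments

training_args = TrainingArguments(
    output_dir="wav2vec2-xls-r-300m-cv13",  # hypothetical output directory
    learning_rate=3e-4,                     # learning_rate: 0.0003
    per_device_train_batch_size=64,         # train_batch_size: 64
    per_device_eval_batch_size=16,          # eval_batch_size: 16
    gradient_accumulation_steps=2,          # 64 * 2 = total_train_batch_size 128
    seed=42,
    lr_scheduler_type="linear",
    warmup_steps=300,                       # lr_scheduler_warmup_steps: 300
    num_train_epochs=20,
    fp16=True,                              # mixed_precision_training: Native AMP
    evaluation_strategy="steps",            # assumed: results logged every 400 steps
    eval_steps=400,
    save_steps=400,
)
```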
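The Wer values in the card are word error rates on the Common Voice 13.0 evaluation split. As a rough illustration of how such a score is obtained, the sketch below runs greedy CTC decoding with the fine-tuned checkpoint and scores the transcript with the `evaluate` WER metric. The repository id is a placeholder, the dummy audio stands in for a real 16 kHz Common Voice clip, and this is not the exact evaluation script behind the results table.

```python
# Minimal sketch: greedy CTC transcription with the fine-tuned checkpoint,
# scored against a reference transcript with WER via the `evaluate` library.
# The repo id below is a placeholder, not the actual model id on the Hub.
import numpy as np
import torch
import evaluate
from transformers import Wav2Vec2ForCTC, Wav2Vec2Processor

REPO_ID = "<this-model>"  # placeholder: the checkpoint described by this card

processor = Wav2Vec2Processor.from_pretrained(REPO_ID)
model = Wav2Vec2ForCTC.from_pretrained(REPO_ID).eval()

def transcribe(waveform: np.ndarray, sampling_rate: int = 16_000) -> str:
    """Greedy CTC decoding for a single mono waveform sampled at 16 kHz."""
    inputs = processor(waveform, sampling_rate=sampling_rate, return_tensors="pt")
    with torch.no_grad():
        logits = model(inputs.input_values).logits
    predicted_ids = torch.argmax(logits, dim=-1)
    return processor.batch_decode(predicted_ids)[0]

# Replace the dummy audio and transcript with a real Common Voice clip to get
# a meaningful score; one second of silence is used here only so the sketch
# runs end to end.
waveform = np.zeros(16_000, dtype=np.float32)
reference = "example reference transcript"

wer_metric = evaluate.load("wer")
prediction = transcribe(waveform)
print("prediction:", prediction)
print("wer:", wer_metric.compute(predictions=[prediction], references=[reference]))
```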
model.safetensors CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
+oid sha256:01b71a384a0ebe3caa4db77c198ea080996f4719e4d0bfe63721fa65340c5263
 size 1261967332
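The model.safetensors change is a Git LFS pointer update, so the commit records only the blob's SHA-256 and byte size rather than the weights themselves. Below is a small sketch for checking a locally downloaded copy against those two fields, assuming the file sits in the current directory.

```python
# Verify a locally downloaded model.safetensors against the LFS pointer above.
# The local path is an assumption; the hash and size are the values recorded
# in the pointer file for this commit.
import hashlib
import os

EXPECTED_SHA256 = "01b71a384a0ebe3caa4db77c198ea080996f4719e4d0bfe63721fa65340c5263"
EXPECTED_SIZE = 1261967332
path = "model.safetensors"  # assumed local download location

sha256 = hashlib.sha256()
with open(path, "rb") as f:
    for chunk in iter(lambda: f.read(1 << 20), b""):  # hash in 1 MiB chunks
        sha256.update(chunk)

assert os.path.getsize(path) == EXPECTED_SIZE, "size mismatch"
assert sha256.hexdigest() == EXPECTED_SHA256, "sha256 mismatch"
print("model.safetensors matches the LFS pointer")
```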