elmamounedieye
/

ASR

@@ -1,6 +1,4 @@
 ---
-language:
-- multilingual
 license: apache-2.0
 base_model: serge-wilson/whisper-small-wolof
 tags:
@@ -10,7 +8,7 @@ datasets:
 metrics:
 - wer
 model-index:
-- name: Whisper Wolof Lengo AI V5
   results:
   - task:
       name: Automatic Speech Recognition
@@ -24,19 +22,19 @@ model-index:
     metrics:
     - name: Wer
       type: wer
-      value: 39.312847261594285
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
-# Whisper Wolof Lengo AI V5
 This model is a fine-tuned version of [serge-wilson/whisper-small-wolof](https://huggingface.co/serge-wilson/whisper-small-wolof) on the audiofolder dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.3500
-- Wer: 39.3128
-- Cer: 26.3187
 ## Model description
@@ -60,25 +58,25 @@ The following hyperparameters were used during training:
 - eval_batch_size: 16
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.98) and epsilon=1e-05
-- lr_scheduler_type: polynomial
 - lr_scheduler_warmup_steps: 50
 - training_steps: 1990
 - mixed_precision_training: Native AMP
 ### Training results
-| Training Loss | Epoch | Step | Validation Loss | Wer      | Cer      |
-|:-------------:|:-----:|:----:|:---------------:|:--------:|:--------:|
-| 1.3022        | 1.0   | 208  | 1.2112          | 146.8307 | 105.1707 |
-| 0.8551        | 2.0   | 416  | 0.9020          | 90.5318  | 83.8833  |
-| 0.5754        | 3.0   | 624  | 0.7127          | 118.9704 | 102.1064 |
-| 0.3739        | 4.0   | 832  | 0.5951          | 63.1591  | 45.8295  |
-| 0.2459        | 5.0   | 1040 | 0.4929          | 63.5446  | 50.1286  |
-| 0.1579        | 6.0   | 1248 | 0.4524          | 51.6158  | 35.0170  |
-| 0.0884        | 7.0   | 1456 | 0.4204          | 46.9554  | 30.6374  |
-| 0.0498        | 8.0   | 1664 | 0.3817          | 51.6158  | 33.7194  |
-| 0.0268        | 9.0   | 1872 | 0.3490          | 40.8550  | 27.1844  |
-| 0.012         | 9.57  | 1990 | 0.3500          | 39.3128  | 26.3187  |
 ### Framework versions

 ---
 license: apache-2.0
 base_model: serge-wilson/whisper-small-wolof
 tags:
 metrics:
 - wer
 model-index:
+- name: ASR
   results:
   - task:
       name: Automatic Speech Recognition
     metrics:
     - name: Wer
       type: wer
+      value: 36.047170881052274
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
+# ASR
 This model is a fine-tuned version of [serge-wilson/whisper-small-wolof](https://huggingface.co/serge-wilson/whisper-small-wolof) on the audiofolder dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.3569
+- Wer: 36.0472
+- Cer: 22.5967
 ## Model description
 - eval_batch_size: 16
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.98) and epsilon=1e-05
+- lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 50
 - training_steps: 1990
 - mixed_precision_training: Native AMP
 ### Training results
+| Training Loss | Epoch | Step | Validation Loss | Wer     | Cer     |
+|:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|
+| 1.2672        | 1.0   | 208  | 1.2009          | 85.3838 | 63.2065 |
+| 0.875         | 2.0   | 416  | 0.8801          | 95.6117 | 69.2841 |
+| 0.5964        | 3.0   | 624  | 0.6979          | 88.4681 | 63.1476 |
+| 0.3953        | 4.0   | 832  | 0.6112          | 69.2255 | 57.6000 |
+| 0.2465        | 5.0   | 1040 | 0.5015          | 55.4825 | 44.1314 |
+| 0.161         | 6.0   | 1248 | 0.4401          | 53.7476 | 36.3715 |
+| 0.0903        | 7.0   | 1456 | 0.4081          | 47.1822 | 31.0320 |
+| 0.0553        | 8.0   | 1664 | 0.3751          | 44.7783 | 29.2044 |
+| 0.024         | 9.0   | 1872 | 0.3604          | 38.7686 | 25.2606 |
+| 0.011         | 9.57  | 1990 | 0.3569          | 36.0472 | 22.5967 |
 ### Framework versions