TKU410410103
/

uniTKU-hubert-japanese-asr

Automatic Speech Recognition

Generated from Trainer

Inference Endpoints

Model card Files Files and versions Community

TKU410410103 commited on Apr 22

Commit

13ed52a

•

1 Parent(s): b39408d

Update README.md

Files changed (1) hide show

README.md +15 -17

README.md CHANGED Viewed

@@ -18,10 +18,10 @@ model-index:
     metrics:
     - name: Test WER
       type: wer
-      value: 27.447168
     - name: Test CER
       type: cer
-      value: 11.607944
 datasets:
 - mozilla-foundation/common_voice_11_0
 language:
@@ -40,16 +40,14 @@ Fine-tuning on the uniTKU dataset led to the following results:
 | Step  | Training Loss | Validation Loss | WER    |
 |-------|---------------|-----------------|--------|
-| 100  | 0.910100      | 1.051628        | 0.669118|
-| 200  | 0.747700      | 0.691642        | 0.551471|
-| 300  | 0.718000      | 0.705763        | 0.544118|
-| 400  | 0.663700      | 0.532831        | 0.397059|
-| 500  | 0.667700      | 0.491024        | 0.352941|
-| 600  | 0.546800      | 0.365637        | 0.330882|
-| 700  | 0.569000      | 0.274410        | 0.279412|
-| 800  | 0.591800      | 0.274801        | 0.235294|
-| 900  | 0.575400      | 0.257891        | 0.220588|
-| 1000 | 0.579100      | 0.280559        | 0.205882|
 ### Training hyperparameters
@@ -59,7 +57,7 @@ The training hyperparameters remained consistent throughout the fine-tuning proc
 - train_batch_size: 16
 - eval_batch_size: 16
 - gradient_accumulation_steps: 2
-- num_train_epochs: 15
 - lr_scheduler_type: linear
 ### How to evaluate the model
@@ -151,12 +149,12 @@ print("CER: {:2f}%".format(100 * cer_result))
 The final model was evaluated as follows:
 On uniTKU Dataset:
-- WER: 20.588235%
-- CER: 13.027523%
 On common_voice_11_0:
-- WER: 27.447168%
-- CER: 11.607944%
 ### Framework versions

     metrics:
     - name: Test WER
       type: wer
+      value: 27.511982
     - name: Test CER
       type: cer
+      value: 11.563649
 datasets:
 - mozilla-foundation/common_voice_11_0
 language:
 | Step  | Training Loss | Validation Loss | WER    |
 |-------|---------------|-----------------|--------|
+| 100  | 1.127100      | 1.089644        | 0.668508|
+| 200  | 0.873500      | 0.682353        | 0.508287|
+| 300  | 0.786200      | 0.482965        | 0.397790|
+| 400  | 0.670400      | 0.345377        | 0.381215|
+| 500  | 0.719500      | 0.387554        | 0.337017|
+| 600  | 0.707700      | 0.371083        | 0.292818|
+| 700  | 0.658300      | 0.236447        | 0.243094|
+| 800  | 0.611100      | 0.207679        | 0.193370|
 ### Training hyperparameters
 - train_batch_size: 16
 - eval_batch_size: 16
 - gradient_accumulation_steps: 2
+- max_steps: 800
 - lr_scheduler_type: linear
 ### How to evaluate the model
 The final model was evaluated as follows:
 On uniTKU Dataset:
+- WER: 19.003370%
+- CER: 11.027523%
 On common_voice_11_0:
+- WER: 27.511982%
+- CER: 11.563649%
 ### Framework versions