arabic-iti

This model is a fine-tuned version of facebook/wav2vec2-large-xlsr-53 on the common_voice dataset. It achieves the following results on the evaluation set:

Loss: 1.0154
Wer: 0.6350

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

learning_rate: 0.0005
train_batch_size: 8
eval_batch_size: 1
seed: 42
gradient_accumulation_steps: 8
total_train_batch_size: 64
optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
lr_scheduler_type: linear
lr_scheduler_warmup_steps: 3000
num_epochs: 50
mixed_precision_training: Native AMP

Training results

Training Loss	Epoch	Step	Validation Loss	Wer
3.0355	2.36	400	3.0286	1.0
0.7999	4.73	800	0.8623	0.8067
0.4485	7.1	1200	0.6920	0.6651
0.3719	9.47	1600	0.6361	0.6591
0.3401	11.83	2000	0.6967	0.6497
0.3222	14.2	2400	0.6697	0.6246
0.3094	16.57	2800	0.7282	0.6537
0.2822	18.93	3200	0.8019	0.6816
0.2446	21.3	3600	0.7622	0.6608
0.235	23.67	4000	0.8644	0.6780
0.2362	26.04	4400	0.9083	0.6710
0.206	28.4	4800	0.8243	0.6598
0.1765	30.77	5200	0.8614	0.6647
0.1458	33.14	5600	0.8907	0.6447
0.1544	35.5	6000	0.9059	0.6523
0.2402	18.88	6400	0.9639	0.6970
0.2026	20.06	6800	0.9868	0.6817
0.185	21.24	7200	1.0043	0.6936
0.1951	22.42	7600	0.8918	0.6795
0.1933	23.6	8000	0.9367	0.6826
0.2272	24.78	8400	0.8540	0.6792
0.1922	25.96	8800	0.8983	0.6657
0.1547	27.14	9200	0.9742	0.6747
0.1579	28.32	9600	0.9066	0.6668
0.1642	29.5	10000	0.9440	0.6790
0.1726	30.68	10400	0.9654	0.6813
0.1656	31.86	10800	0.9880	0.6801
0.1741	33.04	11200	0.9707	0.6584
0.1494	34.22	11600	0.9801	0.6709
0.1482	35.4	12000	0.9258	0.6646
0.14	36.58	12400	0.9802	0.6635
0.142	37.76	12800	0.9268	0.6524
0.1281	38.94	13200	0.9615	0.6587
0.1051	40.12	13600	0.9721	0.6495
0.1074	41.3	14000	1.0045	0.6582
0.0879	42.48	14400	1.0290	0.6516
0.1015	43.66	14800	1.0514	0.6556
0.0932	44.84	15200	1.0287	0.6450
0.1008	46.02	15600	0.9940	0.6399
0.0968	47.2	16000	1.0206	0.6368
0.0858	48.38	16400	1.0452	0.6361
0.0886	49.56	16800	1.0154	0.6350

Framework versions

Transformers 4.11.3
Pytorch 1.10.1+cu102
Datasets 1.13.3
Tokenizers 0.10.3

maher13
/

arabic-iti

arabic-iti

Model description

Intended uses & limitations

Training and evaluation data

Training procedure

Training hyperparameters

Training results

Framework versions

Dataset used to train maher13/arabic-iti

Space using maher13/arabic-iti 1

Evaluation results