nlparabic commited on
Commit
1cc2d24
1 Parent(s): 8c78c41

Model save

Browse files
README.md ADDED
@@ -0,0 +1,80 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ base_model: aubmindlab/aragpt2-base
3
+ tags:
4
+ - generated_from_trainer
5
+ metrics:
6
+ - bleu
7
+ - rouge
8
+ model-index:
9
+ - name: res_nw_irq_aragpt2-base
10
+ results: []
11
+ ---
12
+
13
+ <!-- This model card has been generated automatically according to the information the Trainer had access to. You
14
+ should probably proofread and complete it, then remove this comment. -->
15
+
16
+ # res_nw_irq_aragpt2-base
17
+
18
+ This model is a fine-tuned version of [aubmindlab/aragpt2-base](https://huggingface.co/aubmindlab/aragpt2-base) on an unknown dataset.
19
+ It achieves the following results on the evaluation set:
20
+ - Loss: 0.2191
21
+ - Bleu: 0.0944
22
+ - Rouge1: 0.4103
23
+ - Rouge2: 0.1839
24
+ - Rougel: 0.4066
25
+
26
+ ## Model description
27
+
28
+ More information needed
29
+
30
+ ## Intended uses & limitations
31
+
32
+ More information needed
33
+
34
+ ## Training and evaluation data
35
+
36
+ More information needed
37
+
38
+ ## Training procedure
39
+
40
+ ### Training hyperparameters
41
+
42
+ The following hyperparameters were used during training:
43
+ - learning_rate: 5e-05
44
+ - train_batch_size: 8
45
+ - eval_batch_size: 8
46
+ - seed: 42
47
+ - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
48
+ - lr_scheduler_type: linear
49
+ - lr_scheduler_warmup_steps: 500
50
+ - num_epochs: 20.0
51
+
52
+ ### Training results
53
+
54
+ | Training Loss | Epoch | Step | Validation Loss | Bleu | Rouge1 | Rouge2 | Rougel |
55
+ |:-------------:|:-----:|:-----:|:---------------:|:------:|:------:|:------:|:------:|
56
+ | 0.9334 | 1.0 | 1057 | 0.2461 | 0.0032 | 0.1593 | 0.0185 | 0.1533 |
57
+ | 0.0868 | 2.0 | 2114 | 0.2332 | 0.0149 | 0.2455 | 0.0510 | 0.2394 |
58
+ | 0.0767 | 3.0 | 3171 | 0.2342 | 0.0252 | 0.2961 | 0.0782 | 0.2910 |
59
+ | 0.0696 | 4.0 | 4228 | 0.2278 | 0.0404 | 0.3300 | 0.1050 | 0.3252 |
60
+ | 0.0636 | 5.0 | 5285 | 0.2219 | 0.0517 | 0.3536 | 0.1215 | 0.3480 |
61
+ | 0.0587 | 6.0 | 6342 | 0.2237 | 0.0590 | 0.3654 | 0.1348 | 0.3611 |
62
+ | 0.0542 | 7.0 | 7399 | 0.2194 | 0.0667 | 0.3755 | 0.1440 | 0.3712 |
63
+ | 0.0502 | 8.0 | 8456 | 0.2080 | 0.0715 | 0.3802 | 0.1521 | 0.3761 |
64
+ | 0.0468 | 9.0 | 9513 | 0.2123 | 0.0770 | 0.3931 | 0.1616 | 0.3889 |
65
+ | 0.0438 | 10.0 | 10570 | 0.2112 | 0.0812 | 0.3921 | 0.1648 | 0.3884 |
66
+ | 0.0408 | 11.0 | 11627 | 0.2102 | 0.0816 | 0.3967 | 0.1653 | 0.3936 |
67
+ | 0.0384 | 12.0 | 12684 | 0.2078 | 0.0845 | 0.4018 | 0.1711 | 0.3978 |
68
+ | 0.0363 | 13.0 | 13741 | 0.2145 | 0.0870 | 0.4023 | 0.1720 | 0.3986 |
69
+ | 0.0343 | 14.0 | 14798 | 0.2165 | 0.0878 | 0.4063 | 0.1757 | 0.4023 |
70
+ | 0.0327 | 15.0 | 15855 | 0.2169 | 0.0920 | 0.4049 | 0.1792 | 0.4014 |
71
+ | 0.0313 | 16.0 | 16912 | 0.2175 | 0.0920 | 0.4078 | 0.1821 | 0.4048 |
72
+ | 0.0301 | 17.0 | 17969 | 0.2191 | 0.0944 | 0.4103 | 0.1839 | 0.4066 |
73
+
74
+
75
+ ### Framework versions
76
+
77
+ - Transformers 4.45.0.dev0
78
+ - Pytorch 2.3.1+cu121
79
+ - Datasets 2.19.2
80
+ - Tokenizers 0.19.1
egy_training_log.txt CHANGED
@@ -308,3 +308,5 @@ INFO:root:Epoch 15.0: Train Loss = 0.0343, Eval Loss = 0.21651192009449005
308
  INFO:absl:Using default tokenizer.
309
  INFO:root:Epoch 16.0: Train Loss = 0.0327, Eval Loss = 0.21689023077487946
310
  INFO:absl:Using default tokenizer.
 
 
 
308
  INFO:absl:Using default tokenizer.
309
  INFO:root:Epoch 16.0: Train Loss = 0.0327, Eval Loss = 0.21689023077487946
310
  INFO:absl:Using default tokenizer.
311
+ INFO:root:Epoch 17.0: Train Loss = 0.0313, Eval Loss = 0.21753403544425964
312
+ INFO:absl:Using default tokenizer.
generation_config.json ADDED
@@ -0,0 +1,6 @@
 
 
 
 
 
 
 
1
+ {
2
+ "_from_model_config": true,
3
+ "bos_token_id": 0,
4
+ "eos_token_id": 0,
5
+ "transformers_version": "4.45.0.dev0"
6
+ }
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:ff7827fac3ce7eb11e5efe95bf5bbd11d42d8ff1d3748c347b2659fa09b6ae76
3
  size 540004992
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:98e93b91d2a93cc9c6e25eb09191af94f1b5c0e0412b5b944e3a710a2efbaf01
3
  size 540004992