temporary0-0name
/

run_opt

Text Generation

Generated from Trainer

Inference Endpoints

Model card Files Files and versions Metrics Training metrics Community

temporary0-0name commited on Nov 14, 2023

Commit

62019fa

•

1 Parent(s): 7fa70fb

End of training

Files changed (1) hide show

README.md +20 -20

README.md CHANGED Viewed

@@ -17,7 +17,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [bert-base-uncased](https://huggingface.co/bert-base-uncased) on the wikitext dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.2727
 ## Model description
@@ -36,7 +36,7 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 0.0001
 - train_batch_size: 64
 - eval_batch_size: 64
 - seed: 42
@@ -51,24 +51,24 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 9.1471        | 0.55  | 50   | 7.9703          |
-| 7.1714        | 1.1   | 100  | 6.6558          |
-| 6.4707        | 1.65  | 150  | 6.2924          |
-| 6.072         | 2.19  | 200  | 5.8048          |
-| 5.1389        | 2.74  | 250  | 3.8826          |
-| 3.1897        | 3.29  | 300  | 2.3133          |
-| 1.9697        | 3.84  | 350  | 1.4230          |
-| 1.2783        | 4.39  | 400  | 0.9488          |
-| 0.8952        | 4.94  | 450  | 0.6810          |
-| 0.6593        | 5.49  | 500  | 0.5228          |
-| 0.5278        | 6.04  | 550  | 0.4249          |
-| 0.4339        | 6.58  | 600  | 0.3630          |
-| 0.3809        | 7.13  | 650  | 0.3237          |
-| 0.3443        | 7.68  | 700  | 0.2991          |
-| 0.3212        | 8.23  | 750  | 0.2843          |
-| 0.3094        | 8.78  | 800  | 0.2765          |
-| 0.3033        | 9.33  | 850  | 0.2734          |
-| 0.3           | 9.88  | 900  | 0.2727          |
 ### Framework versions

 This model is a fine-tuned version of [bert-base-uncased](https://huggingface.co/bert-base-uncased) on the wikitext dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.0165
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 0.0003
 - train_batch_size: 64
 - eval_batch_size: 64
 - seed: 42
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| 8.562         | 0.55  | 50   | 6.9697          |
+| 6.63          | 1.1   | 100  | 6.3436          |
+| 5.938         | 1.65  | 150  | 5.1110          |
+| 3.0597        | 2.19  | 200  | 1.4150          |
+| 0.7989        | 2.74  | 250  | 0.3477          |
+| 0.2227        | 3.29  | 300  | 0.1284          |
+| 0.0925        | 3.84  | 350  | 0.0640          |
+| 0.0475        | 4.39  | 400  | 0.0412          |
+| 0.0314        | 4.94  | 450  | 0.0304          |
+| 0.0217        | 5.49  | 500  | 0.0246          |
+| 0.0181        | 6.04  | 550  | 0.0215          |
+| 0.0146        | 6.58  | 600  | 0.0194          |
+| 0.0132        | 7.13  | 650  | 0.0182          |
+| 0.012         | 7.68  | 700  | 0.0174          |
+| 0.0114        | 8.23  | 750  | 0.0169          |
+| 0.011         | 8.78  | 800  | 0.0167          |
+| 0.0108        | 9.33  | 850  | 0.0166          |
+| 0.0106        | 9.88  | 900  | 0.0165          |
 ### Framework versions