gsl22
/

flan-t5-ellis

Text2Text Generation

Generated from Trainer

text-generation-inference

Inference Endpoints

Model card Files Files and versions Metrics Training metrics Community

gsl22 commited on Dec 20, 2023

Commit

292d17f

•

1 Parent(s): ca54a3e

update model card README.md

Files changed (1) hide show

README.md +10 -1

README.md CHANGED Viewed

@@ -14,6 +14,8 @@ should probably proofread and complete it, then remove this comment. -->
 # flan-t5-ellis
 This model is a fine-tuned version of [google/flan-t5-base](https://huggingface.co/google/flan-t5-base) on the None dataset.
 ## Model description
@@ -41,10 +43,17 @@ The following hyperparameters were used during training:
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 500
-- num_epochs: 1
 ### Training results
 ### Framework versions

 # flan-t5-ellis
 This model is a fine-tuned version of [google/flan-t5-base](https://huggingface.co/google/flan-t5-base) on the None dataset.
+It achieves the following results on the evaluation set:
+- Loss: 0.8109
 ## Model description
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 500
+- num_epochs: 50
 ### Training results
+| Training Loss | Epoch | Step | Validation Loss |
+|:-------------:|:-----:|:----:|:---------------:|
+| 0.995         | 9.96  | 500  | 0.8963          |
+| 0.7497        | 19.93 | 1000 | 0.7965          |
+| 0.6163        | 29.89 | 1500 | 0.7912          |
+| 0.5471        | 39.85 | 2000 | 0.8116          |
+| 0.5249        | 49.81 | 2500 | 0.8109          |
 ### Framework versions