metadata
license: apache-2.0
tags:
  - summarization
  - generated_from_trainer
metrics:
  - rouge
model-index:
  - name: t5-small-finetuned-multi-news
    results: []

t5-small-finetuned-multi-news

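This model is a fine-tuned version of t5-small. The training dataset was not recorded when this card was generated (the field reads "None"); the model name suggests the Multi-News summarization dataset, but the card does not confirm this. After the final training epoch (epoch 7, step 8750), it achieves the following results on the evaluation set: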

  • Loss: 2.8049
  • Rouge1: 15.1241
  • Rouge2: 4.9514
  • Rougel: 11.5019
  • Rougelsum: 13.3079
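
To make the Rouge1 figure above easier to interpret, here is a simplified sketch of a unigram-overlap F1 score. It is an illustration only: it assumes lowercased whitespace tokenization and no stemming, so it will not reproduce the numbers above, which were presumably computed with a standard ROUGE implementation and appear to be reported on a 0 to 100 scale. The function name `rouge1_f1` is hypothetical.

```python
from collections import Counter


def rouge1_f1(candidate: str, reference: str) -> float:
    """Simplified ROUGE-1 F1: unigram overlap between two texts.

    Assumption: tokens are lowercased, whitespace-separated words.
    Real ROUGE implementations apply their own tokenization and
    optional stemming, so scores will differ.
    """
    cand_counts = Counter(candidate.lower().split())
    ref_counts = Counter(reference.lower().split())

    # Clipped overlap: each word counts at most as often as it
    # appears in both the candidate and the reference.
    overlap = sum((cand_counts & ref_counts).values())
    if overlap == 0:
        return 0.0

    precision = overlap / sum(cand_counts.values())
    recall = overlap / sum(ref_counts.values())
    return 2 * precision * recall / (precision + recall)


if __name__ == "__main__":
    score = rouge1_f1("the cat", "the cat sat")
    # Multiply by 100 to compare with the scale used in this card.
    print(f"ROUGE-1 F1 (0-100 scale): {100 * score:.2f}")
```

A score near 15 on this scale means that, on average, only a small fraction of unigrams is shared between generated and reference summaries.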

Model description

More information needed

Intended uses & limitations

More information needed
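
Until this section is filled in, the following is a minimal usage sketch rather than documented behavior. It assumes the checkpoint is published on the Hub under the repository id `natanmb/t5-small-finetuned-multi-news` (an assumption, not stated in this card) and that inputs were prefixed with `summarize: ` during fine-tuning, as is conventional for T5; neither is confirmed here. The helper names `build_input` and `summarize` are hypothetical.

```python
# Assumed Hub repository id; adjust if the checkpoint lives elsewhere.
MODEL_ID = "natanmb/t5-small-finetuned-multi-news"


def build_input(article: str) -> str:
    """Collapse whitespace and add the conventional T5 task prefix.

    Assumption: the model was fine-tuned with the "summarize: " prefix.
    """
    return "summarize: " + " ".join(article.split())


def summarize(article: str, max_new_tokens: int = 64) -> str:
    """Generate a summary with the fine-tuned checkpoint."""
    # Imported lazily so build_input can be used without transformers.
    from transformers import AutoModelForSeq2SeqLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForSeq2SeqLM.from_pretrained(MODEL_ID)

    # Truncate long articles to 512 tokens (an assumed input limit).
    inputs = tokenizer(
        build_input(article),
        return_tensors="pt",
        truncation=True,
        max_length=512,
    )
    output_ids = model.generate(
        **inputs, max_new_tokens=max_new_tokens, num_beams=4
    )
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Calling `summarize(text)` downloads the weights on first use. Given the ROUGE scores reported in this card, the output should be treated as a rough baseline.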

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 5.6e-05
  • train_batch_size: 4
  • eval_batch_size: 4
  • seed: 42
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: linear
  • num_epochs: 7
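
The linear scheduler listed above can be sketched as follows. This is an illustration under stated assumptions, not the exact training code: it assumes zero warmup steps (none are listed in this card) and takes the 1250 steps per epoch from the training results table. The function name `linear_lr` is hypothetical.

```python
BASE_LR = 5.6e-05        # learning_rate from this card
STEPS_PER_EPOCH = 1250   # from the training results table
NUM_EPOCHS = 7           # num_epochs from this card
TOTAL_STEPS = STEPS_PER_EPOCH * NUM_EPOCHS


def linear_lr(
    step: int,
    base_lr: float = BASE_LR,
    total_steps: int = TOTAL_STEPS,
    warmup_steps: int = 0,  # assumption: no warmup was used
) -> float:
    """Learning rate at `step` under linear warmup then linear decay."""
    if step < warmup_steps:
        # Ramp up linearly from 0 to base_lr during warmup.
        return base_lr * step / max(1, warmup_steps)
    # Decay linearly from base_lr to 0 over the remaining steps.
    remaining = max(0, total_steps - step)
    return base_lr * remaining / max(1, total_steps - warmup_steps)


if __name__ == "__main__":
    for epoch in range(NUM_EPOCHS + 1):
        step = epoch * STEPS_PER_EPOCH
        print(f"epoch {epoch}: step {step}, lr {linear_lr(step):.3e}")
```

Under these assumptions the learning rate falls to half its initial value midway through training and reaches zero at the final step, which is consistent with the small changes in validation loss over the last few epochs.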

Training results

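| Training Loss | Epoch | Step | Validation Loss | Rouge1  | Rouge2 | Rougel  | Rougelsum |
|---------------|-------|------|-----------------|---------|--------|---------|-----------|
| 3.2411        | 1.0   | 1250 | 2.8772          | 14.6774 | 4.7697 | 11.335  | 13.0082   |
| 3.079         | 2.0   | 2500 | 2.8438          | 14.9558 | 4.8748 | 11.4023 | 13.2198   |
| 3.0257        | 3.0   | 3750 | 2.8240          | 15.133  | 4.9814 | 11.572  | 13.3607   |
| 2.9903        | 4.0   | 5000 | 2.8153          | 15.1339 | 4.9123 | 11.5038 | 13.3464   |
| 2.9659        | 5.0   | 6250 | 2.8085          | 15.1134 | 5.0057 | 11.5478 | 13.3483   |
| 2.9461        | 6.0   | 7500 | 2.8066          | 15.154  | 4.9641 | 11.5276 | 13.3523   |
| 2.936         | 7.0   | 8750 | 2.8049          | 15.1241 | 4.9514 | 11.5019 | 13.3079   |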

Framework versions

  • Transformers 4.28.1
  • PyTorch 2.0.0+cu118
  • Datasets 2.11.0
  • Tokenizers 0.13.3