metadata

license: apache-2.0
tags:
  - summarization
  - generated_from_trainer
metrics:
  - rouge
model-index:
  - name: t5-base-finetuned-multi-news
    results: []

t5-base-finetuned-multi-news

This model is a fine-tuned version of t5-base on the None dataset. It achieves the following results on the evaluation set:

Loss: 2.2612
Rouge1: 16.6322
Rouge2: 5.7556
Rougel: 12.4728
Rougelsum: 14.4814

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

learning_rate: 5.6e-05
train_batch_size: 4
eval_batch_size: 4
seed: 42
optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
lr_scheduler_type: linear
num_epochs: 8

Training results

Training Loss	Epoch	Step	Validation Loss	Rouge1	Rouge2	Rougel	Rougelsum
2.5641	1.0	1250	2.2636	16.6762	5.7127	12.4648	14.5499
2.3542	2.0	2500	2.2439	16.7381	5.7345	12.5515	14.5785
2.2487	3.0	3750	2.2388	16.8879	5.8792	12.6417	14.8011
2.1705	4.0	5000	2.2413	16.5921	5.7804	12.4539	14.4865
2.1083	5.0	6250	2.2459	16.6878	5.8593	12.5132	14.5473
2.0622	6.0	7500	2.2495	16.7267	5.7825	12.48	14.5309
2.0297	7.0	8750	2.2581	16.633	5.748	12.4418	14.4796
2.0084	8.0	10000	2.2612	16.6322	5.7556	12.4728	14.4814

Framework versions

Transformers 4.28.1
Pytorch 2.0.0+cu118
Datasets 2.11.0
Tokenizers 0.13.3