Model Card
This model is identical to allenai/led-base-16384, except the generation_config.json
has been updated from:
{
"_from_model_config": true,
"bos_token_id": 0,
"decoder_start_token_id": 2,
"eos_token_id": 2,
"pad_token_id": 1
}
to
{
"bos_token_id": 0,
"decoder_start_token_id": 2,
"eos_token_id": 2,
"pad_token_id": 1,
"early_stopping": true,
"length_penalty": 2.0,
"max_length": 512,
"min_length": 100,
"no_repeat_ngram_size": 3,
"num_beams": 4
}
which we found to be much more stable when fine-tuning the model for summarization tasks.
- Downloads last month
- 4
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social
visibility and check back later, or deploy to Inference Endpoints (dedicated)
instead.