zhichao yang committed "Update README.md" (commit 0f5ed99, parent cd814cf)

README.md CHANGED
---
language: en
datasets:
- ccdv/pubmed-summarization
license: apache-2.0
---

## Introduction

[Google's LongT5: Efficient Text-To-Text Transformer for Long Sequences](https://arxiv.org/pdf/2112.07916.pdf).

This is an unofficial longt5-large-16384-pubmed-10k_steps checkpoint. That is, it is the large configuration of the LongT5 model with transient-global attention, fine-tuned on the [PubMed summarization dataset](https://huggingface.co/datasets/ccdv/pubmed-summarization) for 10,000 training steps.

## Results and Fine-tuning Details
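A minimal inference sketch with Hugging Face `transformers` is shown below. The model id is a placeholder (this card does not state the checkpoint's repo id), so substitute the actual Hub path of this unofficial checkpoint; the generation parameters are illustrative, not the values used in fine-tuning.

```python
# Sketch: summarizing a long document with a LongT5 (transient-global
# attention) checkpoint via transformers. MODEL_ID is a placeholder;
# replace it with this unofficial checkpoint's Hugging Face repo id.
MODEL_ID = "google/long-t5-tglobal-large"  # placeholder base model id


def summarize(article: str, model_id: str = MODEL_ID, max_new_tokens: int = 256) -> str:
    """Summarize one long article with LongT5."""
    # Imports are deferred so the sketch can be read/loaded without
    # transformers installed.
    from transformers import AutoTokenizer, LongT5ForConditionalGeneration

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = LongT5ForConditionalGeneration.from_pretrained(model_id)

    # This configuration accepts inputs up to 16,384 tokens.
    inputs = tokenizer(article, max_length=16384, truncation=True, return_tensors="pt")
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens, num_beams=4)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Usage: call `summarize(long_pubmed_article)`; the first call downloads the checkpoint, so expect a one-time multi-gigabyte download for the large configuration.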