UNIST-Eunchan
/

Research-Paper-Summarization-Pegasus-x-ArXiv

text2text-generation

Generated from Trainer

Inference Endpoints

Model card Files Files and versions Community

UNIST-Eunchan commited on Nov 28, 2023

Commit

7427deb

•

1 Parent(s): 1a89701

Update README.md

Files changed (1) hide show

README.md +15 -7

README.md CHANGED Viewed

@@ -60,13 +60,25 @@ It achieves the following results on the evaluation set:
-## Model description
-More information needed
 ## Intended uses & limitations
-Paper Summarization
 ## Compare to Baseline
 - Pegasus-X-base **zero-shot** Performance:
@@ -104,10 +116,6 @@ Paper Summarization
-## Training and evaluation data
-We use full of dataset 'ccdv/arxiv-summarization'.
 ## Training procedure
 We use huggingface-based environment such as datasets, trainer, etc.

+**Base Model**: [Pegasus-x-base (State-of-the-art for Long Context Summarization)](https://huggingface.co/google/pegasus-x-base)
+**Finetuning Dataset**:
+- We used **full of ArXiv Dataset (Cohan et al., 2018, NAACL-HLT 2018)** [[PDF]](https://arxiv.org/abs/1804.05685)
+  - (Full length is 200,000+)
+**GPU**: (RTX A6000) x 1
+**Train time**: About 120 hours for 5 epochs
+**Test time**: About 8 hours for test dataset.
 ## Intended uses & limitations
+- **Research Paper Summarization**
 ## Compare to Baseline
 - Pegasus-X-base **zero-shot** Performance:
 ## Training procedure
 We use huggingface-based environment such as datasets, trainer, etc.