UNIST-Eunchan commited on
Commit
ed62c75
1 Parent(s): ec594e7

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +12 -2
README.md CHANGED
@@ -60,9 +60,19 @@ More information needed
60
 
61
  ## Training and evaluation data
62
 
63
- More information needed
64
 
65
- ## Training procedure
 
 
 
 
 
 
 
 
 
 
 
66
 
67
  ### Training hyperparameters
68
 
 
60
 
61
  ## Training and evaluation data
62
 
 
63
 
64
+
65
+ Base Model: Pegasus-x-base (State-of-the-art for Long Context Summarization)
66
+
67
+ Finetuning Dataset: We used train[25000:100000] of ArXiv Dataset (Cohan et al., 2018) [NAACL-HLT 2018] [PDF](https://arxiv.org/abs/1804.05685)
68
+ (-Full length is 200,000+, We will upload full trained Model soon)
69
+
70
+ GPU: One A100 GPU
71
+ Train time: About 20 hours for 3 epochs
72
+
73
+ Test time: About 8 hours for test dataset.
74
+
75
+
76
 
77
  ### Training hyperparameters
78