File size: 3,091 Bytes
5306ad9
 
b0e716c
 
 
 
5f55d8a
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
5306ad9
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
---
language: en
license: apache-2.0
tags:
- summarization
datasets: arxiv-summarization
model-index:
- name: ArtifactAI/led_large_16384_arxiv_summarization
  results:
  - task:
      type: summarization
      name: Summarization
    dataset:
      name: ccdv/arxiv-summarization
      type: ccdv/arxiv-summarization
      config: section
      split: test
    metrics:
    - type: rouge
      value: 37.9472
      name: ROUGE-1
      verified: true
      verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiZDFkMzY4YTk0NGUyNDJjYzc2MWFiMGJlNWUyYTM2YjlmNjlkY2VkYmVhMDk2YjIxMjE3MjE4M2ZkOTAwODE2ZSIsInZlcnNpb24iOjF9.t2x5mqi0xM9Q0K9MscHZ6v_5pc-MOw8KieFTvFMqh5K4UAvvvcVGOGfGQi_Qb57gQa2DkrW0cNrJADY0VA1tAQ
    - type: rouge
      value: 11.3138
      name: ROUGE-2
      verified: true
      verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiNjdlYmQ4ZmRkNzc3YzE0NGQ2MTRhNDE4YTExNDYwYmNjODFhYjdmYTJlZWE4OTRhYWRiZmNmODZkMDZjMWY3NSIsInZlcnNpb24iOjF9.RPWY5CZMjaFaQ1vRQPoHyZxPD67dQdbXYL0UlJ53b_q1dMczXb7HtE_UmDNPi6F7thciVt6xWIzsckVmp9ZJCw
    - type: rouge
      value: 20.5557
      name: ROUGE-L
      verified: true
      verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiYWEwNTQ5MWViZTYwM2EyNzI0OWEyZDNlY2ExOTJiMjI3MmNjM2I4YmJjMzljYTQ3NjhkNjAzYzM5MDQzYjVkOCIsInZlcnNpb24iOjF9.ZgSkTbiUDaQRJGBIXjlTZKbtKmrIljEJ6btwhyfBsaz5oS0qmI76-b_vDRswnx96OcGTqdxICIjma6jgNbKiBA
    - type: rouge
      value: 33.8336
      name: ROUGE-LSUM
      verified: true
      verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiY2EzNzNhMWVmYjM5ZWUwOTZkYjU0MGZjMWQ0YTQ1NzA1NWQ4MjBjNjNhM2FmMmE3MmM3NzQwMzVkN2QzMzQxZiIsInZlcnNpb24iOjF9.bhxtgWXjCEv5ZFY3F7Mp-r4EHrIU8BNZ8X2zhpjSoyVLmjbfdFB-lnJdoH3PfVZEa14T96SJqMSHa6yzlqGEAQ
    - type: loss
      value: 2.8064792156219482
      name: loss
      verified: true
      verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiYzBhMTE0ZTdhOTRmYWE1Mjk5ZmViYjZiMjBmNzc2YzQ4YmNhYWM3NzRjYWUwYTEyZjU1NGI5MjVhODQwOTBlNCIsInZlcnNpb24iOjF9.l0nIJCcjoFyPF9M7MHiQxBQ3wtyk6jXURY0ZF6Xny3_DpkDh5YHs9kF494GJp5eYj6XG5HRGCgqhfmU7-fywAw
    - type: gen_len
      value: 157.4174
      name: gen_len
      verified: true
      verifyToken: eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiNDY0ZmE4M2VmOTU1NWY5M2I4YTYxNjM3NTkxNWU4NDY3N2Y0MTM1YWNlNmNjMGQ4N2UzM2ZkZWJhZTVmMjQ2OCIsInZlcnNpb24iOjF9.sAp6g7nt1tKTdGfOlGm3fdxzH1jxjNOZO65BNnVJkxDhu86j8QP3ZvNPv7PpD2sK4p6yM_HlHPPeX4bgmDi2BQ
---

## Introduction

A led-large-16384 model to summarize ArXiv papers. Inputs are the abstracts of papers and full documents, and outputs are the summaries of the papers.

[Allenai's Longformer Encoder-Decoder (LED)](https://github.com/allenai/longformer#longformer).

As described in [Longformer: The Long-Document Transformer](https://arxiv.org/pdf/2004.05150.pdf) by Iz Beltagy, Matthew E. Peters, Arman Cohan, 
*led-base-16384* was initialized from [*bart-base*](https://huggingface.co/facebook/bart-base) since both models share the exact same architecture. To 
be able to process 16K tokens, *bart-base*'s position embedding matrix was simply copied 16 times.