Update README.md
README.md
CHANGED
@@ -52,7 +52,7 @@ widget:
 - **Data:** BNE
 
 ## Model description
-**GPT2-large-bne** is a transformer-based model for the Spanish language. It is based on the [GPT-2](
+**GPT2-large-bne** is a transformer-based model for the Spanish language. It is based on the [GPT-2](https://d4mucfpksywv.cloudfront.net/better-language-models/language_models_are_unsupervised_multitask_learners.pdf) model and has been pre-trained using the largest Spanish corpus known to date, with a total of 570GB of clean and deduplicated text processed for this work, compiled from the web crawlings performed by the [National Library of Spain (Biblioteca Nacional de España)](http://www.bne.es/en/Inicio/index.html) from 2009 to 2019.
 
 ## Intended uses and limitations
 You can use the raw model for text generation or fine-tune it to a downstream task.