# T5-base pre-trained on cleaned Dutch mC4 🇳🇱

A [T5](https://ai.googleblog.com/2020/02/exploring-transfer-learning-with-t5.html) v1.1 base model pre-trained from scratch on [Dutch mC4](https://huggingface.co/datasets/yhavinga/mc4_nl_cleaned). This model achieves an evaluation accuracy of 0.78 and a loss of 0.96.

* Pre-trained T5 models need to be fine-tuned before they can be used for downstream tasks; therefore, the inference widget on the right has been turned off.
* For a version fine-tuned for summarization, see [yhavinga/t5-v1.1-base-dutch-cnn-test](https://huggingface.co/yhavinga/t5-v1.1-base-dutch-cnn-test).
* For a demo of the Dutch CNN summarization models, head over to the Hugging Face Spaces for the **[Netherformer 📰](https://huggingface.co/spaces/flax-community/netherformer)** example application!
* T5 paper: [Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer](https://arxiv.org/pdf/1910.10683.pdf)

![model image](https://camo.githubusercontent.com/623b4dea0b653f2ad3f36c71ebfe749a677ac0a1/68747470733a2f2f6d69726f2e6d656469756d2e636f6d2f6d61782f343030362f312a44304a31674e51663876727255704b657944387750412e706e67)

## Tokenizer
