ai-forever
/

FRED-T5-1.7B

Text2Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

sberbank-ai commited on Jan 24, 2023

Commit

7458a9b

•

1 Parent(s): fc11dd7

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -16,7 +16,7 @@ It was trained on Russian language corpus (300GB).   The dataset is the same as
 Bbpe tokenizer. 50257 + special tokens 107. Prefix tokens: '\<LM\>', '\<SC1>',.. '\<SC6>'
-First half of the time model trained on the small part of all datasets (1%,3GB) and without prefixes in each task.
 For RSG, we trained as described in the T5 paper. First, we trained multitask for all tasks. Then we took the best checkpoint for the task and trained it further.
 RSG submit here https://russiansuperglue.com/login/submit_info/1936

 Bbpe tokenizer. 50257 + special tokens 107. Prefix tokens: '\<LM\>', '\<SC1>',.. '\<SC6>'
+First half of the time model trained on the small part of all dataset (1%,3GB) and without prefixes in each task.
 For RSG, we trained as described in the T5 paper. First, we trained multitask for all tasks. Then we took the best checkpoint for the task and trained it further.
 RSG submit here https://russiansuperglue.com/login/submit_info/1936