sberbank-ai committed
Commit • 4d53b21
Parent(s): 6b9d45c
Update README.md

README.md CHANGED
@@ -41,14 +41,14 @@ The model was prepared as a baseline for FusionBrain Challenge 2.0 (as a part of
 
 ### Parameters
 
-<img src=https://github.com/ai-forever/ru-dolph/blob/master/pics/scheme-rudolph_27B.jpg
+<img src=https://github.com/ai-forever/ru-dolph/blob/master/pics/scheme-rudolph_27B.jpg height="60" border="2"/>
 
 The maximum sequence length that this model may be used with depends on the modality and stands for 384 - 576 - 128 for the left text tokens, image tokens, and right text tokens, respectively.
 
 RUDOLPH 2.7B is a Transformer-based decoder model with the following parameters:
 
-* num
-* hidden
+* num\_layers (32) – Number of hidden layers in the Transformer decoder.
+* hidden\_size (2560) – Dimensionality of the hidden layers.
 * num\_attention\_heads (32) – Number of attention heads for each attention layer.
 
 ### Sparse Attention Mask
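For reference, the numbers introduced in this hunk fit together as a single decoder configuration. The sketch below is a minimal illustration of how they relate; `RudolphConfig` and its field names are assumptions made for this example, not the actual API of the ru-dolph package.

```python
from dataclasses import dataclass

@dataclass
class RudolphConfig:
    """Hypothetical config mirroring the README's numbers (not the real ru-dolph API)."""
    num_layers: int = 32           # hidden layers in the Transformer decoder
    hidden_size: int = 2560        # dimensionality of the hidden layers
    num_attention_heads: int = 32  # attention heads per attention layer
    l_text_seq_length: int = 384   # left text tokens
    image_seq_length: int = 576    # image tokens
    r_text_seq_length: int = 128   # right text tokens

    @property
    def head_dim(self) -> int:
        # 2560 / 32 = 80 dimensions per attention head
        return self.hidden_size // self.num_attention_heads

    @property
    def total_seq_length(self) -> int:
        # 384 + 576 + 128 = 1088 tokens in a full left-text/image/right-text sequence
        return self.l_text_seq_length + self.image_seq_length + self.r_text_seq_length
```

With these values, each attention head operates on an 80-dimensional slice of the hidden state, and a complete multimodal sequence spans at most 1088 tokens.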