RefalMachine committed
Commit: ee1059f
Parent: 88c6a4e

Update README.md

Files changed (1): README.md (+27 −2)

README.md CHANGED
@@ -10,13 +10,38 @@ base_model:
 - RefalMachine/ruadapt_qwen2.5_3B_ext_u48_full_lr5e4_peft_mlp_32_32_bs256
 ---
 
-# Model description
+### Model description
 
 Instruction-tuned version of RefalMachine/ruadapt_qwen2.5_3B_ext_u48_full_lr5e4_peft_mlp_32_32_bs256 with an extended tokenizer, obtained via the LEP (Learned Embedding Propagation, paper coming soon) procedure.
 
 Thanks to the extended tokenizer, the model works more efficiently with the Russian language (up to a 60% speed-up over Qwen-2.5-3B-Instruct, measured in characters).
 
-# How to cite:
+### Metrics and evaluation
+
+#### Results on Ru-Arena-General
+
+The reference answers against which the models are compared come from gpt-3.5-turbo-0125, which is why that model has a 50% winrate.
+
+Only part of the leaderboard is shown here; see the benchmark repository for the full results.
+
+| Model Name                                     | Winrate  | 95% CI      | Average # Tokens |
+|------------------------------------------------|----------|-------------|------------------|
+| gpt-4-1106-preview                             | 90.9     | (-1.3, 1.0) | 541              |
+| gpt-4o-mini                                    | 83.9     | (-1.8, 1.1) | 448              |
+| vikhr-nemo-12b-instruct-r-21-09-24             | 79.8     | (-2.2, 1.9) | 627              |
+| gemma-2-9b-it-sppo-iter3                       | 73.6     | (-1.6, 2.2) | 509              |
+| gemma-2-9b-it                                  | 69.2     | (-2.5, 1.9) | 459              |
+| saiga_llama3_8b_v7                             | 67.6     | (?, ?)      | 503              |
+| **ruadapt_qwen2.5_3B_ext_u48_instruct_v4**     | **66.1** | **(?, ?)**  | **531**          |
+| t-lite-instruct-0.1                            | 64.7     | (-2.1, 1.7) | 810              |
+| vikhr-llama3.1-8b-instruct-r-21-09-24          | 63.4     | (-2.1, 2.5) | 618              |
+| suzume-llama-3-8B-multilingual-orpo-borda-half | 57.1     | (-1.9, 2.2) | 682              |
+| mistral-nemo-instruct-2407                     | 50.5     | (-2.7, 2.6) | 403              |
+| gpt-3.5-turbo-0125                             | 50.0     | (0.0, 0.0)  | 220              |
+| c4ai-command-r-v01                             | 49.0     | (-1.7, 2.2) | 529              |
+| meta-llama-3.1-8b-instruct                     | 43.1     | (-2.8, 2.3) | 628              |
+
+### How to cite:
 
 Tikhomirov M., Chernyshev D. Facilitating large language model Russian adaptation with Learned Embedding Propagation // 2024 (paper coming soon)
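
The "up to 60% speed-up ... measured in characters" claim rests on a simple metric: how many characters of Russian text one generated token covers on average. A minimal sketch of that arithmetic is below. The real measurement would use the two models' actual tokenizers (e.g. loaded with Hugging Face `AutoTokenizer`); here toy tokenize functions stand in so the example is self-contained, and the function names are illustrative, not from the model repository.

```python
# Sketch of a characters-per-token efficiency comparison. A tokenizer with
# poor Cyrillic coverage emits many short tokens per Russian word; an
# extended tokenizer covers the same text with fewer tokens, so character
# throughput rises proportionally.

def chars_per_token(text: str, tokenize) -> float:
    """Average number of characters covered by one token."""
    tokens = tokenize(text)
    return len(text) / len(tokens)

def speedup_percent(adapted: float, baseline: float) -> float:
    """Relative character-throughput gain of the adapted tokenizer, in %."""
    return (adapted / baseline - 1.0) * 100.0

# Toy stand-ins: the baseline splits text into ~2-character pieces, while
# the adapted tokenizer keeps whole words.
text = "пример русского текста"
baseline_tok = lambda s: [s[i:i + 2] for i in range(0, len(s), 2)]
adapted_tok = lambda s: s.split()

b = chars_per_token(text, baseline_tok)
a = chars_per_token(text, adapted_tok)
print(f"baseline: {b:.2f} chars/token, adapted: {a:.2f} chars/token")
print(f"speed-up: {speedup_percent(a, b):.0f}%")
```

With the real tokenizers the same two functions apply unchanged; only the `tokenize` callables differ.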