Update README.md
README.md
CHANGED
@@ -1,37 +1,19 @@
----
-base_model: llama2_7b_darulm_unigram_init_tie_16_11_23
-tags:
-- generated_from_trainer
-metrics:
-- accuracy
-model-index:
-- name: llama2_7b_darulm_unigram_tie_2e_16_11_23
-  results: []
----
 
-<!-- This model card has been generated automatically according to the information the Trainer had access to. You
-should probably proofread and complete it, then remove this comment. -->
+# TheBloke/Llama-2-13B-fp16
 
-# llama2_7b_darulm_unigram_tie_2e_16_11_23
-
-This model is a fine-tuned version of [llama2_7b_darulm_unigram_init_tie_16_11_23](https://huggingface.co/llama2_7b_darulm_unigram_init_tie_16_11_23) on the None dataset.
+This model is a fine-tuned (embeddings, lm head) version of TheBloke/Llama-2-7B-fp16 on the Russian dataset (33 GB).
 It achieves the following results on the evaluation set:
 - Loss: 2.7569
 - Accuracy: 0.4617
 
 ## Model description
 
-More information needed
+Russian adaptation of LLaMa-2-7B by replacing the tokenizer.
+Paper: Tikhomirov M.M., Chernyshev D.I., Impact of Tokenization on LLaMa Russian Adaptation (to appear)
 
 ## Intended uses & limitations
 
-More information needed
-
-## Training and evaluation data
-
-More information needed
-
-## Training procedure
+LLAMA 2 COMMUNITY LICENSE AGREEMENT
 
 ### Training hyperparameters
 
@@ -336,4 +318,4 @@ The following hyperparameters were used during training:
 - Transformers 4.34.0
 - Pytorch 2.0.1+cu118
 - Datasets 2.14.5
-- Tokenizers 0.14.1
+- Tokenizers 0.14.1
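
For readers of the updated card, a minimal usage sketch with Transformers is given below. The repository id is a placeholder inferred from the model-index name in the removed front matter, not a confirmed Hub path, and the generation settings are illustrative.

```python
# Minimal usage sketch (placeholder repo id; adjust to the actual Hub path).
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

repo_id = "llama2_7b_darulm_unigram_tie_2e_16_11_23"  # placeholder, not a confirmed Hub path

# The adaptation replaced the tokenizer, so load it from this repo rather than
# from the original TheBloke/Llama-2-7B-fp16 checkpoint.
tokenizer = AutoTokenizer.from_pretrained(repo_id)
model = AutoModelForCausalLM.from_pretrained(
    repo_id,
    torch_dtype=torch.float16,
    device_map="auto",  # requires the accelerate package
)

prompt = "Вопрос: что такое машинное обучение?\nОтвет:"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
with torch.no_grad():
    output = model.generate(**inputs, max_new_tokens=64, do_sample=True, top_p=0.9)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```

The phrase "fine-tuned (embeddings, lm head) version ... by replacing the tokenizer" suggests a recipe in which only the token embeddings and the LM head are trained after the vocabulary swap. The sketch below is an assumption about that procedure, not the authors' training code; the tokenizer path is hypothetical.

```python
# Assumed adaptation recipe: swap in a new (Russian unigram) tokenizer,
# resize the embedding matrix, and train only embeddings + LM head.
from transformers import AutoModelForCausalLM, AutoTokenizer

base_id = "TheBloke/Llama-2-7B-fp16"
new_tokenizer = AutoTokenizer.from_pretrained("path/to/russian_unigram_tokenizer")  # hypothetical path

model = AutoModelForCausalLM.from_pretrained(base_id)
model.resize_token_embeddings(len(new_tokenizer))  # fresh rows for the new vocabulary

# Freeze everything, then unfreeze the input embeddings and the LM head
# (with tied weights, as the "tie" in the model name suggests, these share storage).
for param in model.parameters():
    param.requires_grad = False
for module in (model.get_input_embeddings(), model.get_output_embeddings()):
    for param in module.parameters():
        param.requires_grad = True

# From here, a standard causal-LM training run on the Russian corpus would
# update only those parameters.
```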