Update README.md
README.md
@@ -67,7 +67,7 @@ You should:
 
 
 ### Message length control
-Inspired by the previously named "Roleplay" preset in SillyTavern,
+Inspired by the previously named "Roleplay" preset in SillyTavern, with this
 version of LimaRP it is possible to append a length modifier to the response instruction
 sequence, like this:
 
@@ -111,13 +111,13 @@ training process closer to a full finetune. It's suggested to merge the adapter
 the base Mistral-7B-v0.1 model.
 
 ### Training hyperparameters
-- learning_rate: 0.
+- learning_rate: 0.001
 - lr_scheduler_type: cosine
+- num_epochs: 2
 - lora_r: 256
 - lora_alpha: 16
 - lora_dropout: 0.05
 - lora_target_linear: True
-- num_epochs: 2
 - bf16: True
 - tf32: True
 - load_in_8bit: True
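The first hunk's corrected sentence describes appending a length modifier to the response instruction sequence. A minimal sketch of what that could look like, assuming the Alpaca-style `### Response:` sequence and the `(length = medium)` modifier syntax documented in the LimaRP card (the helper function itself is hypothetical):

```python
def response_instruction(length=None):
    """Build a response instruction sequence, optionally with a
    LimaRP-style length modifier appended, e.g.
    '### Response: (length = medium)'."""
    base = "### Response:"
    if length is not None:
        base += f" (length = {length})"
    return base


# Without a modifier the sequence is unchanged; with one, the model is
# nudged toward the requested message length.
print(response_instruction())          # ### Response:
print(response_instruction("medium"))  # ### Response: (length = medium)
```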
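The second hunk pairs `learning_rate: 0.001` with `lr_scheduler_type: cosine`. A minimal sketch of a cosine decay schedule, not the trainer's actual implementation (real schedulers typically also handle warmup steps):

```python
import math

def cosine_lr(step, total_steps, base_lr=0.001):
    """Cosine-decay schedule: returns base_lr at step 0 and decays
    smoothly toward 0 at the final step."""
    return base_lr * 0.5 * (1 + math.cos(math.pi * step / total_steps))


# Starts at the configured learning rate, halves at the midpoint,
# and reaches ~0 at the end of training.
for step in (0, 50, 100):
    print(step, cosine_lr(step, total_steps=100))
```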