Update README.md

README.md (CHANGED)

@@ -1,49 +1,44 @@
Removed:

```yaml
---
base_model:
library_name: transformers
tags:
- mergekit
- merge
---
```

```yaml
out_dtype: bfloat16
parameters:
  int8_mask: 1.0
  normalize: 0.0
slices:
- sources:
  - layer_range: [0, 32]
    model: merge/siithamol3.1
    parameters:
      density: 0.9
      gamma: 0.01
      weight: [0.5, 0.0, 8.0, 0.8, 0.9, 1.0]
  - layer_range: [0, 32]
    model: merge/formaxext.3.1
    parameters:
      density: 0.9
      gamma: 0.01
      weight: [0.5, 0.2, 0.2, 0.1, 0.0]
```
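The list-valued `weight` and `density` entries in the merge config above are mergekit "gradients": each list is spread across the layer range and interpolated to give one value per layer. A rough sketch of that expansion (the even anchor spacing is my assumption, not mergekit's actual code):

```python
import numpy as np

def expand_gradient(values, num_layers):
    # Space the listed anchor values evenly across the layer range,
    # then linearly interpolate to get one value per layer.
    # Sketch only: mergekit's exact spacing may differ.
    anchors = np.linspace(0, num_layers - 1, num=len(values))
    return np.interp(np.arange(num_layers), anchors, values)

# e.g. the formaxext.3.1 weight list over the 32-layer range
per_layer = expand_gradient([0.5, 0.2, 0.2, 0.1, 0.0], num_layers=32)
```

Read this way, `[0.5, 0.2, 0.2, 0.1, 0.0]` leans on formaxext.3.1 in the early layers and fades it out toward the top of the stack.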
Updated:

```yaml
---
base_model:
- ArliAI/ArliAI-Llama-3-8B-Formax-v1.0
- gradientai/Llama-3-8B-Instruct-Gradient-1048k
- ArliAI/Llama-3.1-8B-ArliAI-Formax-v1.0
- Sao10K/L3-8B-Niitama-v1
- Sao10K/L3-8B-Stheno-v3.3-32K
- tohur/natsumura-storytelling-rp-1.0-llama-3.1-8b
- Sao10K/L3-8B-Tamamo-v1
- vicgalle/Roleplay-Hermes-3-Llama-3.1-8B
library_name: transformers
tags:
- mergekit
- merge
---
```
HUZZAH, a model that's actually good! It only took seven tries. Fixed spatial understanding and literacy, toned down the clingy instruct following a little, and leaned further into turn-based RP.

### Quants

[A few GGUFs](https://huggingface.co/kromquant/L3.1-Siithamo-v0.4-8B-GGUFs) by me.

### Details & Recommended Settings

(Still testing; details subject to change)

Sticks to instructs well, writes dynamically, keeps generations roleplay-focused, and is more solidly intelligent. Less rambly, though it still outputs a fair bit of text. Has near-perfect recall up to 32K. Be clear and explicit with model instructs, including the intended format (asterisks, quotes, etc.).

Rec. Settings:
```
Template: L3
Temperature: 1.3
Min P: 0.1
Repeat Penalty: 1.05
Repeat Penalty Tokens: 256
Dyn Temp: 0.9-1.05 at 0.1
Smooth Sampl: 0.18
```
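For anyone wiring these values into their own sampler stack: Min P keeps only the tokens whose probability is at least `min_p` times that of the most likely token. A minimal pure-Python sketch of the filter (illustrative, not this card's code):

```python
import math

def min_p_keep(logits, min_p=0.1):
    # Softmax the logits, then keep the indices whose probability is
    # at least min_p * (probability of the top token).
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    probs = [e / total for e in exps]
    threshold = min_p * max(probs)
    return [i for i, p in enumerate(probs) if p >= threshold]

# With Min P = 0.1, only tokens within 10x of the top token's
# probability survive the cut.
kept = min_p_keep([5.0, 4.9, 1.0, -3.0], min_p=0.1)
```

Higher temperatures (like the 1.3 above) flatten the distribution, so a Min P floor is what keeps the tail from flooding in.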

### Merge Theory

This sucked.

### Config
|