File size: 2,517 Bytes

d502fb0
fb2b2df
 
 
 
 
 
 
 
 
d502fb0
 
 
 
 
 
eff7142
 
d502fb0
fb2b2df
d502fb0
a12306c
d502fb0
fb2b2df
d502fb0
eff7142
d502fb0
fb2b2df
 
 
 
 
 
 
 
 
 
d502fb0
fb2b2df
d502fb0
fb2b2df
d502fb0
 
d22c836
 
 
 
 
 
 
 
 
 
d502fb0
 
d22c836
 
 
 
 
 
 
d502fb0
d22c836
 
 
 
 
 
 
 
 
 
 
 
 
d502fb0
d22c836
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
d502fb0
 
d22c836
d502fb0
d22c836
d502fb0
 
d22c836
 
 
 
 
 
 
 
d502fb0

---
base_model:
- ArliAI/ArliAI-Llama-3-8B-Formax-v1.0
- gradientai/Llama-3-8B-Instruct-Gradient-1048k
- ArliAI/Llama-3.1-8B-ArliAI-Formax-v1.0
- Sao10K/L3-8B-Niitama-v1
- Sao10K/L3-8B-Stheno-v3.3-32K
- tohur/natsumura-storytelling-rp-1.0-llama-3.1-8b
- Sao10K/L3-8B-Tamamo-v1
- vicgalle/Roleplay-Hermes-3-Llama-3.1-8B
library_name: transformers
tags:
- mergekit
- merge

---
Attempt number three (five) that aims to fix the overly chatty and flower language of v0.2. 
Updated version [here](https://huggingface.co/kromeurus/L3.1-Siithamo-v0.4-8B)

### Quants

[A few GGUFs](https://huggingface.co/kromquant/L3.1-Siithamo-v0.3-8B-GGUFs) by me.

### Details & Recommended Settings

Mid, same properties of v0.1 but suffers at long context.

Rec. Settings:
```
Template: L3
Temperature: 1.3
Min P: 0.1
Repeat Penalty: 1.05
Repeat Penalty Tokens: 256
Dyn Temp: 0.9-1.05 at 0.1
Smooth Sampl: 0.18
```

### Merge Theory

Can't be asked rn.

```yaml
slices:
- sources:
  - layer_range: [0, 16]
    model: ArliAI/ArliAI-Llama-3-8B-Formax-v1.0
- sources:
  - layer_range: [16, 32]
    model: gradientai/Llama-3-8B-Instruct-Gradient-1048k
parameters:
  int8_mask: true
merge_method: passthrough
dtype: float32
out_dtype: bfloat16
name: formax.ext
---
models: 
  - model: formax.ext
    parameters:
      weight: 1
base_model: ArliAI/Llama-3.1-8B-ArliAI-Formax-v1.0
parameters:
  normalize: false
  int8_mask: true
merge_method: dare_linear
dtype: float32
out_dtype: bfloat16
tokenizer_source: base
name: formaxext.3.1
---
models: 
  - model: Sao10K/L3-8B-Niitama-v1
    parameters:
      weight: 0.6
  - model: Sao10K/L3-8B-Stheno-v3.3-32K
    parameters:
      weight: 0.5
base_model: tohur/natsumura-storytelling-rp-1.0-llama-3.1-8b
parameters:
  normalize: false
  int8_mask: true
merge_method: dare_linear
dtype: float32
out_dtype: bfloat16
tokenizer_source: base
name: siith.3.1
---
models:
    - model: Sao10K/L3-8B-Tamamo-v1
    - model: siith.3.1
base_model: vicgalle/Roleplay-Hermes-3-Llama-3.1-8B
parameters:
  normalize: false
  int8_mask: true
merge_method: model_stock
dtype: float32
out_dtype: bfloat16
name: siithamol3.1
---
models: 
  - model: siithamol3.1
    parameters:
      weight: [0.5, 0.8, 0.9, 1]
      density: 0.9
      gamma: 0.01
  - model: formaxext.3.1
    parameters:
      weight: [0.5, 0.2, 0.1, 0]
      density: 0.9
      gamma: 0.01
base_model: siithamol3.1
parameters:
  normalize: false
  int8_mask: true
merge_method: breadcrumbs
dtype: float32
out_dtype: bfloat16
name: siithamov3
```