kromeurus's picture
Upload folder using huggingface_hub
f545d6b verified
|
raw
history blame
985 Bytes
---
base_model: []
library_name: transformers
tags:
- mergekit
- merge
---
# siithamov3
This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
## Merge Details
### Merge Method
This model was merged using the breadcrumbs_ties merge method using merge/siithamol3.1 as a base.
### Models Merged
The following models were included in the merge:
* merge/formaxext.3.1
### Configuration
The following YAML configuration was used to produce this model:
```yaml
base_model: merge/siithamol3.1
dtype: float32
merge_method: breadcrumbs_ties
out_dtype: bfloat16
parameters:
int8_mask: 1.0
normalize: 0.0
slices:
- sources:
- layer_range: [0, 32]
model: merge/siithamol3.1
parameters:
density: 0.9
gamma: 0.01
weight: [0.5, 0.0, 8.0, 0.8, 0.9, 1.0]
- layer_range: [0, 32]
model: merge/formaxext.3.1
parameters:
density: 0.9
gamma: 0.01
weight: [0.5, 0.2, 0.2, 0.1, 0.0]
```