--- base_model: - djuna/L3.1-Romes-Ninomos - djuna/L3.1-gramamax library_name: transformers tags: - mergekit - merge --- # merge This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit). ## Merge Details ### Merge Method This model was merged using the breadcrumbs_ties merge method using [djuna/L3.1-gramamax](https://huggingface.co/djuna/L3.1-gramamax) as a base. ### Models Merged The following models were included in the merge: * [djuna/L3.1-Romes-Ninomos](https://huggingface.co/djuna/L3.1-Romes-Ninomos) ### Configuration The following YAML configuration was used to produce this model: ```yaml models: - model: djuna/L3.1-gramamax parameters: weight: [0.5, 0.6, 0.3, 0.9] density: 0.9 gamma: 0.1 - model: djuna/L3.1-Romes-Ninomos parameters: weight: [0.5, 0.4, 0.7, 0.1] density: 0.9 gamma: 0.1 base_model: djuna/L3.1-gramamax parameters: normalize: false tokenizer_source: base merge_method: breadcrumbs_ties out_dtype: bfloat16 ```