--- base_model: - Darkknight535/Moonlight-L3-15B-v2-64k - Tremontaine/L3-Lunaris-v1-15B library_name: transformers tags: - mergekit - merge - not-for-all-audiences --- ### MoonLight-L3-15B-V2.5-64K --- ![Waifu](https://huggingface.co/Darkknight535/Moonlight-L3-15B-v2-64k/resolve/main/waifu.png) --- - **Enhancements**: - **Merging**: Merge with lunaris to fix some issues. Note : This model is experimental so feedback is needed. --- ### Quants (Thanks to mradermacher) [Imatrix](https://huggingface.co/mradermacher/Moonlight-L3-15B-v2.5-64k-i1-GGUF) [Normal](https://huggingface.co/mradermacher/Moonlight-L3-15B-v2.5-64k-GGUF) ### Instruct Template Default llama3 instruct and context preset, but here is the one i use. [Instruct](https://huggingface.co/Darkknight535/OpenCrystal-12B-L3/blob/main/Llama%203%20%5BInstruct%5D.json) [Context](https://huggingface.co/Darkknight535/OpenCrystal-12B-L3/blob/main/Llama%203%20%5BContext%5D.json) ### Samplers ## Creative ``` Temp : 1.2 Min P : 0.01 Repetition Penalty : 1.05 OR Dry 0.7, 1.75, 2,0 XTC 0.1, 0.5 [And everything else neutral] ``` ## Normal(Precise) ``` Temp : 0.6 - 0.8 Min P : 0.1 Repetition Penalty : 1.1 OR Dry 1, 1.75, 2,0 XTC 0.1, 0.2 [And everything else neutral] ``` ### System Prompt ``` Develop the plot slowly, always stay in character. Describe all actions in full, elaborate, explicit, graphic, and vivid detail. Mention all relevant sensory perceptions. Keep the story immersive and engaging. Speak as other person when needed and prefix with the name of person you're speaking as except {{user}}. ``` ### FeedBack [FeedBack here](https://huggingface.co/Darkknight535/Moonlight-L3-15B-v2.5-64k/discussions/1) # merge This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit). ## Merge Details ### Merge Method This model was merged using the SLERP merge method. ### Models Merged The following models were included in the merge: * [Darkknight535/Moonlight-L3-15B-v2-64k](https://huggingface.co/Darkknight535/Moonlight-L3-15B-v2-64k) * [Tremontaine/L3-Lunaris-v1-15B](https://huggingface.co/Tremontaine/L3-Lunaris-v1-15B) ### Configuration The following YAML configuration was used to produce this model: ```yaml slices: - sources: - model: Darkknight535/Moonlight-L3-15B-v2-64k layer_range: [0, 64] - model: Tremontaine/L3-Lunaris-v1-15B layer_range: [0, 64] merge_method: slerp base_model: Darkknight535/Moonlight-L3-15B-v2-64k parameters: t: - filter: self_attn value: [0, 0.5, 0.3, 0.7, 1] - filter: mlp value: [1, 0.5, 0.7, 0.3, 0] - value: 0.5 # fallback for rest of tensors dtype: bfloat16 ```