---
base_model: ZeroXClem/Llama3.1-Hermes3-SuperNova-8B-L3.1-Purosani-2-8B
license: apache-2.0
tags:
- merge
- mergekit
- lazymergekit
- kotyKD/Llama3.1-Hermes3-SuperNovaLite-merged-with-base-8B
- djuna/L3.1-Purosani-2-8B
- llama-cpp
- gguf-my-repo
---
# ZeroXClem/Llama3.1-Hermes3-SuperNova-8B-L3.1-Purosani-2-8B-Q4_0-GGUF
This model was converted to GGUF format from [`ZeroXClem/Llama3.1-Hermes3-SuperNova-8B-L3.1-Purosani-2-8B`](https://huggingface.co/ZeroXClem/Llama3.1-Hermes3-SuperNova-8B-L3.1-Purosani-2-8B) using llama.cpp via ggml.ai's [GGUF-my-repo](https://huggingface.co/spaces/ggml-org/gguf-my-repo) space.
Refer to the [original model card](https://huggingface.co/ZeroXClem/Llama3.1-Hermes3-SuperNova-8B-L3.1-Purosani-2-8B) for more details on the model.
## Use with LMStudio or llama.cpp
Install LM Studio from its website: [`LM Studio`](https://lmstudio.ai/)
or
Install llama.cpp by following its instructions on GitHub (works on Mac and Linux): [`Llama.cpp Usage`](https://github.com/ggerganov/llama.cpp/blob/master/README.md#usage)
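Once llama.cpp is installed, the model can be pulled straight from the Hub by `llama-cli`. The sketch below only assembles the command line; it is an illustration, not an official recipe. The repo id comes from this card's title, while `GGUF_FILE` is a hypothetical placeholder (check the repo's file list for the real `.gguf` name), and the prompt and token count are arbitrary.

```python
import shlex

# Repo id taken from this card's title.
REPO = "ZeroXClem/Llama3.1-Hermes3-SuperNova-8B-L3.1-Purosani-2-8B-Q4_0-GGUF"
# Assumption: placeholder file name, replace with the actual .gguf file in the repo.
GGUF_FILE = "model-q4_0.gguf"


def llama_cli_command(prompt, n_predict=128):
    """Build an argv list for llama-cli that loads the GGUF file from the Hub."""
    return [
        "llama-cli",
        "--hf-repo", REPO,       # Hugging Face repo to download from
        "--hf-file", GGUF_FILE,  # which file inside that repo to load
        "-p", prompt,            # prompt text
        "-n", str(n_predict),    # number of tokens to generate
    ]


if __name__ == "__main__":
    argv = llama_cli_command("Describe a quiet harbor town at dawn.")
    # Print a shell-safe version of the command for copy and paste.
    print(shlex.join(argv))
    # To run it directly instead: subprocess.run(argv, check=True)
```

Building the command as a list keeps quoting of the prompt safe whether it is printed for a shell or passed to `subprocess.run`.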
# Original Model Card:
**ZeroXClem/Llama3.1-Hermes3-SuperNova-8B-L3.1-Purosani-2-8B** is a cutting-edge merged model that blends the best features of two highly optimized architectures to create an **advanced**, **adaptive**, and **powerful** model. Whether for scientific research, complex instruction-following, or immersive roleplay scenarios, this model excels at every task it's given.
## Family Tree
This model is a merger of the following:
- [**kotyKD/Llama3.1-Hermes3-SuperNovaLite-merged-with-base-8B**](https://huggingface.co/kotyKD/Llama3.1-Hermes3-SuperNovaLite-merged-with-base-8B)
- [**djuna/L3.1-Purosani-2-8B**](https://huggingface.co/djuna/L3.1-Purosani-2-8B)
These parent models are themselves the result of **complex merges** of various high-performance models, making ZeroXClem/Llama3.1-Hermes3-SuperNova-8B-L3.1-Purosani-2-8B a **super hybrid** capable of handling diverse tasks with efficiency and finesse.
## Model Family Genealogy
[View the ZeroXClem/Llama3.1-Hermes3-SuperNova-8B-L3.1-Purosani-2-8B Model Family Genealogy](https://imgur.com/a/oXMwVAj)
This image represents the complex lineage of our model, showcasing its rich heritage and the diverse range of capabilities it inherits from its ancestors.
## Detailed Model Lineage
### **A: kotyKD/Llama3.1-Hermes3-SuperNovaLite-merged-with-base-8B**
Merged using the **TIES merge method**, this model utilizes **unsloth/Meta-Llama-3.1-8B** as its base, combining:
- **arcee-ai/Llama-3.1-SuperNova-Lite**: A distilled 8B parameter version of the **Llama-3.1-405B-Instruct** model, designed to maintain high performance while minimizing resource consumption. Its training, via **EvolKit**, offers instruction-following precision and domain-specific adaptability.
- **NousResearch/Hermes-3-Llama-3.1-8B**: Known for its robustness, this model enhances long-range contextual understanding, making it ideal for complex, multi-layered tasks.
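For readers unfamiliar with TIES, the idea can be sketched on toy weight vectors: compute each fine-tuned model's difference from the base, trim small changes, elect a sign per parameter, and average only the changes that agree with that sign. This is a simplified illustration, not mergekit's implementation; the `density` and `lam` names and values are assumptions, and the settings actually used for this parent model are not stated in this card.

```python
def ties_merge(base, finetuned, density=0.5, lam=1.0):
    """Toy TIES merge over flat lists of floats (illustrative only)."""
    n = len(base)
    # 1. Task vectors: what each fine-tuned model changed relative to the base.
    deltas = [[w - b for w, b in zip(model, base)] for model in finetuned]

    # 2. Trim: keep only the largest-magnitude entries of each task vector.
    #    Ties at the threshold are all kept, so slightly more than k may survive.
    k = max(1, round(density * n))
    trimmed = []
    for delta in deltas:
        threshold = sorted((abs(x) for x in delta), reverse=True)[k - 1]
        trimmed.append([x if abs(x) >= threshold else 0.0 for x in delta])

    merged = []
    for i in range(n):
        column = [t[i] for t in trimmed]
        # 3. Elect sign: the direction with the larger total magnitude wins.
        positive = sum(column) >= 0
        # 4. Disjoint merge: average only the nonzero entries with the elected sign.
        agreeing = [x for x in column if x != 0.0 and (x > 0) == positive]
        merged.append(sum(agreeing) / len(agreeing) if agreeing else 0.0)

    # Add the merged task vector back onto the base weights.
    return [b + lam * m for b, m in zip(base, merged)]


if __name__ == "__main__":
    base = [0.0, 0.0, 0.0, 0.0]
    model_a = [1.0, -0.1, 0.5, 0.0]
    model_b = [0.8, 0.2, -0.9, 0.05]
    print(ties_merge(base, [model_a, model_b], density=0.5))
```

In the toy run, the third parameter shows the sign election at work: the two models disagree on direction, so only the larger change contributes to the result.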
### **B: djuna/L3.1-Purosani-2-8B**
This merge incorporates:
- **hf-100/Llama-3-Spellbound-Instruct-8B-0.3**
- **arcee-ai/Llama-3.1-SuperNova-Lite**
- **grimjim/Llama-3-Instruct-abliteration-LoRA-8B**
- **THUDM/LongWriter-llama3.1-8B**, capable of generating over **10,000 words** in one pass, making it perfect for long-form content generation.
Further contributors include **ResplendentAI/Smarts_Llama3** and **djuna/L3.1-Suze-Vume-2-calc**, making this model highly adaptable to a broad range of applications.
## Merge Details
The model was merged using the **della merge method** with **kromeurus/L3.1-Aglow-Vulca-v0.1-8B** as the base, together with the following models:
- **djuna/L3.1-Noraian**
- **Casual-Autopsy/L3-Super-Nova-RP-8B**
- **TheDrummer/Llama-3SOME-8B-v2**
- **djuna/L3.1-ForStHS**
- **Blackroot/Llama-3-8B-Abomination-LORA**
## Technical Configuration
The two parent models were combined with mergekit's `slerp` method across layers 0 to 32, using separate interpolation schedules for the self-attention and MLP weights and a default factor of 0.5 for everything else:
```yaml
slices:
  - sources:
      - model: kotyKD/Llama3.1-Hermes3-SuperNovaLite-merged-with-base-8B
        layer_range: [0, 32]
      - model: djuna/L3.1-Purosani-2-8B
        layer_range: [0, 32]
merge_method: slerp
base_model: djuna/L3.1-Purosani-2-8B
parameters:
  t:
    - filter: self_attn
      value: [0, 0.5, 0.3, 0.7, 1]
    - filter: mlp
      value: [1, 0.5, 0.7, 0.3, 0]
    - value: 0.5
dtype: bfloat16
```
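To make the `t` schedules in the configuration above more concrete, here is a minimal sketch of the two ideas involved: spreading a list of anchor values across layer depth, and spherical linear interpolation (SLERP) between two weight vectors. It assumes that list-valued parameters are interpolated linearly over layer position and that `t = 0` yields the `base_model` weights while `t = 1` yields the other model; both are assumptions about mergekit's behavior, and the helper names `layer_t` and `slerp` are illustrative, not mergekit's API.

```python
import math


def layer_t(anchors, layer_idx, num_layers):
    """Linearly interpolate a list of anchor values across layer depth."""
    if len(anchors) == 1 or num_layers == 1:
        return float(anchors[0])
    # Map the layer index onto the anchor list's index range.
    position = layer_idx / (num_layers - 1) * (len(anchors) - 1)
    lo = int(math.floor(position))
    hi = min(lo + 1, len(anchors) - 1)
    frac = position - lo
    return (1 - frac) * anchors[lo] + frac * anchors[hi]


def slerp(t, v0, v1, eps=1e-8):
    """Spherical linear interpolation between two flat weight vectors."""
    norm0 = math.sqrt(sum(x * x for x in v0))
    norm1 = math.sqrt(sum(x * x for x in v1))
    # Cosine of the angle between the two directions, clamped for acos.
    dot = sum(a * b for a, b in zip(v0, v1)) / max(norm0 * norm1, eps)
    dot = max(-1.0, min(1.0, dot))
    # Nearly parallel vectors: fall back to plain linear interpolation.
    if abs(dot) > 1.0 - 1e-6:
        return [(1 - t) * a + t * b for a, b in zip(v0, v1)]
    theta = math.acos(dot)
    s0 = math.sin((1 - t) * theta) / math.sin(theta)
    s1 = math.sin(t * theta) / math.sin(theta)
    return [s0 * a + s1 * b for a, b in zip(v0, v1)]


if __name__ == "__main__":
    self_attn_t = [0, 0.5, 0.3, 0.7, 1]  # schedule from the config above
    for layer in (0, 8, 16, 24, 31):
        print(layer, round(layer_t(self_attn_t, layer, 32), 3))
    print(slerp(0.5, [1.0, 0.0], [0.0, 1.0]))
```

Under these assumptions, the self-attention and MLP schedules mirror each other: early-layer attention weights lean toward one parent while early-layer MLP weights lean toward the other, with the balance reversing toward the final layers.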
## Extended Support for Roleplay & Immersive Storytelling
**ZeroXClem/Llama3.1-Hermes3-SuperNova-8B-L3.1-Purosani-2-8B** has been optimized for **extended roleplay support**, making it an exceptional choice for **interactive storytelling** and **deep character development**. With its ability to understand long-form context and generate cohesive responses over extensive interactions, this model excels in:
- **Character-driven interactions**: Develop rich, nuanced personalities that respond in believable and engaging ways.
- **World-building & Lore creation**: Create vast, interconnected universes with intricate lore, all generated in real-time.
- **Dynamic NPC dialogues**: Use the model to generate complex, reactive conversations for game NPCs, offering a fluid, immersive experience for players.
## Key Features & Capabilities
### **Advanced Roleplay and Long-Form Content Generation**
With models like **THUDM/LongWriter-llama3.1-8B** contributing their expertise, this model is perfect for generating **long-form narratives** while maintaining coherence and creativity.
### **Instruction Following & Task Adaptability**
Combining the capabilities of **Hermes** and **SuperNovaLite**, this model can efficiently follow detailed instructions, making it ideal for:
- **Task automation**
- **Virtual assistants**
- **Research generation**
### **Efficiency Without Compromise**
Distilled models like **SuperNovaLite** ensure that this model delivers high performance without the extensive resource requirements of larger models.
## Use Case & Applications
- **Roleplay & Interactive Storytelling**: The perfect companion for storytellers, RPG enthusiasts, and game developers. Whether crafting dynamic NPC interactions or generating deep, immersive worlds, this model can handle it all.
- **Instruction-based AI**: With enhanced instruction-following abilities, this model is ideal for developing intelligent assistants or chatbots that require high accuracy and quick adaptability.
- **Long-Form Writing**: From novels to research papers, this model can generate lengthy, well-structured content, drawing on the long-form generation ability inherited from **THUDM/LongWriter-llama3.1-8B**.
## License
This model is open-sourced under the **Apache-2.0 License**, allowing others to use and modify it freely, as long as they give proper attribution.
## Tags
- merge
- mergekit
- lazymergekit
- Hermes3
- SuperNovaLite
- Purosani
- Llama3.1
- kotyKD/Llama3.1-Hermes3-SuperNovaLite-merged-with-base-8B
- djuna/L3.1-Purosani-2-8B
- instruction-following
- long-form-generation
- roleplay
- storytelling