ZeroXClem
/

Llama3.1-Hermes3-SuperNova-8B-L3.1-Purosani-2-8B-Q4_0-GGUF

@@ -15,42 +15,126 @@ tags:
 This model was converted to GGUF format from [`ZeroXClem/Llama3.1-Hermes3-SuperNova-8B-L3.1-Purosani-2-8B`](https://huggingface.co/ZeroXClem/Llama3.1-Hermes3-SuperNova-8B-L3.1-Purosani-2-8B) using llama.cpp via the ggml.ai's [GGUF-my-repo](https://huggingface.co/spaces/ggml-org/gguf-my-repo) space.
 Refer to the [original model card](https://huggingface.co/ZeroXClem/Llama3.1-Hermes3-SuperNova-8B-L3.1-Purosani-2-8B) for more details on the model.
-## Use with llama.cpp
-Install llama.cpp through brew (works on Mac and Linux)
-```bash
-brew install llama.cpp
-```
-Invoke the llama.cpp server or the CLI.
-### CLI:
-```bash
-llama-cli --hf-repo ZeroXClem/Llama3.1-Hermes3-SuperNova-8B-L3.1-Purosani-2-8B-Q4_0-GGUF --hf-file llama3.1-hermes3-supernova-8b-l3.1-purosani-2-8b-q4_0.gguf -p "The meaning to life and the universe is"
-```
-### Server:
-```bash
-llama-server --hf-repo ZeroXClem/Llama3.1-Hermes3-SuperNova-8B-L3.1-Purosani-2-8B-Q4_0-GGUF --hf-file llama3.1-hermes3-supernova-8b-l3.1-purosani-2-8b-q4_0.gguf -c 2048
-```
-Note: You can also use this checkpoint directly through the [usage steps](https://github.com/ggerganov/llama.cpp?tab=readme-ov-file#usage) listed in the Llama.cpp repo as well.
-Step 1: Clone llama.cpp from GitHub.
-```
-git clone https://github.com/ggerganov/llama.cpp
-```
-Step 2: Move into the llama.cpp folder and build it with `LLAMA_CURL=1` flag along with other hardware-specific flags (for ex: LLAMA_CUDA=1 for Nvidia GPUs on Linux).
-```
-cd llama.cpp && LLAMA_CURL=1 make
-```
-Step 3: Run inference through the main binary.
-```
-./llama-cli --hf-repo ZeroXClem/Llama3.1-Hermes3-SuperNova-8B-L3.1-Purosani-2-8B-Q4_0-GGUF --hf-file llama3.1-hermes3-supernova-8b-l3.1-purosani-2-8b-q4_0.gguf -p "The meaning to life and the universe is"
-```
-or
-```
-./llama-server --hf-repo ZeroXClem/Llama3.1-Hermes3-SuperNova-8B-L3.1-Purosani-2-8B-Q4_0-GGUF --hf-file llama3.1-hermes3-supernova-8b-l3.1-purosani-2-8b-q4_0.gguf -c 2048
 ```

 This model was converted to GGUF format from [`ZeroXClem/Llama3.1-Hermes3-SuperNova-8B-L3.1-Purosani-2-8B`](https://huggingface.co/ZeroXClem/Llama3.1-Hermes3-SuperNova-8B-L3.1-Purosani-2-8B) using llama.cpp via the ggml.ai's [GGUF-my-repo](https://huggingface.co/spaces/ggml-org/gguf-my-repo) space.
 Refer to the [original model card](https://huggingface.co/ZeroXClem/Llama3.1-Hermes3-SuperNova-8B-L3.1-Purosani-2-8B) for more details on the model.
+## Use with LMStudio or llama.cpp
+Install LMStudio through their website: [`LM Studio`](https://lmstudio.ai/)
+or
+Install llama.cpp through their instructions on Github (works on Mac and Linux) [`Llama.cpp Usage`](https://github.com/ggerganov/llama.cpp/blob/master/README.md#usage)
+# Original Model Card:
+**ZeroXClem/Llama3.1-Hermes3-SuperNova-8B-L3.1-Purosani-2-8B** is a cutting-edge merged model that blends the best features of two highly optimized architectures to create an **advanced**, **adaptive**, and **powerful** model. Whether for scientific research, complex instruction-following, or immersive roleplay scenarios, this model excels at every task it’s thrown into.
+## 🌟 Family Tree
+This model is a merger of the following:
+- [**kotyKD/Llama3.1-Hermes3-SuperNovaLite-merged-with-base-8B**](https://huggingface.co/kotyKD/Llama3.1-Hermes3-SuperNovaLite-merged-with-base-8B)
+- [**djuna/L3.1-Purosani-2-8B**](https://huggingface.co/djuna/L3.1-Purosani-2-8B)
+These parent models are themselves the result of **complex merges** of various high-performance models, making ZeroXClem/Llama3.1-Hermes3-SuperNova-8B-L3.1-Purosani-2-8B a **super hybrid** capable of handling diverse tasks with efficiency and finesse.
+## 🧬 Detailed Model Lineage
+### **A: kotyKD/Llama3.1-Hermes3-SuperNovaLite-merged-with-base-8B**
+Merged using the **TIES merge method**, this model utilizes **unsloth/Meta-Llama-3.1-8B** as its base, combining:
+- **arcee-ai/Llama-3.1-SuperNova-Lite**: A distilled 8B parameter version of the **Llama-3.1-405B-Instruct** model, designed to maintain high performance while minimizing resource consumption. Its training, via **EvolKit**, offers instruction-following precision and domain-specific adaptability.
+- **NousResearch/Hermes-3-Llama-3.1-8B**: Known for its robustness, this model enhances long-range contextual understanding, making it ideal for complex, multi-layered tasks.
+### **B: djuna/L3.1-Purosani-2-8B**
+This merge incorporates:
+- **hf-100/Llama-3-Spellbound-Instruct-8B-0.3**
+- **arcee-ai/Llama-3.1-SuperNova-Lite**
+- **grimjim/Llama-3-Instruct-abliteration-LoRA-8B**
+- **THUDM/LongWriter-llama3.1-8B**, capable of generating over **10,000 words** in one pass, making it perfect for long-form content generation.
+Further contributors include **ResplendentAI/Smarts_Llama3** and **djuna/L3.1-Suze-Vume-2-calc**, making this model highly adaptable to a broad range of applications.
+## 🛠️ Merge Details
+The model was merged using the **della merge method** with **kromeurus/L3.1-Aglow-Vulca-v0.1-8B** as the base. This method, combined with the following models, ensures both **precision** and **adaptability**:
+- **djuna/L3.1-Noraian**
+- **Casual-Autopsy/L3-Super-Nova-RP-8B**
+- **TheDrummer/Llama-3SOME-8B-v2**
+- **djuna/L3.1-ForStHS**
+- **Blackroot/Llama-3-8B-Abomination-LORA**
+## 🔧 Technical Configuration
+The merging process used advanced methods to ensure smooth integration and consistent performance across various tasks:
+```yaml
+slices:
+  - sources:
+      - model: kotyKD/Llama3.1-Hermes3-SuperNovaLite-merged-with-base-8B
+        layer_range: [0, 32]
+      - model: djuna/L3.1-Purosani-2-8B
+        layer_range: [0, 32]
+merge_method: slerp
+base_model: djuna/L3.1-Purosani-2-8B
+parameters:
+  t:
+    - filter: self_attn
+      value: [0, 0.5, 0.3, 0.7, 1]
+    - filter: mlp
+      value: [1, 0.5, 0.7, 0.3, 0]
+    - value: 0.5
+dtype: bfloat16
 ```
+## 🎯 Extended Support for Roleplay & Immersive Storytelling
+**ZeroXClem/Llama3.1-Hermes3-SuperNova-8B-L3.1-Purosani-2-8B** has been optimized for **extended roleplay support**, making it an exceptional choice for **interactive storytelling** and **deep character development**. With its ability to understand long-form context and generate cohesive responses over extensive interactions, this model excels in:
+- **Character-driven interactions**: Develop rich, nuanced personalities that respond in believable and engaging ways.
+- **World-building & Lore creation**: Create vast, interconnected universes with intricate lore, all generated in real-time.
+- **Dynamic NPC dialogues**: Use the model to generate complex, reactive conversations for game NPCs, offering a fluid, immersive experience for players.
+## 🚀 Key Features & Capabilities
+### **Advanced Roleplay and Long-Form Content Generation**
+With models like **THUDM/LongWriter-llama3.1-8B** contributing their expertise, this model is perfect for generating **long-form narratives** while maintaining coherence and creativity.
+### **Instruction Following & Task Adaptability**
+Combining the capabilities of **Hermes** and **SuperNovaLite**, this model can efficiently follow detailed instructions, making it ideal for:
+- **Task automation**
+- **Virtual assistants**
+- **Research generation**
+### **Efficiency Without Compromise**
+Distilled models like **SuperNovaLite** ensure that this model delivers high performance without the extensive resource requirements of larger models.
+## 🎯 Use Case & Applications
+- **Roleplay & Interactive Storytelling**: The perfect companion for storytellers, RPG enthusiasts, and game developers. Whether crafting dynamic NPC interactions or generating deep, immersive worlds, this model can handle it all.
+- **Instruction-based AI**: With enhanced instruction-following abilities, this model is ideal for developing intelligent assistants or chatbots that require high accuracy and quick adaptability.
+- **Long-Form Writing**: From novels to research papers, this model can generate lengthy, well-structured content with ease, thanks to its extensive training on long-form data.
+## 📜 License
+This model is open-sourced under the **Apache-2.0 License**, allowing others to use and modify it freely, as long as they give proper attribution.
+## 💡 Tags
+- merge
+- mergekit
+- lazymergekit
+- Hermes3
+- SuperNovaLite
+- Purosani
+- Llama3.1
+- kotyKD/Llama3.1-Hermes3-SuperNovaLite-merged-with-base-8B
+- djuna/L3.1-Purosani-2-8B
+- instruction-following
+- long-form-generation
+- roleplay
+- storytelling