Quant-Cartel
/

Llama-3.05-Nemotron-Tenyxchat-Storybreaker-70B-iMat-GGUF

Text Generation

Inference Endpoints

Model card Files Files and versions Community

InferenceIllusionist commited on 20 days ago

Commit

0361c1a

•

1 Parent(s): 9843c49

Update README.md

Files changed (1) hide show

README.md +64 -3

README.md CHANGED Viewed

@@ -1,3 +1,64 @@
----
-license: cc-by-nc-4.0
----

+---
+license: cc-by-nc-4.0
+base_model_relation: quantized
+quantized_by: Quant-Cartel
+base_model: Envoid/Llama-3.05-Nemotron-Tenyxchat-Storybreaker-70B
+pipeline_tag: text-generation
+tags:
+- iMat
+- GGUF
+- merge
+---
+```
+  e88 88e                               d8
+ d888 888b  8888 8888  ,"Y88b 888 8e   d88
+C8888 8888D 8888 8888 "8" 888 888 88b d88888
+ Y888 888P  Y888 888P ,ee 888 888 888  888
+  "88 88"    "88 88"  "88 888 888 888  888
+      b
+      8b,
+  e88'Y88                  d8           888
+ d888  'Y  ,"Y88b 888,8,  d88    ,e e,  888
+C8888     "8" 888 888 "  d88888 d88 88b 888
+ Y888  ,d ,ee 888 888     888   888   , 888
+  "88,d88 "88 888 888     888    "YeeP" 888
+PROUDLY PRESENTS
+```
+# Llama-3.05-Nemotron-Tenyxchat-Storybreaker-70B-iMat-GGUF
+Quantized with love from fp16.
+Original model author: [Envoid](https://huggingface.co/Envoid/)
+* Importance Matrix calculated using [groups_merged.txt](https://github.com/ggerganov/llama.cpp/discussions/5263#discussioncomment-8395384)
+*  88 chunks
+*  n_ctx=512
+*  Calculation uses f16 precision model weights
+Original model README [here](https://huggingface.co/Envoid/Llama-3.05-Nemotron-Tenyxchat-Storybreaker-70B) and below:
+![](https://files.catbox.moe/07cjw5.jpg)
+# Llama-3.05-Nemotron-Tenyxchat-Storybreaker-70B
+is a 40/60 SLERP Merge of [Envoid/Llama-3-TenyxChat-DaybreakStorywriter-70B](https://huggingface.co/Envoid/Llama-3-TenyxChat-DaybreakStorywriter-70B?not-for-all-audiences=true) onto [nvidia/Llama-3.1-Nemotron-70B-Instruct-HF](https://huggingface.co/nvidia/Llama-3.1-Nemotron-70B-Instruct-HF) utilizing the following config:
+```
+models:
+  - model: ./Envoid_Llama-3-TenyxChat-DaybreakStorywriter-70B
+  - model: ./nvidia_Llama-3.1-Nemotron-70B-Instruct-HF
+merge_method: slerp
+base_model: ./nvidia_Llama-3.1-Nemotron-70B-Instruct-HF
+parameters:
+  t:
+    - value: 0.4
+dtype: bfloat16
+```
+## Caution: As is always the case with SLERP merges there may be edge cases inwhich certain unintended model behaviors emerge. So always use with caution.
+The 'sloppiness' of Nemotron seems to be somewhat reigned in (but still exists) while maintaining its personable assistant personality and safety (In assistant mode it will still prompt you with a warning before producing sensitive content).
+Overall it provides a solid option for RP and creative writing while still functioning as an assistant model, if desired. If used to continue a roleplay it will generally follow the ongoing cadence of the conversation.
+### It utilizes the Llama 3 prompt format.