InferenceIllusionist commited on
Commit
0361c1a
1 Parent(s): 9843c49

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +64 -3
README.md CHANGED
@@ -1,3 +1,64 @@
1
- ---
2
- license: cc-by-nc-4.0
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: cc-by-nc-4.0
3
+ base_model_relation: quantized
4
+ quantized_by: Quant-Cartel
5
+ base_model: Envoid/Llama-3.05-Nemotron-Tenyxchat-Storybreaker-70B
6
+ pipeline_tag: text-generation
7
+ tags:
8
+ - iMat
9
+ - GGUF
10
+ - merge
11
+ ---
12
+ ```
13
+ e88 88e d8
14
+ d888 888b 8888 8888 ,"Y88b 888 8e d88
15
+ C8888 8888D 8888 8888 "8" 888 888 88b d88888
16
+ Y888 888P Y888 888P ,ee 888 888 888 888
17
+ "88 88" "88 88" "88 888 888 888 888
18
+ b
19
+ 8b,
20
+
21
+ e88'Y88 d8 888
22
+ d888 'Y ,"Y88b 888,8, d88 ,e e, 888
23
+ C8888 "8" 888 888 " d88888 d88 88b 888
24
+ Y888 ,d ,ee 888 888 888 888 , 888
25
+ "88,d88 "88 888 888 888 "YeeP" 888
26
+
27
+ PROUDLY PRESENTS
28
+ ```
29
+ # Llama-3.05-Nemotron-Tenyxchat-Storybreaker-70B-iMat-GGUF
30
+
31
+ Quantized with love from fp16.
32
+
33
+ Original model author: [Envoid](https://huggingface.co/Envoid/)
34
+
35
+ * Importance Matrix calculated using [groups_merged.txt](https://github.com/ggerganov/llama.cpp/discussions/5263#discussioncomment-8395384)
36
+ * 88 chunks
37
+ * n_ctx=512
38
+ * Calculation uses f16 precision model weights
39
+
40
+ Original model README [here](https://huggingface.co/Envoid/Llama-3.05-Nemotron-Tenyxchat-Storybreaker-70B) and below:
41
+
42
+ ![](https://files.catbox.moe/07cjw5.jpg)
43
+
44
+ # Llama-3.05-Nemotron-Tenyxchat-Storybreaker-70B
45
+
46
+ is a 40/60 SLERP Merge of [Envoid/Llama-3-TenyxChat-DaybreakStorywriter-70B](https://huggingface.co/Envoid/Llama-3-TenyxChat-DaybreakStorywriter-70B?not-for-all-audiences=true) onto [nvidia/Llama-3.1-Nemotron-70B-Instruct-HF](https://huggingface.co/nvidia/Llama-3.1-Nemotron-70B-Instruct-HF) utilizing the following config:
47
+ ```
48
+ models:
49
+ - model: ./Envoid_Llama-3-TenyxChat-DaybreakStorywriter-70B
50
+ - model: ./nvidia_Llama-3.1-Nemotron-70B-Instruct-HF
51
+ merge_method: slerp
52
+ base_model: ./nvidia_Llama-3.1-Nemotron-70B-Instruct-HF
53
+ parameters:
54
+ t:
55
+ - value: 0.4
56
+ dtype: bfloat16
57
+ ```
58
+ ## Caution: As is always the case with SLERP merges there may be edge cases inwhich certain unintended model behaviors emerge. So always use with caution.
59
+
60
+ The 'sloppiness' of Nemotron seems to be somewhat reigned in (but still exists) while maintaining its personable assistant personality and safety (In assistant mode it will still prompt you with a warning before producing sensitive content).
61
+
62
+ Overall it provides a solid option for RP and creative writing while still functioning as an assistant model, if desired. If used to continue a roleplay it will generally follow the ongoing cadence of the conversation.
63
+
64
+ ### It utilizes the Llama 3 prompt format.