InferenceIllusionist committed on
Commit
6cdab59
1 Parent(s): 683313d

Update README.md

  1. README.md +69 -1
README.md CHANGED
@@ -1 +1,69 @@
- GGUF files for [Magic-Dolphin-7b](https://huggingface.co/InferenceIllusionist/Magic-Dolphin-7b)
+ ---
+ license: apache-2.0
+ ---
+ GGUF files for [Magic-Dolphin-7b](https://huggingface.co/InferenceIllusionist/Magic-Dolphin-7b)
+
+ ---
+ # Magic-Dolphin-7b
+ <img src="https://huggingface.co/InferenceIllusionist/Magic-Dolphin-7b/resolve/main/magic-dolphin.jfif" width="500"/>
+
+ For GGUF files please look [here](https://huggingface.co/InferenceIllusionist/Magic-Dolphin-7b-GGUF)
+
+ A linear merge of:
+ - [cognitivecomputations/dolphin-2.6-mistral-7b-dpo-laser](https://huggingface.co/cognitivecomputations/dolphin-2.6-mistral-7b-dpo-laser)
+ - [Locutusque/Hyperion-1.5-Mistral-7B](https://huggingface.co/Locutusque/Hyperion-1.5-Mistral-7B)
+ - [ibm/merlinite-7b](https://huggingface.co/ibm/merlinite-7b)
+
+
+ These three models showed excellent acumen in technical topics, so I wanted to see how they would behave together in a merge. Several different ratios were tested before this release; in the end, a higher weighting for merlinite-7b helped smooth out some edges. This model is a test of how LAB tuning is impacted by merges with models leveraging DPO.
+
+ ### Benchmark Performance
+ | Name | Avg. | ARC | HellaSwag | MMLU | TruthfulQA | Winogrande | GSM8K |
+ | ---- | ---- | ---- | ---- | ---- | ---- | ---- | ---- |
+ | <b>Magic-Dolphin-7b</b> | <u><b>67.48</b></u> | 65.78 | 85.61 | 64.64 | 58.01 | <u><b>79.64</b></u> | <u><b>51.18</b></u> |
+ | dolphin-2.6-mistral-7b-dpo-laser | 67.28 | 66.3 | 85.73 | 63.16 | 61.71 | 79.16 | 47.61 |
+ | merlinite-7b | N/A | 63.99 | 84.37 | 64.88 | N/A | 78.24 | N/A |
+ | Hyperion-1.5-Mistral-7B | 61.43 | 60.49 | 83.64 | 63.57 | 41.78 | 78.61 | 40.49 |
+
+ This was my first experiment with merging models so any feedback is greatly appreciated.
+
+ Uses Alpaca template.
+
+ <p align="center">
+
+ </p>
+
+ <b>Sample Question</b>
+ <img src="https://huggingface.co/InferenceIllusionist/Magic-Dolphin-7b/resolve/main/magic-dolphin.JPG" width="750"/>
+
+ ## Merge Details
+ ### Merge Method
+
+ This model was merged using the [linear](https://arxiv.org/abs/2203.05482) merge method.
+
+ ### Models Merged
+
+ The following models were included in the merge:
+ * [cognitivecomputations/dolphin-2.6-mistral-7b-dpo-laser](https://huggingface.co/cognitivecomputations/dolphin-2.6-mistral-7b-dpo-laser)
+ * [Locutusque/Hyperion-1.5-Mistral-7B](https://huggingface.co/Locutusque/Hyperion-1.5-Mistral-7B)
+ * [ibm/merlinite-7b](https://huggingface.co/ibm/merlinite-7b)
+
+ ### Configuration
+
+ The following YAML configuration was used to produce this model:
+
+ ```yaml
+ models:
+   - model: models/dolphin-2.6-mistral-7b-dpo-laser
+     parameters:
+       weight: 1.0
+   - model: models/Hyperion-1.5-Mistral-7B
+     parameters:
+       weight: 0.3
+   - model: models/merlinite-7b
+     parameters:
+       weight: 0.5
+ merge_method: linear
+ dtype: float16
+
+ ```
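
Reviewer note (not part of the commit): the configuration above assigns one weight per source model, and a linear merge combines matching parameters as a weighted sum. The sketch below illustrates that idea on flat lists of floats standing in for weight tensors. The function name `linear_merge` and the `normalize` flag are hypothetical, and whether the tool that consumed this YAML rescales the weights to sum to 1 is an assumption; the README does not say.

```python
# Weights copied from the YAML configuration in the diff above.
WEIGHTS = {
    "dolphin-2.6-mistral-7b-dpo-laser": 1.0,
    "Hyperion-1.5-Mistral-7B": 0.3,
    "merlinite-7b": 0.5,
}


def linear_merge(tensors, weights, normalize=True):
    """Weighted sum of same-length parameter vectors, keyed by model name.

    `tensors` maps model name -> list of floats (a stand-in for one tensor).
    With normalize=True (an assumption, see the note above) the weights are
    rescaled so they sum to 1, which makes the result a weighted average.
    """
    names = list(tensors)
    length = len(tensors[names[0]])
    if any(len(tensors[name]) != length for name in names):
        raise ValueError("all tensors must have the same shape")

    total = sum(weights[name] for name in names) if normalize else 1.0
    merged = [0.0] * length
    for name in names:
        scale = weights[name] / total
        for i, value in enumerate(tensors[name]):
            merged[i] += scale * value
    return merged


if __name__ == "__main__":
    # Toy two-element "tensors", purely illustrative values.
    toy = {
        "dolphin-2.6-mistral-7b-dpo-laser": [1.0, 1.0],
        "Hyperion-1.5-Mistral-7B": [2.0, 0.0],
        "merlinite-7b": [3.0, -1.0],
    }
    print(linear_merge(toy, WEIGHTS))
```

If the weights are in fact normalized, 1.0, 0.3, and 0.5 correspond to effective shares of roughly 0.56, 0.17, and 0.28 for dolphin, Hyperion, and merlinite respectively.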