Solshine commited on
Commit
9e85390
1 Parent(s): 3ae75f3

Upload folder using huggingface_hub

Browse files
This view is limited to 50 files because it contains too many changes.   See raw diff
Files changed (50) hide show
  1. README.md +476 -0
  2. added_tokens.json +24 -0
  3. config.json +28 -0
  4. mergekit_config.yml +447 -0
  5. merges.txt +0 -0
  6. model-00001-of-00057.safetensors +3 -0
  7. model-00002-of-00057.safetensors +3 -0
  8. model-00003-of-00057.safetensors +3 -0
  9. model-00004-of-00057.safetensors +3 -0
  10. model-00005-of-00057.safetensors +3 -0
  11. model-00006-of-00057.safetensors +3 -0
  12. model-00007-of-00057.safetensors +3 -0
  13. model-00008-of-00057.safetensors +3 -0
  14. model-00009-of-00057.safetensors +3 -0
  15. model-00010-of-00057.safetensors +3 -0
  16. model-00011-of-00057.safetensors +3 -0
  17. model-00012-of-00057.safetensors +3 -0
  18. model-00013-of-00057.safetensors +3 -0
  19. model-00014-of-00057.safetensors +3 -0
  20. model-00015-of-00057.safetensors +3 -0
  21. model-00016-of-00057.safetensors +3 -0
  22. model-00017-of-00057.safetensors +3 -0
  23. model-00018-of-00057.safetensors +3 -0
  24. model-00019-of-00057.safetensors +3 -0
  25. model-00020-of-00057.safetensors +3 -0
  26. model-00021-of-00057.safetensors +3 -0
  27. model-00022-of-00057.safetensors +3 -0
  28. model-00023-of-00057.safetensors +3 -0
  29. model-00024-of-00057.safetensors +3 -0
  30. model-00025-of-00057.safetensors +3 -0
  31. model-00026-of-00057.safetensors +3 -0
  32. model-00027-of-00057.safetensors +3 -0
  33. model-00028-of-00057.safetensors +3 -0
  34. model-00029-of-00057.safetensors +3 -0
  35. model-00030-of-00057.safetensors +3 -0
  36. model-00031-of-00057.safetensors +3 -0
  37. model-00032-of-00057.safetensors +3 -0
  38. model-00033-of-00057.safetensors +3 -0
  39. model-00034-of-00057.safetensors +3 -0
  40. model-00035-of-00057.safetensors +3 -0
  41. model-00036-of-00057.safetensors +3 -0
  42. model-00037-of-00057.safetensors +3 -0
  43. model-00038-of-00057.safetensors +3 -0
  44. model-00039-of-00057.safetensors +3 -0
  45. model-00040-of-00057.safetensors +3 -0
  46. model-00041-of-00057.safetensors +3 -0
  47. model-00042-of-00057.safetensors +3 -0
  48. model-00043-of-00057.safetensors +3 -0
  49. model-00044-of-00057.safetensors +3 -0
  50. model-00045-of-00057.safetensors +3 -0
README.md ADDED
@@ -0,0 +1,476 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ base_model:
3
+ - Qwen/Qwen2.5-72B-Instruct
4
+ library_name: transformers
5
+ tags:
6
+ - mergekit
7
+ - merge
8
+
9
+ ---
10
+ # merge
11
+
12
+ This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
13
+
14
+ ## Merge Details
15
+ ### Merge Method
16
+
17
+ This model was merged using the passthrough merge method.
18
+
19
+ ### Models Merged
20
+
21
+ The following models were included in the merge:
22
+ * [Qwen/Qwen2.5-72B-Instruct](https://huggingface.co/Qwen/Qwen2.5-72B-Instruct)
23
+
24
+ ### Configuration
25
+
26
+ The following YAML configuration was used to produce this model:
27
+
28
+ ```yaml
29
+ slices:
30
+ - sources:
31
+ - model: Qwen/Qwen2.5-72B-Instruct
32
+ layer_range: [0, 4]
33
+ - sources:
34
+ - model: Qwen/Qwen2.5-72B-Instruct
35
+ layer_range: [4, 5]
36
+ - sources:
37
+ - model: Qwen/Qwen2.5-72B-Instruct
38
+ layer_range: [4, 5]
39
+ - sources:
40
+ - model: Qwen/Qwen2.5-72B-Instruct
41
+ layer_range: [5, 6]
42
+ - sources:
43
+ - model: Qwen/Qwen2.5-72B-Instruct
44
+ layer_range: [5, 6]
45
+ - sources:
46
+ - model: Qwen/Qwen2.5-72B-Instruct
47
+ layer_range: [6, 7]
48
+ - sources:
49
+ - model: Qwen/Qwen2.5-72B-Instruct
50
+ layer_range: [6, 7]
51
+ - sources:
52
+ - model: Qwen/Qwen2.5-72B-Instruct
53
+ layer_range: [7, 8]
54
+ - sources:
55
+ - model: Qwen/Qwen2.5-72B-Instruct
56
+ layer_range: [7, 8]
57
+ - sources:
58
+ - model: Qwen/Qwen2.5-72B-Instruct
59
+ layer_range: [8, 9]
60
+ - sources:
61
+ - model: Qwen/Qwen2.5-72B-Instruct
62
+ layer_range: [8, 9]
63
+ - sources:
64
+ - model: Qwen/Qwen2.5-72B-Instruct
65
+ layer_range: [9, 10]
66
+ - sources:
67
+ - model: Qwen/Qwen2.5-72B-Instruct
68
+ layer_range: [9, 10]
69
+ - sources:
70
+ - model: Qwen/Qwen2.5-72B-Instruct
71
+ layer_range: [10, 11]
72
+ - sources:
73
+ - model: Qwen/Qwen2.5-72B-Instruct
74
+ layer_range: [10, 11]
75
+ - sources:
76
+ - model: Qwen/Qwen2.5-72B-Instruct
77
+ layer_range: [11, 12]
78
+ - sources:
79
+ - model: Qwen/Qwen2.5-72B-Instruct
80
+ layer_range: [11, 12]
81
+ - sources:
82
+ - model: Qwen/Qwen2.5-72B-Instruct
83
+ layer_range: [12, 13]
84
+ - sources:
85
+ - model: Qwen/Qwen2.5-72B-Instruct
86
+ layer_range: [12, 13]
87
+ - sources:
88
+ - model: Qwen/Qwen2.5-72B-Instruct
89
+ layer_range: [13, 14]
90
+ - sources:
91
+ - model: Qwen/Qwen2.5-72B-Instruct
92
+ layer_range: [13, 14]
93
+ - sources:
94
+ - model: Qwen/Qwen2.5-72B-Instruct
95
+ layer_range: [14, 15]
96
+ - sources:
97
+ - model: Qwen/Qwen2.5-72B-Instruct
98
+ layer_range: [14, 15]
99
+ - sources:
100
+ - model: Qwen/Qwen2.5-72B-Instruct
101
+ layer_range: [15, 16]
102
+ - sources:
103
+ - model: Qwen/Qwen2.5-72B-Instruct
104
+ layer_range: [15, 16]
105
+ - sources:
106
+ - model: Qwen/Qwen2.5-72B-Instruct
107
+ layer_range: [16, 17]
108
+ - sources:
109
+ - model: Qwen/Qwen2.5-72B-Instruct
110
+ layer_range: [16, 17]
111
+ - sources:
112
+ - model: Qwen/Qwen2.5-72B-Instruct
113
+ layer_range: [17, 18]
114
+ - sources:
115
+ - model: Qwen/Qwen2.5-72B-Instruct
116
+ layer_range: [17, 18]
117
+ - sources:
118
+ - model: Qwen/Qwen2.5-72B-Instruct
119
+ layer_range: [18, 19]
120
+ - sources:
121
+ - model: Qwen/Qwen2.5-72B-Instruct
122
+ layer_range: [18, 19]
123
+ - sources:
124
+ - model: Qwen/Qwen2.5-72B-Instruct
125
+ layer_range: [19, 20]
126
+ - sources:
127
+ - model: Qwen/Qwen2.5-72B-Instruct
128
+ layer_range: [19, 20]
129
+ - sources:
130
+ - model: Qwen/Qwen2.5-72B-Instruct
131
+ layer_range: [20, 21]
132
+ - sources:
133
+ - model: Qwen/Qwen2.5-72B-Instruct
134
+ layer_range: [20, 21]
135
+ - sources:
136
+ - model: Qwen/Qwen2.5-72B-Instruct
137
+ layer_range: [21, 22]
138
+ - sources:
139
+ - model: Qwen/Qwen2.5-72B-Instruct
140
+ layer_range: [21, 22]
141
+ - sources:
142
+ - model: Qwen/Qwen2.5-72B-Instruct
143
+ layer_range: [22, 23]
144
+ - sources:
145
+ - model: Qwen/Qwen2.5-72B-Instruct
146
+ layer_range: [22, 23]
147
+ - sources:
148
+ - model: Qwen/Qwen2.5-72B-Instruct
149
+ layer_range: [23, 24]
150
+ - sources:
151
+ - model: Qwen/Qwen2.5-72B-Instruct
152
+ layer_range: [23, 24]
153
+ - sources:
154
+ - model: Qwen/Qwen2.5-72B-Instruct
155
+ layer_range: [24, 25]
156
+ - sources:
157
+ - model: Qwen/Qwen2.5-72B-Instruct
158
+ layer_range: [24, 25]
159
+ - sources:
160
+ - model: Qwen/Qwen2.5-72B-Instruct
161
+ layer_range: [25, 26]
162
+ - sources:
163
+ - model: Qwen/Qwen2.5-72B-Instruct
164
+ layer_range: [25, 26]
165
+ - sources:
166
+ - model: Qwen/Qwen2.5-72B-Instruct
167
+ layer_range: [26, 27]
168
+ - sources:
169
+ - model: Qwen/Qwen2.5-72B-Instruct
170
+ layer_range: [26, 27]
171
+ - sources:
172
+ - model: Qwen/Qwen2.5-72B-Instruct
173
+ layer_range: [27, 28]
174
+ - sources:
175
+ - model: Qwen/Qwen2.5-72B-Instruct
176
+ layer_range: [27, 28]
177
+ - sources:
178
+ - model: Qwen/Qwen2.5-72B-Instruct
179
+ layer_range: [28, 29]
180
+ - sources:
181
+ - model: Qwen/Qwen2.5-72B-Instruct
182
+ layer_range: [28, 29]
183
+ - sources:
184
+ - model: Qwen/Qwen2.5-72B-Instruct
185
+ layer_range: [29, 30]
186
+ - sources:
187
+ - model: Qwen/Qwen2.5-72B-Instruct
188
+ layer_range: [29, 30]
189
+ - sources:
190
+ - model: Qwen/Qwen2.5-72B-Instruct
191
+ layer_range: [30, 31]
192
+ - sources:
193
+ - model: Qwen/Qwen2.5-72B-Instruct
194
+ layer_range: [30, 31]
195
+ - sources:
196
+ - model: Qwen/Qwen2.5-72B-Instruct
197
+ layer_range: [31, 32]
198
+ - sources:
199
+ - model: Qwen/Qwen2.5-72B-Instruct
200
+ layer_range: [31, 32]
201
+ - sources:
202
+ - model: Qwen/Qwen2.5-72B-Instruct
203
+ layer_range: [32, 33]
204
+ - sources:
205
+ - model: Qwen/Qwen2.5-72B-Instruct
206
+ layer_range: [32, 33]
207
+ - sources:
208
+ - model: Qwen/Qwen2.5-72B-Instruct
209
+ layer_range: [33, 34]
210
+ - sources:
211
+ - model: Qwen/Qwen2.5-72B-Instruct
212
+ layer_range: [33, 34]
213
+ - sources:
214
+ - model: Qwen/Qwen2.5-72B-Instruct
215
+ layer_range: [34, 35]
216
+ - sources:
217
+ - model: Qwen/Qwen2.5-72B-Instruct
218
+ layer_range: [34, 35]
219
+ - sources:
220
+ - model: Qwen/Qwen2.5-72B-Instruct
221
+ layer_range: [35, 36]
222
+ - sources:
223
+ - model: Qwen/Qwen2.5-72B-Instruct
224
+ layer_range: [35, 36]
225
+ - sources:
226
+ - model: Qwen/Qwen2.5-72B-Instruct
227
+ layer_range: [36, 37]
228
+ - sources:
229
+ - model: Qwen/Qwen2.5-72B-Instruct
230
+ layer_range: [36, 37]
231
+ - sources:
232
+ - model: Qwen/Qwen2.5-72B-Instruct
233
+ layer_range: [37, 38]
234
+ - sources:
235
+ - model: Qwen/Qwen2.5-72B-Instruct
236
+ layer_range: [37, 38]
237
+ - sources:
238
+ - model: Qwen/Qwen2.5-72B-Instruct
239
+ layer_range: [38, 39]
240
+ - sources:
241
+ - model: Qwen/Qwen2.5-72B-Instruct
242
+ layer_range: [38, 39]
243
+ - sources:
244
+ - model: Qwen/Qwen2.5-72B-Instruct
245
+ layer_range: [39, 40]
246
+ - sources:
247
+ - model: Qwen/Qwen2.5-72B-Instruct
248
+ layer_range: [39, 40]
249
+ - sources:
250
+ - model: Qwen/Qwen2.5-72B-Instruct
251
+ layer_range: [40, 41]
252
+ - sources:
253
+ - model: Qwen/Qwen2.5-72B-Instruct
254
+ layer_range: [40, 41]
255
+ - sources:
256
+ - model: Qwen/Qwen2.5-72B-Instruct
257
+ layer_range: [41, 42]
258
+ - sources:
259
+ - model: Qwen/Qwen2.5-72B-Instruct
260
+ layer_range: [41, 42]
261
+ - sources:
262
+ - model: Qwen/Qwen2.5-72B-Instruct
263
+ layer_range: [42, 43]
264
+ - sources:
265
+ - model: Qwen/Qwen2.5-72B-Instruct
266
+ layer_range: [42, 43]
267
+ - sources:
268
+ - model: Qwen/Qwen2.5-72B-Instruct
269
+ layer_range: [43, 44]
270
+ - sources:
271
+ - model: Qwen/Qwen2.5-72B-Instruct
272
+ layer_range: [43, 44]
273
+ - sources:
274
+ - model: Qwen/Qwen2.5-72B-Instruct
275
+ layer_range: [44, 45]
276
+ - sources:
277
+ - model: Qwen/Qwen2.5-72B-Instruct
278
+ layer_range: [44, 45]
279
+ - sources:
280
+ - model: Qwen/Qwen2.5-72B-Instruct
281
+ layer_range: [45, 46]
282
+ - sources:
283
+ - model: Qwen/Qwen2.5-72B-Instruct
284
+ layer_range: [45, 46]
285
+ - sources:
286
+ - model: Qwen/Qwen2.5-72B-Instruct
287
+ layer_range: [46, 47]
288
+ - sources:
289
+ - model: Qwen/Qwen2.5-72B-Instruct
290
+ layer_range: [46, 47]
291
+ - sources:
292
+ - model: Qwen/Qwen2.5-72B-Instruct
293
+ layer_range: [47, 48]
294
+ - sources:
295
+ - model: Qwen/Qwen2.5-72B-Instruct
296
+ layer_range: [47, 48]
297
+ - sources:
298
+ - model: Qwen/Qwen2.5-72B-Instruct
299
+ layer_range: [48, 49]
300
+ - sources:
301
+ - model: Qwen/Qwen2.5-72B-Instruct
302
+ layer_range: [48, 49]
303
+ - sources:
304
+ - model: Qwen/Qwen2.5-72B-Instruct
305
+ layer_range: [49, 50]
306
+ - sources:
307
+ - model: Qwen/Qwen2.5-72B-Instruct
308
+ layer_range: [49, 50]
309
+ - sources:
310
+ - model: Qwen/Qwen2.5-72B-Instruct
311
+ layer_range: [50, 51]
312
+ - sources:
313
+ - model: Qwen/Qwen2.5-72B-Instruct
314
+ layer_range: [50, 51]
315
+ - sources:
316
+ - model: Qwen/Qwen2.5-72B-Instruct
317
+ layer_range: [51, 52]
318
+ - sources:
319
+ - model: Qwen/Qwen2.5-72B-Instruct
320
+ layer_range: [51, 52]
321
+ - sources:
322
+ - model: Qwen/Qwen2.5-72B-Instruct
323
+ layer_range: [52, 53]
324
+ - sources:
325
+ - model: Qwen/Qwen2.5-72B-Instruct
326
+ layer_range: [52, 53]
327
+ - sources:
328
+ - model: Qwen/Qwen2.5-72B-Instruct
329
+ layer_range: [53, 54]
330
+ - sources:
331
+ - model: Qwen/Qwen2.5-72B-Instruct
332
+ layer_range: [53, 54]
333
+ - sources:
334
+ - model: Qwen/Qwen2.5-72B-Instruct
335
+ layer_range: [54, 55]
336
+ - sources:
337
+ - model: Qwen/Qwen2.5-72B-Instruct
338
+ layer_range: [54, 55]
339
+ - sources:
340
+ - model: Qwen/Qwen2.5-72B-Instruct
341
+ layer_range: [55, 56]
342
+ - sources:
343
+ - model: Qwen/Qwen2.5-72B-Instruct
344
+ layer_range: [55, 56]
345
+ - sources:
346
+ - model: Qwen/Qwen2.5-72B-Instruct
347
+ layer_range: [56, 57]
348
+ - sources:
349
+ - model: Qwen/Qwen2.5-72B-Instruct
350
+ layer_range: [56, 57]
351
+ - sources:
352
+ - model: Qwen/Qwen2.5-72B-Instruct
353
+ layer_range: [57, 58]
354
+ - sources:
355
+ - model: Qwen/Qwen2.5-72B-Instruct
356
+ layer_range: [57, 58]
357
+ - sources:
358
+ - model: Qwen/Qwen2.5-72B-Instruct
359
+ layer_range: [58, 59]
360
+ - sources:
361
+ - model: Qwen/Qwen2.5-72B-Instruct
362
+ layer_range: [58, 59]
363
+ - sources:
364
+ - model: Qwen/Qwen2.5-72B-Instruct
365
+ layer_range: [59, 60]
366
+ - sources:
367
+ - model: Qwen/Qwen2.5-72B-Instruct
368
+ layer_range: [59, 60]
369
+ - sources:
370
+ - model: Qwen/Qwen2.5-72B-Instruct
371
+ layer_range: [60, 61]
372
+ - sources:
373
+ - model: Qwen/Qwen2.5-72B-Instruct
374
+ layer_range: [60, 61]
375
+ - sources:
376
+ - model: Qwen/Qwen2.5-72B-Instruct
377
+ layer_range: [61, 62]
378
+ - sources:
379
+ - model: Qwen/Qwen2.5-72B-Instruct
380
+ layer_range: [61, 62]
381
+ - sources:
382
+ - model: Qwen/Qwen2.5-72B-Instruct
383
+ layer_range: [62, 63]
384
+ - sources:
385
+ - model: Qwen/Qwen2.5-72B-Instruct
386
+ layer_range: [62, 63]
387
+ - sources:
388
+ - model: Qwen/Qwen2.5-72B-Instruct
389
+ layer_range: [63, 64]
390
+ - sources:
391
+ - model: Qwen/Qwen2.5-72B-Instruct
392
+ layer_range: [63, 64]
393
+ - sources:
394
+ - model: Qwen/Qwen2.5-72B-Instruct
395
+ layer_range: [64, 65]
396
+ - sources:
397
+ - model: Qwen/Qwen2.5-72B-Instruct
398
+ layer_range: [64, 65]
399
+ - sources:
400
+ - model: Qwen/Qwen2.5-72B-Instruct
401
+ layer_range: [65, 66]
402
+ - sources:
403
+ - model: Qwen/Qwen2.5-72B-Instruct
404
+ layer_range: [65, 66]
405
+ - sources:
406
+ - model: Qwen/Qwen2.5-72B-Instruct
407
+ layer_range: [66, 67]
408
+ - sources:
409
+ - model: Qwen/Qwen2.5-72B-Instruct
410
+ layer_range: [66, 67]
411
+ - sources:
412
+ - model: Qwen/Qwen2.5-72B-Instruct
413
+ layer_range: [67, 68]
414
+ - sources:
415
+ - model: Qwen/Qwen2.5-72B-Instruct
416
+ layer_range: [67, 68]
417
+ - sources:
418
+ - model: Qwen/Qwen2.5-72B-Instruct
419
+ layer_range: [68, 69]
420
+ - sources:
421
+ - model: Qwen/Qwen2.5-72B-Instruct
422
+ layer_range: [68, 69]
423
+ - sources:
424
+ - model: Qwen/Qwen2.5-72B-Instruct
425
+ layer_range: [69, 70]
426
+ - sources:
427
+ - model: Qwen/Qwen2.5-72B-Instruct
428
+ layer_range: [69, 70]
429
+ - sources:
430
+ - model: Qwen/Qwen2.5-72B-Instruct
431
+ layer_range: [70, 71]
432
+ - sources:
433
+ - model: Qwen/Qwen2.5-72B-Instruct
434
+ layer_range: [70, 71]
435
+ - sources:
436
+ - model: Qwen/Qwen2.5-72B-Instruct
437
+ layer_range: [71, 72]
438
+ - sources:
439
+ - model: Qwen/Qwen2.5-72B-Instruct
440
+ layer_range: [71, 72]
441
+ - sources:
442
+ - model: Qwen/Qwen2.5-72B-Instruct
443
+ layer_range: [72, 73]
444
+ - sources:
445
+ - model: Qwen/Qwen2.5-72B-Instruct
446
+ layer_range: [72, 73]
447
+ - sources:
448
+ - model: Qwen/Qwen2.5-72B-Instruct
449
+ layer_range: [73, 74]
450
+ - sources:
451
+ - model: Qwen/Qwen2.5-72B-Instruct
452
+ layer_range: [73, 74]
453
+ - sources:
454
+ - model: Qwen/Qwen2.5-72B-Instruct
455
+ layer_range: [74, 75]
456
+ - sources:
457
+ - model: Qwen/Qwen2.5-72B-Instruct
458
+ layer_range: [74, 75]
459
+ - sources:
460
+ - model: Qwen/Qwen2.5-72B-Instruct
461
+ layer_range: [75, 76]
462
+ - sources:
463
+ - model: Qwen/Qwen2.5-72B-Instruct
464
+ layer_range: [75, 76]
465
+ - sources:
466
+ - model: Qwen/Qwen2.5-72B-Instruct
467
+ layer_range: [76, 77]
468
+ - sources:
469
+ - model: Qwen/Qwen2.5-72B-Instruct
470
+ layer_range: [76, 77]
471
+ - sources:
472
+ - model: Qwen/Qwen2.5-72B-Instruct
473
+ layer_range: [77, 80]
474
+ merge_method: passthrough
475
+ dtype: float16
476
+ ```
added_tokens.json ADDED
@@ -0,0 +1,24 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "</tool_call>": 151658,
3
+ "<tool_call>": 151657,
4
+ "<|box_end|>": 151649,
5
+ "<|box_start|>": 151648,
6
+ "<|endoftext|>": 151643,
7
+ "<|file_sep|>": 151664,
8
+ "<|fim_middle|>": 151660,
9
+ "<|fim_pad|>": 151662,
10
+ "<|fim_prefix|>": 151659,
11
+ "<|fim_suffix|>": 151661,
12
+ "<|im_end|>": 151645,
13
+ "<|im_start|>": 151644,
14
+ "<|image_pad|>": 151655,
15
+ "<|object_ref_end|>": 151647,
16
+ "<|object_ref_start|>": 151646,
17
+ "<|quad_end|>": 151651,
18
+ "<|quad_start|>": 151650,
19
+ "<|repo_name|>": 151663,
20
+ "<|video_pad|>": 151656,
21
+ "<|vision_end|>": 151653,
22
+ "<|vision_pad|>": 151654,
23
+ "<|vision_start|>": 151652
24
+ }
config.json ADDED
@@ -0,0 +1,28 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "_name_or_path": "Qwen/Qwen2.5-72B-Instruct",
3
+ "architectures": [
4
+ "Qwen2ForCausalLM"
5
+ ],
6
+ "attention_dropout": 0.0,
7
+ "bos_token_id": 151643,
8
+ "eos_token_id": 151645,
9
+ "hidden_act": "silu",
10
+ "hidden_size": 8192,
11
+ "initializer_range": 0.02,
12
+ "intermediate_size": 29568,
13
+ "max_position_embeddings": 32768,
14
+ "max_window_layers": 70,
15
+ "model_type": "qwen2",
16
+ "num_attention_heads": 64,
17
+ "num_hidden_layers": 153,
18
+ "num_key_value_heads": 8,
19
+ "rms_norm_eps": 1e-06,
20
+ "rope_theta": 1000000.0,
21
+ "sliding_window": null,
22
+ "tie_word_embeddings": false,
23
+ "torch_dtype": "float16",
24
+ "transformers_version": "4.44.1",
25
+ "use_cache": true,
26
+ "use_sliding_window": false,
27
+ "vocab_size": 152064
28
+ }
mergekit_config.yml ADDED
@@ -0,0 +1,447 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ slices:
2
+ - sources:
3
+ - model: Qwen/Qwen2.5-72B-Instruct
4
+ layer_range: [0, 4]
5
+ - sources:
6
+ - model: Qwen/Qwen2.5-72B-Instruct
7
+ layer_range: [4, 5]
8
+ - sources:
9
+ - model: Qwen/Qwen2.5-72B-Instruct
10
+ layer_range: [4, 5]
11
+ - sources:
12
+ - model: Qwen/Qwen2.5-72B-Instruct
13
+ layer_range: [5, 6]
14
+ - sources:
15
+ - model: Qwen/Qwen2.5-72B-Instruct
16
+ layer_range: [5, 6]
17
+ - sources:
18
+ - model: Qwen/Qwen2.5-72B-Instruct
19
+ layer_range: [6, 7]
20
+ - sources:
21
+ - model: Qwen/Qwen2.5-72B-Instruct
22
+ layer_range: [6, 7]
23
+ - sources:
24
+ - model: Qwen/Qwen2.5-72B-Instruct
25
+ layer_range: [7, 8]
26
+ - sources:
27
+ - model: Qwen/Qwen2.5-72B-Instruct
28
+ layer_range: [7, 8]
29
+ - sources:
30
+ - model: Qwen/Qwen2.5-72B-Instruct
31
+ layer_range: [8, 9]
32
+ - sources:
33
+ - model: Qwen/Qwen2.5-72B-Instruct
34
+ layer_range: [8, 9]
35
+ - sources:
36
+ - model: Qwen/Qwen2.5-72B-Instruct
37
+ layer_range: [9, 10]
38
+ - sources:
39
+ - model: Qwen/Qwen2.5-72B-Instruct
40
+ layer_range: [9, 10]
41
+ - sources:
42
+ - model: Qwen/Qwen2.5-72B-Instruct
43
+ layer_range: [10, 11]
44
+ - sources:
45
+ - model: Qwen/Qwen2.5-72B-Instruct
46
+ layer_range: [10, 11]
47
+ - sources:
48
+ - model: Qwen/Qwen2.5-72B-Instruct
49
+ layer_range: [11, 12]
50
+ - sources:
51
+ - model: Qwen/Qwen2.5-72B-Instruct
52
+ layer_range: [11, 12]
53
+ - sources:
54
+ - model: Qwen/Qwen2.5-72B-Instruct
55
+ layer_range: [12, 13]
56
+ - sources:
57
+ - model: Qwen/Qwen2.5-72B-Instruct
58
+ layer_range: [12, 13]
59
+ - sources:
60
+ - model: Qwen/Qwen2.5-72B-Instruct
61
+ layer_range: [13, 14]
62
+ - sources:
63
+ - model: Qwen/Qwen2.5-72B-Instruct
64
+ layer_range: [13, 14]
65
+ - sources:
66
+ - model: Qwen/Qwen2.5-72B-Instruct
67
+ layer_range: [14, 15]
68
+ - sources:
69
+ - model: Qwen/Qwen2.5-72B-Instruct
70
+ layer_range: [14, 15]
71
+ - sources:
72
+ - model: Qwen/Qwen2.5-72B-Instruct
73
+ layer_range: [15, 16]
74
+ - sources:
75
+ - model: Qwen/Qwen2.5-72B-Instruct
76
+ layer_range: [15, 16]
77
+ - sources:
78
+ - model: Qwen/Qwen2.5-72B-Instruct
79
+ layer_range: [16, 17]
80
+ - sources:
81
+ - model: Qwen/Qwen2.5-72B-Instruct
82
+ layer_range: [16, 17]
83
+ - sources:
84
+ - model: Qwen/Qwen2.5-72B-Instruct
85
+ layer_range: [17, 18]
86
+ - sources:
87
+ - model: Qwen/Qwen2.5-72B-Instruct
88
+ layer_range: [17, 18]
89
+ - sources:
90
+ - model: Qwen/Qwen2.5-72B-Instruct
91
+ layer_range: [18, 19]
92
+ - sources:
93
+ - model: Qwen/Qwen2.5-72B-Instruct
94
+ layer_range: [18, 19]
95
+ - sources:
96
+ - model: Qwen/Qwen2.5-72B-Instruct
97
+ layer_range: [19, 20]
98
+ - sources:
99
+ - model: Qwen/Qwen2.5-72B-Instruct
100
+ layer_range: [19, 20]
101
+ - sources:
102
+ - model: Qwen/Qwen2.5-72B-Instruct
103
+ layer_range: [20, 21]
104
+ - sources:
105
+ - model: Qwen/Qwen2.5-72B-Instruct
106
+ layer_range: [20, 21]
107
+ - sources:
108
+ - model: Qwen/Qwen2.5-72B-Instruct
109
+ layer_range: [21, 22]
110
+ - sources:
111
+ - model: Qwen/Qwen2.5-72B-Instruct
112
+ layer_range: [21, 22]
113
+ - sources:
114
+ - model: Qwen/Qwen2.5-72B-Instruct
115
+ layer_range: [22, 23]
116
+ - sources:
117
+ - model: Qwen/Qwen2.5-72B-Instruct
118
+ layer_range: [22, 23]
119
+ - sources:
120
+ - model: Qwen/Qwen2.5-72B-Instruct
121
+ layer_range: [23, 24]
122
+ - sources:
123
+ - model: Qwen/Qwen2.5-72B-Instruct
124
+ layer_range: [23, 24]
125
+ - sources:
126
+ - model: Qwen/Qwen2.5-72B-Instruct
127
+ layer_range: [24, 25]
128
+ - sources:
129
+ - model: Qwen/Qwen2.5-72B-Instruct
130
+ layer_range: [24, 25]
131
+ - sources:
132
+ - model: Qwen/Qwen2.5-72B-Instruct
133
+ layer_range: [25, 26]
134
+ - sources:
135
+ - model: Qwen/Qwen2.5-72B-Instruct
136
+ layer_range: [25, 26]
137
+ - sources:
138
+ - model: Qwen/Qwen2.5-72B-Instruct
139
+ layer_range: [26, 27]
140
+ - sources:
141
+ - model: Qwen/Qwen2.5-72B-Instruct
142
+ layer_range: [26, 27]
143
+ - sources:
144
+ - model: Qwen/Qwen2.5-72B-Instruct
145
+ layer_range: [27, 28]
146
+ - sources:
147
+ - model: Qwen/Qwen2.5-72B-Instruct
148
+ layer_range: [27, 28]
149
+ - sources:
150
+ - model: Qwen/Qwen2.5-72B-Instruct
151
+ layer_range: [28, 29]
152
+ - sources:
153
+ - model: Qwen/Qwen2.5-72B-Instruct
154
+ layer_range: [28, 29]
155
+ - sources:
156
+ - model: Qwen/Qwen2.5-72B-Instruct
157
+ layer_range: [29, 30]
158
+ - sources:
159
+ - model: Qwen/Qwen2.5-72B-Instruct
160
+ layer_range: [29, 30]
161
+ - sources:
162
+ - model: Qwen/Qwen2.5-72B-Instruct
163
+ layer_range: [30, 31]
164
+ - sources:
165
+ - model: Qwen/Qwen2.5-72B-Instruct
166
+ layer_range: [30, 31]
167
+ - sources:
168
+ - model: Qwen/Qwen2.5-72B-Instruct
169
+ layer_range: [31, 32]
170
+ - sources:
171
+ - model: Qwen/Qwen2.5-72B-Instruct
172
+ layer_range: [31, 32]
173
+ - sources:
174
+ - model: Qwen/Qwen2.5-72B-Instruct
175
+ layer_range: [32, 33]
176
+ - sources:
177
+ - model: Qwen/Qwen2.5-72B-Instruct
178
+ layer_range: [32, 33]
179
+ - sources:
180
+ - model: Qwen/Qwen2.5-72B-Instruct
181
+ layer_range: [33, 34]
182
+ - sources:
183
+ - model: Qwen/Qwen2.5-72B-Instruct
184
+ layer_range: [33, 34]
185
+ - sources:
186
+ - model: Qwen/Qwen2.5-72B-Instruct
187
+ layer_range: [34, 35]
188
+ - sources:
189
+ - model: Qwen/Qwen2.5-72B-Instruct
190
+ layer_range: [34, 35]
191
+ - sources:
192
+ - model: Qwen/Qwen2.5-72B-Instruct
193
+ layer_range: [35, 36]
194
+ - sources:
195
+ - model: Qwen/Qwen2.5-72B-Instruct
196
+ layer_range: [35, 36]
197
+ - sources:
198
+ - model: Qwen/Qwen2.5-72B-Instruct
199
+ layer_range: [36, 37]
200
+ - sources:
201
+ - model: Qwen/Qwen2.5-72B-Instruct
202
+ layer_range: [36, 37]
203
+ - sources:
204
+ - model: Qwen/Qwen2.5-72B-Instruct
205
+ layer_range: [37, 38]
206
+ - sources:
207
+ - model: Qwen/Qwen2.5-72B-Instruct
208
+ layer_range: [37, 38]
209
+ - sources:
210
+ - model: Qwen/Qwen2.5-72B-Instruct
211
+ layer_range: [38, 39]
212
+ - sources:
213
+ - model: Qwen/Qwen2.5-72B-Instruct
214
+ layer_range: [38, 39]
215
+ - sources:
216
+ - model: Qwen/Qwen2.5-72B-Instruct
217
+ layer_range: [39, 40]
218
+ - sources:
219
+ - model: Qwen/Qwen2.5-72B-Instruct
220
+ layer_range: [39, 40]
221
+ - sources:
222
+ - model: Qwen/Qwen2.5-72B-Instruct
223
+ layer_range: [40, 41]
224
+ - sources:
225
+ - model: Qwen/Qwen2.5-72B-Instruct
226
+ layer_range: [40, 41]
227
+ - sources:
228
+ - model: Qwen/Qwen2.5-72B-Instruct
229
+ layer_range: [41, 42]
230
+ - sources:
231
+ - model: Qwen/Qwen2.5-72B-Instruct
232
+ layer_range: [41, 42]
233
+ - sources:
234
+ - model: Qwen/Qwen2.5-72B-Instruct
235
+ layer_range: [42, 43]
236
+ - sources:
237
+ - model: Qwen/Qwen2.5-72B-Instruct
238
+ layer_range: [42, 43]
239
+ - sources:
240
+ - model: Qwen/Qwen2.5-72B-Instruct
241
+ layer_range: [43, 44]
242
+ - sources:
243
+ - model: Qwen/Qwen2.5-72B-Instruct
244
+ layer_range: [43, 44]
245
+ - sources:
246
+ - model: Qwen/Qwen2.5-72B-Instruct
247
+ layer_range: [44, 45]
248
+ - sources:
249
+ - model: Qwen/Qwen2.5-72B-Instruct
250
+ layer_range: [44, 45]
251
+ - sources:
252
+ - model: Qwen/Qwen2.5-72B-Instruct
253
+ layer_range: [45, 46]
254
+ - sources:
255
+ - model: Qwen/Qwen2.5-72B-Instruct
256
+ layer_range: [45, 46]
257
+ - sources:
258
+ - model: Qwen/Qwen2.5-72B-Instruct
259
+ layer_range: [46, 47]
260
+ - sources:
261
+ - model: Qwen/Qwen2.5-72B-Instruct
262
+ layer_range: [46, 47]
263
+ - sources:
264
+ - model: Qwen/Qwen2.5-72B-Instruct
265
+ layer_range: [47, 48]
266
+ - sources:
267
+ - model: Qwen/Qwen2.5-72B-Instruct
268
+ layer_range: [47, 48]
269
+ - sources:
270
+ - model: Qwen/Qwen2.5-72B-Instruct
271
+ layer_range: [48, 49]
272
+ - sources:
273
+ - model: Qwen/Qwen2.5-72B-Instruct
274
+ layer_range: [48, 49]
275
+ - sources:
276
+ - model: Qwen/Qwen2.5-72B-Instruct
277
+ layer_range: [49, 50]
278
+ - sources:
279
+ - model: Qwen/Qwen2.5-72B-Instruct
280
+ layer_range: [49, 50]
281
+ - sources:
282
+ - model: Qwen/Qwen2.5-72B-Instruct
283
+ layer_range: [50, 51]
284
+ - sources:
285
+ - model: Qwen/Qwen2.5-72B-Instruct
286
+ layer_range: [50, 51]
287
+ - sources:
288
+ - model: Qwen/Qwen2.5-72B-Instruct
289
+ layer_range: [51, 52]
290
+ - sources:
291
+ - model: Qwen/Qwen2.5-72B-Instruct
292
+ layer_range: [51, 52]
293
+ - sources:
294
+ - model: Qwen/Qwen2.5-72B-Instruct
295
+ layer_range: [52, 53]
296
+ - sources:
297
+ - model: Qwen/Qwen2.5-72B-Instruct
298
+ layer_range: [52, 53]
299
+ - sources:
300
+ - model: Qwen/Qwen2.5-72B-Instruct
301
+ layer_range: [53, 54]
302
+ - sources:
303
+ - model: Qwen/Qwen2.5-72B-Instruct
304
+ layer_range: [53, 54]
305
+ - sources:
306
+ - model: Qwen/Qwen2.5-72B-Instruct
307
+ layer_range: [54, 55]
308
+ - sources:
309
+ - model: Qwen/Qwen2.5-72B-Instruct
310
+ layer_range: [54, 55]
311
+ - sources:
312
+ - model: Qwen/Qwen2.5-72B-Instruct
313
+ layer_range: [55, 56]
314
+ - sources:
315
+ - model: Qwen/Qwen2.5-72B-Instruct
316
+ layer_range: [55, 56]
317
+ - sources:
318
+ - model: Qwen/Qwen2.5-72B-Instruct
319
+ layer_range: [56, 57]
320
+ - sources:
321
+ - model: Qwen/Qwen2.5-72B-Instruct
322
+ layer_range: [56, 57]
323
+ - sources:
324
+ - model: Qwen/Qwen2.5-72B-Instruct
325
+ layer_range: [57, 58]
326
+ - sources:
327
+ - model: Qwen/Qwen2.5-72B-Instruct
328
+ layer_range: [57, 58]
329
+ - sources:
330
+ - model: Qwen/Qwen2.5-72B-Instruct
331
+ layer_range: [58, 59]
332
+ - sources:
333
+ - model: Qwen/Qwen2.5-72B-Instruct
334
+ layer_range: [58, 59]
335
+ - sources:
336
+ - model: Qwen/Qwen2.5-72B-Instruct
337
+ layer_range: [59, 60]
338
+ - sources:
339
+ - model: Qwen/Qwen2.5-72B-Instruct
340
+ layer_range: [59, 60]
341
+ - sources:
342
+ - model: Qwen/Qwen2.5-72B-Instruct
343
+ layer_range: [60, 61]
344
+ - sources:
345
+ - model: Qwen/Qwen2.5-72B-Instruct
346
+ layer_range: [60, 61]
347
+ - sources:
348
+ - model: Qwen/Qwen2.5-72B-Instruct
349
+ layer_range: [61, 62]
350
+ - sources:
351
+ - model: Qwen/Qwen2.5-72B-Instruct
352
+ layer_range: [61, 62]
353
+ - sources:
354
+ - model: Qwen/Qwen2.5-72B-Instruct
355
+ layer_range: [62, 63]
356
+ - sources:
357
+ - model: Qwen/Qwen2.5-72B-Instruct
358
+ layer_range: [62, 63]
359
+ - sources:
360
+ - model: Qwen/Qwen2.5-72B-Instruct
361
+ layer_range: [63, 64]
362
+ - sources:
363
+ - model: Qwen/Qwen2.5-72B-Instruct
364
+ layer_range: [63, 64]
365
+ - sources:
366
+ - model: Qwen/Qwen2.5-72B-Instruct
367
+ layer_range: [64, 65]
368
+ - sources:
369
+ - model: Qwen/Qwen2.5-72B-Instruct
370
+ layer_range: [64, 65]
371
+ - sources:
372
+ - model: Qwen/Qwen2.5-72B-Instruct
373
+ layer_range: [65, 66]
374
+ - sources:
375
+ - model: Qwen/Qwen2.5-72B-Instruct
376
+ layer_range: [65, 66]
377
+ - sources:
378
+ - model: Qwen/Qwen2.5-72B-Instruct
379
+ layer_range: [66, 67]
380
+ - sources:
381
+ - model: Qwen/Qwen2.5-72B-Instruct
382
+ layer_range: [66, 67]
383
+ - sources:
384
+ - model: Qwen/Qwen2.5-72B-Instruct
385
+ layer_range: [67, 68]
386
+ - sources:
387
+ - model: Qwen/Qwen2.5-72B-Instruct
388
+ layer_range: [67, 68]
389
+ - sources:
390
+ - model: Qwen/Qwen2.5-72B-Instruct
391
+ layer_range: [68, 69]
392
+ - sources:
393
+ - model: Qwen/Qwen2.5-72B-Instruct
394
+ layer_range: [68, 69]
395
+ - sources:
396
+ - model: Qwen/Qwen2.5-72B-Instruct
397
+ layer_range: [69, 70]
398
+ - sources:
399
+ - model: Qwen/Qwen2.5-72B-Instruct
400
+ layer_range: [69, 70]
401
+ - sources:
402
+ - model: Qwen/Qwen2.5-72B-Instruct
403
+ layer_range: [70, 71]
404
+ - sources:
405
+ - model: Qwen/Qwen2.5-72B-Instruct
406
+ layer_range: [70, 71]
407
+ - sources:
408
+ - model: Qwen/Qwen2.5-72B-Instruct
409
+ layer_range: [71, 72]
410
+ - sources:
411
+ - model: Qwen/Qwen2.5-72B-Instruct
412
+ layer_range: [71, 72]
413
+ - sources:
414
+ - model: Qwen/Qwen2.5-72B-Instruct
415
+ layer_range: [72, 73]
416
+ - sources:
417
+ - model: Qwen/Qwen2.5-72B-Instruct
418
+ layer_range: [72, 73]
419
+ - sources:
420
+ - model: Qwen/Qwen2.5-72B-Instruct
421
+ layer_range: [73, 74]
422
+ - sources:
423
+ - model: Qwen/Qwen2.5-72B-Instruct
424
+ layer_range: [73, 74]
425
+ - sources:
426
+ - model: Qwen/Qwen2.5-72B-Instruct
427
+ layer_range: [74, 75]
428
+ - sources:
429
+ - model: Qwen/Qwen2.5-72B-Instruct
430
+ layer_range: [74, 75]
431
+ - sources:
432
+ - model: Qwen/Qwen2.5-72B-Instruct
433
+ layer_range: [75, 76]
434
+ - sources:
435
+ - model: Qwen/Qwen2.5-72B-Instruct
436
+ layer_range: [75, 76]
437
+ - sources:
438
+ - model: Qwen/Qwen2.5-72B-Instruct
439
+ layer_range: [76, 77]
440
+ - sources:
441
+ - model: Qwen/Qwen2.5-72B-Instruct
442
+ layer_range: [76, 77]
443
+ - sources:
444
+ - model: Qwen/Qwen2.5-72B-Instruct
445
+ layer_range: [77, 80]
446
+ merge_method: passthrough
447
+ dtype: float16
merges.txt ADDED
The diff for this file is too large to render. See raw diff
 
model-00001-of-00057.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a03e3140f1a7018c2057324060aec952d072a9c7041f685148757c69e8fa08a2
3
+ size 4982849880
model-00002-of-00057.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:1f967a6a95d2069f6aa356b54e8c3f8c5d4187e454b3412a5b4720abb4e23394
3
+ size 4964084872
model-00003-of-00057.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:22a44914b798978ac70c8553f9324f1a790f7c25137b22e9ca253ad649fdf84e
3
+ size 4997660376
model-00004-of-00057.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:b51a234408960856a456bcffcb53f94ed54830d3990a4c1f2691ff9118aa56f7
3
+ size 4565680264
model-00005-of-00057.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:e3c3faf98d6a831b3163687c83febe35c7f501c2956a812214a4fcd820d194e7
3
+ size 4964068400
model-00006-of-00057.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:e64943643e37b86ba9bde301671bc835232227e4ff821e94c45f4d1d52157a6c
3
+ size 4915904664
model-00007-of-00057.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:4b5fadfe778bd568c7dd7c745d7c57b46e14e21895405e5bd92902112c53fc7b
3
+ size 4647435976
model-00008-of-00057.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:e28d56257ae1d8613be16cc743d7d1643991b378909f8bd6ad2915aa4a5dfc32
3
+ size 4964068392
model-00009-of-00057.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:2b49001d7d34b32ed53ff608905347d3bb3a0755066311cce341cbea22e72f4e
3
+ size 4599255744
model-00010-of-00057.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:737ad8bf3981af8ed89f4f4bf5e256ea5fe2ed89aebeac3656964611127073d0
3
+ size 4964101384
model-00011-of-00057.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:5622354c5d6862ea7cb2798046de93972e888b367e96f2a3036ec7b6604b0fec
3
+ size 4781653824
model-00012-of-00057.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:04306f1e38559645b50bbdd359244135bf65ad0c85f028b462a489c25be551cf
3
+ size 4964068392
model-00013-of-00057.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:b14c36a4104183e58ed50e5db465458c6e514d57d5d9d9a96fb3cbefc4024494
3
+ size 4599272248
model-00014-of-00057.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:85b993720749c5582681230f51e455a717468d52498ecbccd6fadae854766d33
3
+ size 4964068400
model-00015-of-00057.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:9f6af8da7cd715412f51f43aedbdcfeafe8a1bbf38183683d92667a6f1495376
3
+ size 4997660376
model-00016-of-00057.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:36f4867d615876376085c591f53796f867bd69935ebae21a3e77816c11f05085
3
+ size 4565680264
model-00017-of-00057.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:65f2318623b14be77aec53920d6d77dddb3acfecb440fec0902155417c915f31
3
+ size 4781670312
model-00018-of-00057.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:5eac5deaae09d12b090b90c1b4d4214251c0bef2839c58f6eb8eccde4745ac7e
3
+ size 4964068400
model-00019-of-00057.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ee3a08437131211b1422422956ebe62dcc1210c9b754bbbdbd24d3b6937da615
3
+ size 4997660376
model-00020-of-00057.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:0e3e846fcbb5ad6aec392b0acd6b779fe906f1f6c687a8fc27318ff72a149e01
3
+ size 4565680264
model-00021-of-00057.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ec351d82f19c193925cde4a9f0f968d79c52204cc145e9064b28188a0228aa74
3
+ size 4964068400
model-00022-of-00057.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:4e35173e5d3a851b4d5acd14f850071de41e2fb942e432c7ca8812a6cdbab382
3
+ size 4915904664
model-00023-of-00057.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:18d3a0f738ce56f27f7025a35a09131f39f7b58fbca83fbe6ba342c3c9dffeec
3
+ size 4647435976
model-00024-of-00057.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:97db69174de64dff8d5d8c00e5b45e665b10edcad974cdd8cb3305885cb112ec
3
+ size 4964068392
model-00025-of-00057.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:c2424f5f93f37f520203ddd2cf92fdfbc071c0486782332bdeaad98f44abc9c3
3
+ size 4599272240
model-00026-of-00057.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ac114be58171bb02ebfde7f336eceb63b5c55a2d515421f47dc00bd80538ed12
3
+ size 4964068400
model-00027-of-00057.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:91d0a9b1c99a69d22be2c04e40e8342e1d425daf159d2ac06b99c8be2af95d52
3
+ size 4997660376
model-00028-of-00057.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a7ca30b16861df44bef9932c8ea562f3d4debd380f331554895e8d566a767e83
3
+ size 4565680264
model-00029-of-00057.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:9b62597ad675e7f636ac51f42214cae22e920c075e9edc2a6bae2986774a17b3
3
+ size 4964068400
model-00030-of-00057.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:89d9df5ccec6b528758a4409a113f3c171c68088d41ae5d02a674c70c13a4d25
3
+ size 4915904664
model-00031-of-00057.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:40a80ee8b062b529bab75e0286e6ada247ab48553a8b3633f0c2e948cf046c97
3
+ size 4647435976
model-00032-of-00057.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ce0103aab316b1b1b56cae71baea2e11b7590161eb7877e1b1b095fcc184f57c
3
+ size 4964068392
model-00033-of-00057.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ba07f93b0246a923434045d683fae434f4dc9eb55f678480016bfc9ec393bcb1
3
+ size 4599272240
model-00034-of-00057.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:afa7b400d3e0fe9fee57b2d38bbe3a9e77753f8fe253fe4b4e3091683db56ddc
3
+ size 4964068400
model-00035-of-00057.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:5b4b7a03a12e080afd1881ed9faebbe53ca08b962283a5bf97948d6c6f0a7695
3
+ size 4997660392
model-00036-of-00057.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:85f6a2c28a3ecabe70def592d2c4a6fe4e61502c3a5a9ac39dbdd76998aaa907
3
+ size 4565680304
model-00037-of-00057.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:699dffed7b0d372c6dedb934f97f59556bdb6676fbc35d98538326c4863b6527
3
+ size 4964068424
model-00038-of-00057.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:8f216f422a104c36bc83c9cf2a6376660d60aaf45b0b20fd11074eab96b69b71
3
+ size 4915904704
model-00039-of-00057.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:b5f43c71dfd17d6ba19c6b4f2b4ec0bdefa6fc825f894d11581713dd0a117e5b
3
+ size 4647436016
model-00040-of-00057.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:077a4d6f4089055c73ce0fba5075b8b6224f3648d5a4c4191c6cd9ab52169c9d
3
+ size 4964068416
model-00041-of-00057.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:d825cfbe4af38d58d65bbc9c3d586d8025f1e8235667a76a3aa9bf84b306da5d
3
+ size 4599272248
model-00042-of-00057.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:89cdcdc04779573ae4745c51410f720fc25fa3ab3fc0d7462312765eb6ffcca9
3
+ size 4964068424
model-00043-of-00057.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:d53e231ac5eaa15bad031fd320f63e44de58a523116eff0ed89a0bf07b7c8212
3
+ size 4997660416
model-00044-of-00057.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:7d3bce2da50dc7a853d74a4c1c00cc7ecf8ed6b85add208143c03079e6fcc8e8
3
+ size 4565680304
model-00045-of-00057.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:24d4fcc2c3b52869fb66a7d665a6850061fbc150b95e290f70aeb8905da762f3
3
+ size 4964068424