Walmart-the-bag committed
Commit • 7e94718
1 Parent(s): 9f0534a

Update README.md

README.md CHANGED
@@ -1,3 +1,59 @@
 ---
 license: mit
 ---
+---
+license: mit
+pipeline_tag: text-generation
+tags:
+- causal_lm
+base_model: Walmart-the-bag/zephyr-quiklang-3b
+inference: false
+---
+
+# Description
+This is the 4K version of https://huggingface.co/Walmart-the-bag/zephyr-quiklang-3b
+
+# Original Model Description
+This is a finetune of [StableLM-Zephyr-3B](https://huggingface.co/stabilityai/stablelm-zephyr-3b) with 2 datasets, toxic-dpo and openhermes with 10000 samples.
+
+# Training Parameters
+- 1xA6000-48GB
+- batch_size: 6
+- learning_rate: 5e-5
+
+# Datasets:
+- unalignment/toxic-dpo-v0.1
+- teknium/openhermes
+
+# Metrics/Basic Eval:
+"predict_bleu-4": 31.594154999999997,
+"predict_rouge-1": 44.092935,
+"predict_rouge-2": 22.276081000000005,
+"predict_rouge-l": 34.506909,
+"predict_runtime": 121.7549,
+"predict_samples_per_second": 0.821,
+"predict_steps_per_second": 0.107
+
+# License
+### MIT
+![image/png](https://cdn-uploads.huggingface.co/production/uploads/6530994e70a88b63f007324d/Zf3wrU5zn2uVyoYAZ47rQ.png)
+
+```
+Permission is hereby granted, free of charge, to any person obtaining a copy
+of this software and associated documentation files (the "Software"), to deal
+in the Software without restriction, including without limitation the rights
+to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
+copies of the Software, and to permit persons to whom the Software is
+furnished to do so, subject to the following conditions:
+
+The above copyright notice and this permission notice shall be included in all
+copies or substantial portions of the Software.
+
+THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
+IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
+FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
+AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
+LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
+OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
+SOFTWARE.
+```
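
Reviewer note on the Metrics/Basic Eval values added in this commit: the README does not say how many samples the evaluation covered, but the three throughput figures allow a rough estimate. The sketch below is not part of the commit. It assumes that `predict_samples_per_second` and `predict_steps_per_second` are simply sample and step counts divided by `predict_runtime`, which is the convention used by the Hugging Face `Trainer`; the README does not confirm which tool produced these numbers. The function name `estimate_eval_size` is hypothetical.

```python
import math

# Values copied verbatim from the Metrics/Basic Eval section of the README.
metrics = {
    "predict_runtime": 121.7549,
    "predict_samples_per_second": 0.821,
    "predict_steps_per_second": 0.107,
}


def estimate_eval_size(m):
    """Estimate (samples, steps, samples per step) from throughput figures.

    Assumption (not stated in the README): each rate equals count / runtime.
    The rates are rounded to three decimals, so the products are rounded to
    the nearest integer.
    """
    runtime = m["predict_runtime"]
    samples = round(runtime * m["predict_samples_per_second"])
    steps = round(runtime * m["predict_steps_per_second"])
    # The last step may be a partial batch, so take the ceiling.
    per_step = math.ceil(samples / steps)
    return samples, steps, per_step


samples, steps, per_step = estimate_eval_size(metrics)
print(f"~{samples} samples over ~{steps} steps (~{per_step} per step)")
```

Under that assumption the figures are consistent with roughly 100 evaluation samples processed in about 13 steps, so the BLEU and ROUGE scores are best read as a small-sample sanity check rather than a benchmark result.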