|
--- |
|
license: mit |
|
pipeline_tag: text-generation |
|
tags: |
|
- causal_lm |
|
base_model: Walmart-the-bag/zephyr-quiklang-3b |
|
inference: false |
|
datasets: |
|
- teknium/openhermes |
|
--- |
|
|
|
# Description |
|
This is the 4K version of https://huggingface.co/Walmart-the-bag/zephyr-quiklang-3b with 1000 more samples of openhermes. |
|
|
|
# Original Model Description |
|
This is a finetune of [StableLM-Zephyr-3B](https://huggingface.co/stabilityai/stablelm-zephyr-3b) with 2 datasets, toxic-dpo and openhermes with 10000 samples. |
|
|
|
# Training Parameters |
|
- 1xA6000-48GB |
|
- batch_size: 6 |
|
- learning_rate: 5e-5 |
|
|
|
# Datasets: |
|
- unalignment/toxic-dpo-v0.1 |
|
- teknium/openhermes |
|
|
|
# Metrics/Basic Eval: |
|
"predict_bleu-4": 31.594154999999997, |
|
"predict_rouge-1": 44.092935, |
|
"predict_rouge-2": 22.276081000000005, |
|
"predict_rouge-l": 34.506909, |
|
"predict_runtime": 121.7549, |
|
"predict_samples_per_second": 0.821, |
|
"predict_steps_per_second": 0.107 |
|
|
|
# License |
|
### MIT |
|
![image/png](https://cdn-uploads.huggingface.co/production/uploads/6530994e70a88b63f007324d/Zf3wrU5zn2uVyoYAZ47rQ.png) |
|
|
|
``` |
|
Permission is hereby granted, free of charge, to any person obtaining a copy |
|
of this software and associated documentation files (the "Software"), to deal |
|
in the Software without restriction, including without limitation the rights |
|
to use, copy, modify, merge, publish, distribute, sublicense, and/or sell |
|
copies of the Software, and to permit persons to whom the Software is |
|
furnished to do so, subject to the following conditions: |
|
|
|
The above copyright notice and this permission notice shall be included in all |
|
copies or substantial portions of the Software. |
|
|
|
THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR |
|
IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, |
|
FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE |
|
AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER |
|
LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, |
|
OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE |
|
SOFTWARE. |
|
``` |