Commit ca2c22c (parent: be6a47b) by ohwi: Create README.md

Files changed (1): README.md (+60, -0)

---
language:
- ja
tags:
- japanese-stablelm
- causal-lm
pipeline_tag: text-generation
base_model: stabilityai/japanese-stablelm-base-gamma-7b
datasets: argilla/ultrafeedback-binarized-preferences-cleaned
license: apache-2.0
extra_gated_fields:
  Name: text
  Email: text
  Country: text
  Organization or Affiliation: text
  I allow Stability AI to contact me about information related to its models and research: checkbox
---

# Reproduced Japanese Stable LM Instruct Gamma 7B

## Model Description

This is a reproduction of a 7B-parameter decoder-only Japanese language model fine-tuned on instruction-following datasets, built on top of the base model [Japanese Stable LM Base Gamma 7B](https://huggingface.co/stabilityai/japanese-stablelm-base-gamma-7b).

This model was trained with the [notus](https://github.com/argilla-io/notus) code base.
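
Notus fine-tunes models on preference data with Direct Preference Optimization (DPO), which matches the `argilla/ultrafeedback-binarized-preferences-cleaned` dataset declared in the front matter. Below is a minimal sketch of comparable preference tuning with `trl`'s `DPOTrainer` (0.7-era API); the dataset schema handling and all hyperparameters are illustrative assumptions, not the exact notus configuration.

```python
from datasets import load_dataset
from transformers import AutoModelForCausalLM, AutoTokenizer, TrainingArguments
from trl import DPOTrainer

base_id = "stabilityai/japanese-stablelm-base-gamma-7b"
tokenizer = AutoTokenizer.from_pretrained(base_id)
model = AutoModelForCausalLM.from_pretrained(base_id)

# Assumed schema: "chosen"/"rejected" hold chat-style message lists, while
# DPOTrainer expects plain strings, so keep only the final assistant reply.
ds = load_dataset("argilla/ultrafeedback-binarized-preferences-cleaned", split="train")
ds = ds.map(
    lambda ex: {
        "prompt": ex["prompt"],
        "chosen": ex["chosen"][-1]["content"],
        "rejected": ex["rejected"][-1]["content"],
    },
    remove_columns=ds.column_names,
)

trainer = DPOTrainer(
    model=model,
    ref_model=None,  # None lets trl build a frozen reference copy of the model
    beta=0.1,        # illustrative KL-penalty strength
    train_dataset=ds,
    tokenizer=tokenizer,
    args=TrainingArguments(
        output_dir="dpo-out",
        per_device_train_batch_size=1,
        remove_unused_columns=False,  # keep the prompt/chosen/rejected columns
    ),
)
trainer.train()
```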

*If you are looking for the official model, please check [Japanese Stable LM Instruct Gamma 7B](https://huggingface.co/stabilityai/japanese-stablelm-instruct-gamma-7b).*
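
## Usage

A minimal inference sketch with 🤗 Transformers is below. The repository id `ohwi/japanese-stablelm-instruct-gamma-7b-repro` and the Alpaca-style prompt template are assumptions for illustration; check this repository's actual id and the base model card for the exact prompt format.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

# Hypothetical repository id; replace with this model's actual id.
model_id = "ohwi/japanese-stablelm-instruct-gamma-7b-repro"

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.float16,  # half precision so the 7B model fits on one GPU
    device_map="auto",
)

# Assumed instruction-style prompt ("Below is an instruction that describes a
# task. Write a response that appropriately completes the request."):
prompt = (
    "以下は、タスクを説明する指示です。要求を適切に満たす応答を書きなさい。\n\n"
    "### 指示:\n日本の首都はどこですか?\n\n"
    "### 応答:\n"
)

inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
with torch.no_grad():
    output = model.generate(
        **inputs,
        max_new_tokens=256,
        do_sample=True,
        temperature=0.7,
        top_p=0.95,
    )

# Decode only the newly generated tokens.
print(tokenizer.decode(output[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True))
```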

## Model Details

### Training Datasets

- [Japanese translation of the Databricks Dolly-15k dataset](https://huggingface.co/datasets/kunishou/databricks-dolly-15k-ja)
- [Japanese translation of a subset of the Anthropic HH dataset](https://huggingface.co/datasets/fujiki/japanese_hh-rlhf-49k)
- [Wikinews](https://ja.wikinews.org/wi) [subset](https://huggingface.co/datasets/fujiki/llm-japanese-dataset_wikinews) of the [izumi-lab/llm-japanese-dataset](https://huggingface.co/datasets/izumi-lab/llm-japanese-dataset)
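
These datasets are all published on the Hugging Face Hub; a quick sketch for inspecting them with the 🤗 `datasets` library follows. The `train` split names are assumptions and may differ per dataset.

```python
from datasets import load_dataset

# Split names assumed; check each dataset card for the actual configuration.
dolly_ja = load_dataset("kunishou/databricks-dolly-15k-ja", split="train")
hh_ja = load_dataset("fujiki/japanese_hh-rlhf-49k", split="train")
wikinews = load_dataset("fujiki/llm-japanese-dataset_wikinews", split="train")

for name, ds in [("dolly-ja", dolly_ja), ("hh-ja", hh_ja), ("wikinews", wikinews)]:
    print(name, len(ds), ds.column_names)
```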

### Benchmarks

Results were evaluated with [Nejumi-leaderboard Neo](https://github.com/wandb/llm-leaderboard/tree/b2723944d4955768cb93c18ffe162a8ff4e88955).

- llm-jp-eval:

|AVG|EL|FA|MC|MR|NLI|QA|RC|chabsa_set_f1|jamp_exact_match|janli_exact_match|jcommonsenseqa_exact_match|jemhopqa_char_f1|jnli_exact_match|jsem_exact_match|jsick_exact_match|jsquad_char_f1|niilc_char_f1|
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
|0.1691|0.0|0.0|0.24|0.0|0.286|0.1688|0.4887|0.0|0.3|0.56|0.24|0.1334|0.08|0.28|0.21|0.4887|0.2042|

- Japanese MT-Bench:

|coding|extraction|humanities|math|reasoning|roleplay|stem|writing|
|---|---|---|---|---|---|---|---|
|1.3|1.75|2.35|1.45|3.4|5.8|4.3|3.1|

- Overall Average: 0.266
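
For reference, the mean of the eight Japanese MT-Bench category scores above works out to about 2.93 on its 10-point scale; the snippet below is just that convenience calculation, not part of the leaderboard tooling.

```python
# Mean of the Japanese MT-Bench category scores listed above (10-point scale).
scores = {
    "coding": 1.3, "extraction": 1.75, "humanities": 2.35, "math": 1.45,
    "reasoning": 3.4, "roleplay": 5.8, "stem": 4.3, "writing": 3.1,
}
print(sum(scores.values()) / len(scores))  # 2.93125
```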