AXCXEPT/EZO-Llama-3.2-3B-Instruct-dpoE

モデル情報 / Model Information

This model is based on Meta AI's Llama 3.2 with fine tuning to improve performance on Japanese tasks. Based on the base Llama-3.2-3B-Instruct, performance was improved on the Japanese-English dataset.

このモデルは、Meta AI の Llama 3.2 をベースに、日本語タスクでの性能を向上させるためにファインチューニングを行ったものです。ベースとなるLlama-3.2-3B-Instructをもとに、日英のデータセットで性能向上を行いました。

法的通知 / Legal Notice

This model is subject to the Llama 3.2 Community License Agreement. For detailed information, please refer to the official Llama license page: Llama 3.2 License

このモデルは Llama 3.2 Community License Agreement に従います。詳細については、Llama の公式ライセンスページをご参照ください。

使用方法 / Usage

import torch
from transformers import pipeline

model_id = "AXCXEPT/EZO-Llama-3.2-3B-Instruct-dpoE"
pipe = pipeline(
    "text-generation",
    model=model_id,
    torch_dtype=torch.bfloat16,
    device_map="auto",
)
messages = [
    {"role": "system", "content": "You are a pirate chatbot who always responds in pirate speak!"},
    {"role": "user", "content": "Who are you?"},
]
outputs = pipe(
    messages,
    max_new_tokens=256,
)
print(outputs[0]["generated_text"][-1])

ベンチマーク結果 / Benchmark Results

[Disclaimer]

This model is provided for research and development purposes only and should be considered an experimental prototype. It is not intended for commercial use or deployment in mission-critical environments. Use of this model is at the user's own risk and its performance and results are not guaranteed. Axcxept Corporation shall not be liable for any direct, indirect, special, incidental, or consequential damages or any loss whatsoever resulting from the use of this model, regardless of the results obtained. Users fully understand the risks associated with the use of this model and use it at their own discretion.

このモデルは研究開発のみを目的として提供されるものであり、実験的なプロトタイプとみなされるべきモデルです。商業的な使用やミッションクリティカルな環境への配備を意図したものではありません。本モデルの使用は、使用者の責任において行われるものとし、その性能および結果は保証されません。 Axcxept株式会社は、直接的、間接的、特別、偶発的、結果的な損害、または本モデルの使用から生じるいかなる損失に対しても、得られた結果にかかわらず、一切の責任を負いません。利用者は、本モデルの使用に伴うリスクを十分に理解し、自己の判断で使用するものとします。

[Hardware]

H100 × 8(Running in 3h)

クレジット / Credits

This model is based on Meta AI's Llama 3.2. We acknowledge and thank the Meta AI team for their work on the base model.

このモデルは Meta AI の Llama 3.2 をベースにしています。ベースモデルの開発に携わった Meta AI チームに感謝と尊敬の意を表します。

AXCXEPT
/

EZO-Llama-3.2-3B-Instruct-dpoE