|
--- |
|
license: cc-by-nc-sa-4.0 |
|
datasets: |
|
- HumanF-MarkrAI/Korean-RAG-ver2 |
|
language: |
|
- ko |
|
tags: |
|
- Retrieval Augmented Generation |
|
- RAG |
|
- Multi-domain |
|
--- |
|
|
|
# MarkrAI/RAG-KO-Mixtral-7Bx2-v2.0 |
|
|
|
# Model Details |
|
|
|
## Model Developers |
|
MarkrAI - AI Researchers |
|
|
|
## Base Model |
|
[DopeorNope/Ko-Mixtral-v1.4-MoE-7Bx2](https://huggingface.co/DopeorNope/Ko-Mixtral-v1.4-MoE-7Bx2). |
|
|
|
## Instruction Tuning Method

Instruction tuning was performed with QLoRA, using the configuration below.
|
```
4-bit quantization
Lora_r: 64
Lora_alpha: 64
Lora_dropout: 0.05
Lora_target_modules: [embed_tokens, q_proj, k_proj, v_proj, o_proj, gate, w1, w2, w3, lm_head]
```
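
A minimal sketch of how this configuration might be expressed with `bitsandbytes` and `peft`. The exact training script is not published, so the NF4 quantization type and fp16 compute dtype below are assumptions following the standard QLoRA recipe; the LoRA values are taken directly from the card above.

```
from transformers import BitsAndBytesConfig
from peft import LoraConfig
import torch

# 4-bit quantization; NF4 and the fp16 compute dtype are assumptions
# (only "4-bit quantization" is stated above).
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.float16,
)

# LoRA settings taken directly from the configuration above.
lora_config = LoraConfig(
    r=64,
    lora_alpha=64,
    lora_dropout=0.05,
    target_modules=[
        "embed_tokens", "q_proj", "k_proj", "v_proj", "o_proj",
        "gate", "w1", "w2", "w3", "lm_head",
    ],
    task_type="CAUSAL_LM",
)
```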
|
|
|
## Hyperparameters |
|
```
Epoch: 5
Batch size: 64
Learning_rate: 1e-5
Learning scheduler: linear
Warmup_ratio: 0.06
```
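
One way these hyperparameters could map onto `transformers.TrainingArguments` (a sketch only; the per-device/accumulation split behind the effective batch size of 64 is not published, so those values and the output path are illustrative):

```
from transformers import TrainingArguments

training_args = TrainingArguments(
    output_dir="rag-ko-mixtral-7bx2-v2.0",  # hypothetical output path
    num_train_epochs=5,
    per_device_train_batch_size=4,   # 4 x 16 accumulation = effective batch 64 (assumed split)
    gradient_accumulation_steps=16,
    learning_rate=1e-5,
    lr_scheduler_type="linear",
    warmup_ratio=0.06,
)
```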
|
|
|
## Datasets |
|
Private dataset: [HumanF-MarkrAI/Korean-RAG-ver2](https://huggingface.co/datasets/HumanF-MarkrAI/Korean-RAG-ver2)
|
```
Created using AIHub datasets.
```
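
Since the dataset is private, loading it requires an authorized Hugging Face token (a sketch; the token value is a placeholder):

```
from datasets import load_dataset

# Access is gated: pass a token with permission to read the private dataset.
rag_data = load_dataset("HumanF-MarkrAI/Korean-RAG-ver2", token="hf_...")
```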
|
|
|
## Implementation Code
|
```
from transformers import AutoModelForCausalLM, AutoTokenizer
import torch

repo = "MarkrAI/RAG-KO-Mixtral-7Bx2-v2.0"

# Load the fine-tuned model in half precision and let Accelerate
# place the layers across available devices.
markrAI_RAG = AutoModelForCausalLM.from_pretrained(
    repo,
    return_dict=True,
    torch_dtype=torch.float16,
    device_map='auto'
)

# Matching tokenizer from the same repository.
markrAI_RAG_tokenizer = AutoTokenizer.from_pretrained(repo)
```
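
A hedged usage sketch: the model is tuned for retrieval-augmented generation, so a typical call supplies retrieved context alongside the question. The exact prompt template used in training is not published; the plain concatenation below is illustrative only.

```
# Illustrative prompt only: the training prompt template is not published.
context = "<retrieved passage>"
question = "<user question>"
prompt = f"{context}\n\n{question}"

inputs = markrAI_RAG_tokenizer(prompt, return_tensors="pt").to(markrAI_RAG.device)
outputs = markrAI_RAG.generate(**inputs, max_new_tokens=256, do_sample=False)
print(markrAI_RAG_tokenizer.decode(outputs[0], skip_special_tokens=True))
```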
|
|
|
# Model Benchmark |
|
- Coming soon... |