---
datasets:
- infCapital/vnnews_corpus_100K
language:
- vi
---
## Base Model: LLaMA-2 7B Chat HF
+ Extended the vocabulary to 44,800 tokens for better Vietnamese coverage
+ Continually pre-trained on >2B Vietnamese tokens
+ Training profile: LoRA (rank=32, alpha=128, fp16), 1 epoch, block size = 512. Took ~300 GPU hours on an RTX 4090 24GB
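As a rough sketch of what the LoRA settings above imply, assuming the adapters target the four attention projections of LLaMA-2 7B (hidden size 4096, 32 layers) — the target modules and shapes are assumptions, not stated in this card:

```python
# LoRA adds low-rank factors A (r x d_in) and B (d_out x r) to each adapted
# weight, so trainable params per projection = r * (d_in + d_out).
rank, alpha = 32, 128
hidden, layers, attn_projs = 4096, 32, 4  # assumed LLaMA-2 7B attention shapes

scaling = alpha / rank                     # LoRA scales the update by alpha / r
per_proj = rank * (hidden + hidden)        # A and B for one 4096x4096 projection
trainable = per_proj * attn_projs * layers

print(f"scaling = {scaling}")                        # 4.0
print(f"trainable params ~ {trainable / 1e6:.1f}M")  # ~33.6M vs ~6.7B base weights
```

Even under these assumptions, only a few tens of millions of parameters are updated, which is what makes a 7B continual pre-train feasible on a single 24GB consumer GPU.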
## Suitable for
+ Further training / fine-tuning for Vietnamese tasks