qgyd2021
/

chinese_chitchat

Text Generation

Generated from Trainer

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

chinese_chitchat / README.md

qgyd2021's picture

Update README.md

83fe105 12 months ago

|

934 Bytes

	---
	base_model: qgyd2021/chinese_chitchat
	tags:
	- generated_from_trainer
	model-index:
	- name: chinese_chitchat
	results: []
	---

	<!-- This model card has been generated automatically according to the information the Trainer had access to. You
	should probably proofread and complete it, then remove this comment. -->

	# chinese_chitchat

	这个模型是基于 [uer/gpt2-chinese-cluecorpussmall](https://huggingface.co/uer/gpt2-chinese-cluecorpussmall) 在 [qgyd2021/chinese_chitchat](https://huggingface.co/datasets/qgyd2021/chinese_chitchat) 数据集的 [xiaohuangji](https://huggingface.co/datasets/qgyd2021/chinese_chitchat/viewer/xiaohuangji) 子集上进行微调的。

	由于该数据集(xiaohuangji)中问答不相关(答非所问)的样本很多，噪音大，因此虽然有45万样本，但感觉效果并不太好。

	训练了 2 次，第一次 26000 步，第二次 8000 步，总共大约是 10 个 epoch 的样子。