kangqi-ni
/

zephyr-7b-beta_bio-tutor_sft

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

zephyr-7b-beta_bio-tutor_sft / README.md

kangqi-ni's picture

Update README.md

36b3ec0 verified about 1 month ago

|

613 Bytes

	---
	license: apache-2.0
	language:
	- en
	tags:
	- dpo
	- biology
	- education
	- zephyr
	---

	This model is fine-tuned on zephyr-7b-beta with SFT. The purpose is to develop a more capable educational chatbot that helps students study biology.

	If you use this work, please cite:
	```
	@misc{sonkar2024pedagogical,
	title={Pedagogical Alignment of Large Language Models},
	author={Shashank Sonkar and Kangqi Ni and Sapana Chaudhary and Richard G. Baraniuk},
	year={2024},
	eprint={2402.05000},
	archivePrefix={arXiv},
	primaryClass={cs.CL},
	url={https://arxiv.org/abs/2402.05000}
	}
	```