kangqi-ni commited on
Commit
dba63ce
1 Parent(s): cbad772

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +4 -3
README.md CHANGED
@@ -8,9 +8,9 @@ tags:
8
  - biology
9
  - education
10
  ---
11
- This model is trained on zephyr-7b-beta with FastChat (for SFT) and TRL (for DPO). The purpose is to develop a more capable educational chatbot that helps students learn biology.
12
 
13
- If you use this work, please cite: Pedagogical Alignment of Large Language Models https://arxiv.org/abs/2402.05000
14
  ```
15
  @misc{sonkar2024pedagogical,
16
  title={Pedagogical Alignment of Large Language Models},
@@ -18,6 +18,7 @@ If you use this work, please cite: Pedagogical Alignment of Large Language Model
18
  year={2024},
19
  eprint={2402.05000},
20
  archivePrefix={arXiv},
21
- primaryClass={cs.CL}
 
22
  }
23
  ```
 
8
  - biology
9
  - education
10
  ---
11
+ This model is fine-tuned on Mistral-7B-Instruct-v0.2 with SFT and DPO. The purpose is to develop a more capable educational chatbot that helps students study biology.
12
 
13
+ If you use this work, please cite:
14
  ```
15
  @misc{sonkar2024pedagogical,
16
  title={Pedagogical Alignment of Large Language Models},
 
18
  year={2024},
19
  eprint={2402.05000},
20
  archivePrefix={arXiv},
21
+ primaryClass={cs.CL},
22
+ url={https://arxiv.org/abs/2402.05000}
23
  }
24
  ```