# MentaLLaMA-chat-13B

MentaLLaMA-chat-13B is part of the [MentaLLaMA](https://github.com/SteveKGYang/MentalLLaMA) project, the first open-source large language model (LLM) series for interpretable mental health analysis with instruction-following capability. This model is fine-tuned from the Meta LLaMA2-chat-13B foundation model on the full IMHI instruction-tuning data. The model is expected to make complex mental health analyses for various mental health conditions and give reliable explanations for each of its predictions. It is fine-tuned on the IMHI dataset with 75K high-quality natural language instructions to boost its performance in downstream tasks. We perform a comprehensive evaluation on the IMHI benchmark with 20K test samples. The results show that MentaLLaMA approaches state-of-the-art discriminative methods in correctness and generates high-quality explanations.

In addition to MentaLLaMA-chat-13B, the MentaLLaMA project includes three other models: MentaLLaMA-chat-7B, MentalBART, and MentalT5.

- **MentaLLaMA-chat-7B**: This model is fine-tuned from the Meta LLaMA2-chat-7B foundation model on the full IMHI instruction-tuning data. The training data covers 10 mental health analysis tasks.

- **MentalBART**: This model is fine-tuned from the BART-large foundation model on the full IMHI-completion data. The training data covers 10 mental health analysis tasks. It does not have instruction-following ability, but it is more lightweight and performs well in interpretable mental health analysis in a completion-based manner.

- **MentalT5**: This model is fine-tuned from the T5-large foundation model on the full IMHI-completion data. The training data covers 10 mental health analysis tasks. It does not have instruction-following ability, but it is more lightweight and performs well in interpretable mental health analysis in a completion-based manner.

## Usage
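The body of this section is truncated in the excerpt; only the note about using the GPU if it's available survives. As a minimal sketch of loading the model with the Hugging Face Transformers library — where the repo id `klyang/MentaLLaMA-chat-13B` and the prompt template are assumptions for illustration, not confirmed by this excerpt — usage might look like:

```python
def build_prompt(post: str, question: str) -> str:
    """Format an instruction prompt for the model.

    The exact template is an assumption for illustration; check the
    MentaLLaMA repository for the prompts used in the IMHI benchmark.
    """
    return f"Consider this post: {post} Question: {question}"


def load_model(model_id: str = "klyang/MentaLLaMA-chat-13B"):
    """Load the tokenizer and model, placing the model on the GPU if available.

    The Hugging Face repo id above is an assumption, not confirmed by
    this excerpt. Imports are local so defining the helpers is cheap.
    """
    import torch
    from transformers import LlamaForCausalLM, LlamaTokenizer

    device = "cuda" if torch.cuda.is_available() else "cpu"
    tokenizer = LlamaTokenizer.from_pretrained(model_id)
    model = LlamaForCausalLM.from_pretrained(model_id).to(device)
    return tokenizer, model, device


def analyze(post: str, question: str, max_new_tokens: int = 256) -> str:
    """Run one interpretable mental health analysis query."""
    tokenizer, model, device = load_model()
    inputs = tokenizer(build_prompt(post, question), return_tensors="pt").to(device)
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)


# Building the prompt is cheap and needs no download:
print(build_prompt("I can't sleep and nothing feels worth doing.",
                   "Does the poster suffer from depression?"))
```

Calling `analyze(...)` downloads the full 13B checkpoint, so a machine with sufficient GPU memory (or CPU RAM) is assumed.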

MentaLLaMA-chat-13B is licensed under the MIT license. For more details, please see the MIT license file.

## Citation

If you use MentaLLaMA-chat-13B in your work, please cite our paper: