Update README.md
README.md
CHANGED
@@ -115,6 +115,7 @@ We used LoRA to speed up the training process, targeting the q_proj and v_proj m
 
 ## Uses
 
+For improved practical inference speed, we strongly recommend running Jellyfish using [vLLM](https://github.com/vllm-project/vllm).
 <!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. -->
 We provide the prompts used for both the model's fine-tuning and inference.
 You can structure your data according to these prompts.
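
As a rough illustration of the vLLM recommendation added in this commit, the snippet below is a minimal sketch of offline inference with vLLM's Python API. The Hugging Face repo id `NECOUDBFM/Jellyfish-13B`, the prompt text, and the sampling settings are assumptions for illustration, not part of this change.

```python
# Minimal vLLM offline-inference sketch (repo id and prompt are assumed placeholders).
from vllm import LLM, SamplingParams

# Load the model; substitute the actual Jellyfish checkpoint you intend to serve.
llm = LLM(model="NECOUDBFM/Jellyfish-13B")

# Greedy decoding keeps outputs deterministic for data-preprocessing style tasks.
params = SamplingParams(temperature=0.0, max_tokens=256)

# Structure the prompt according to the fine-tuning/inference prompts provided in the README.
prompt = "..."  # fill in with a prompt built from the provided templates
outputs = llm.generate([prompt], params)
print(outputs[0].outputs[0].text)
```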