Update README.md #2
by mojanjp - opened

README.md CHANGED
@@ -16,6 +16,16 @@ Our model hasn't been fine-tuned through reinforcement learning from human feedb

## Intended Uses

+Below is example code for loading phi-2; we support two modes of execution for the model:
+1. Loading in fp-16 format with flash-attention support:
+```python
+model = AutoModelForCausalLM.from_pretrained('microsoft/phi-2', torch_dtype='auto', flash_attn=True, flash_rotary=True, fused_dense=True, trust_remote_code=True)
+```
+2. Loading in fp-16 without flash-attention:
+```python
+model = AutoModelForCausalLM.from_pretrained('microsoft/phi-2', torch_dtype='auto', trust_remote_code=True)
+```
+
Phi-2 is intended for research purposes only. Given the nature of the training data, the phi-2 model is best suited for prompts using the QA format, the chat format, and the code format.

#### QA format:
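The loading snippets added in the hunk above are fragments on their own. Below is a self-contained sketch that pairs mode 2 (plain fp-16) with a simple generation call; the import, device placement, prompt text, and generation settings are illustrative assumptions rather than part of the diff, and a CUDA GPU is assumed since the weights load in fp-16.

```python
# Sketch only: load phi-2 in fp-16 (mode 2 above) and answer a question.
# Assumes `transformers` is installed and a CUDA GPU is available.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model = AutoModelForCausalLM.from_pretrained(
    'microsoft/phi-2', torch_dtype='auto', trust_remote_code=True
).to('cuda')
tokenizer = AutoTokenizer.from_pretrained('microsoft/phi-2', trust_remote_code=True)

# Illustrative QA-style prompt; see the "QA format" section for the
# prompt template the model card actually recommends.
prompt = 'Instruct: Explain what half-precision (fp-16) inference is.\nOutput:'
inputs = tokenizer(prompt, return_tensors='pt').to('cuda')

with torch.no_grad():
    output_ids = model.generate(**inputs, max_new_tokens=64)

print(tokenizer.decode(output_ids[0], skip_special_tokens=True))
```

Mode 1 differs only in the extra `flash_attn`, `flash_rotary`, and `fused_dense` flags passed to `from_pretrained`, as shown in the diff.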