chansurgeplus
commited on
Commit
•
5713e4b
1
Parent(s):
cf54174
Update README.md
Browse files
README.md
CHANGED
@@ -29,17 +29,21 @@ OpenBezoar-HH-RLHF-SFT is an LLM that is built upon the OpenLLaMA 3B v2 architec
|
|
29 |
|
30 |
## Instruction Format
|
31 |
|
32 |
-
We follow
|
33 |
```
|
34 |
-
### System:
|
|
|
35 |
|
36 |
-
### Instruction:
|
|
|
37 |
|
38 |
### Response:
|
39 |
```
|
40 |
|
41 |
Notice that **no** end-of-sentence (eos) token is being appended.
|
42 |
|
|
|
|
|
43 |
## Limitations
|
44 |
|
45 |
- The model might not consistently show improved abilities to follow instructions, and it could respond inappropriately or get stuck in loops.
|
|
|
29 |
|
30 |
## Instruction Format
|
31 |
|
32 |
+
We follow a modified version of the Alpaca prompt template as shown below. It is important to utilize this template in order to obtain best responses for instruction related tasks.
|
33 |
```
|
34 |
+
### System:
|
35 |
+
Below is an instruction that describes a task, optionally paired with an input that provides further context following that instruction. Write a response that appropriately completes the request.
|
36 |
|
37 |
+
### Instruction:
|
38 |
+
{instruction}
|
39 |
|
40 |
### Response:
|
41 |
```
|
42 |
|
43 |
Notice that **no** end-of-sentence (eos) token is being appended.
|
44 |
|
45 |
+
*Note: The system prompt shown in the following figure is the one that the model has been trained on most of the time. However, you may attempt to use any other system prompt that is available in the [Orca](https://arxiv.org/abs/2306.02707) scheme.*
|
46 |
+
|
47 |
## Limitations
|
48 |
|
49 |
- The model might not consistently show improved abilities to follow instructions, and it could respond inappropriately or get stuck in loops.
|