chansurgeplus commited on
Commit
5713e4b
1 Parent(s): cf54174

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +7 -3
README.md CHANGED
@@ -29,17 +29,21 @@ OpenBezoar-HH-RLHF-SFT is an LLM that is built upon the OpenLLaMA 3B v2 architec
29
 
30
  ## Instruction Format
31
 
32
- We follow the typical format for instruction-based prompt templates, with a system prompt followed up by the user prompt. Both begins with a prefix and ends with two newline characters as described below. It is important to utilize this template in order to obtain best responses for instruction fine-tuning related tasks.
33
  ```
34
- ### System: {system}
 
35
 
36
- ### Instruction: {instruction}
 
37
 
38
  ### Response:
39
  ```
40
 
41
  Notice that **no** end-of-sentence (eos) token is being appended.
42
 
 
 
43
  ## Limitations
44
 
45
  - The model might not consistently show improved abilities to follow instructions, and it could respond inappropriately or get stuck in loops.
 
29
 
30
  ## Instruction Format
31
 
32
+ We follow a modified version of the Alpaca prompt template as shown below. It is important to utilize this template in order to obtain best responses for instruction related tasks.
33
  ```
34
+ ### System:
35
+ Below is an instruction that describes a task, optionally paired with an input that provides further context following that instruction. Write a response that appropriately completes the request.
36
 
37
+ ### Instruction:
38
+ {instruction}
39
 
40
  ### Response:
41
  ```
42
 
43
  Notice that **no** end-of-sentence (eos) token is being appended.
44
 
45
+ *Note: The system prompt shown in the following figure is the one that the model has been trained on most of the time. However, you may attempt to use any other system prompt that is available in the [Orca](https://arxiv.org/abs/2306.02707) scheme.*
46
+
47
  ## Limitations
48
 
49
  - The model might not consistently show improved abilities to follow instructions, and it could respond inappropriately or get stuck in loops.