HCZhang commited on
Commit
6d4ddfd
1 Parent(s): ac3ca25

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +87 -1
README.md CHANGED
@@ -10,6 +10,9 @@ language:
10
  -->
11
  <img src="https://i.imgur.com/E1vqCIw.png" alt="PicToModel" width="330"/>
12
 
 
 
 
13
 
14
  ## Model Details
15
  Jellyfish-7B is a large language model equipped with 7 billion parameters.
@@ -103,9 +106,92 @@ _Few-shot is disabled for Jellyfish models._
103
 
104
  ## Prompt Template
105
  ```
 
 
106
  [INST]:
107
 
108
- <prompt> (without the <>)
109
 
110
  [\INST]]
111
  ```
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
10
  -->
11
  <img src="https://i.imgur.com/E1vqCIw.png" alt="PicToModel" width="330"/>
12
 
13
+ Other versions of Jellyfish:
14
+ [Jellyfish-8B](https://huggingface.co/NECOUDBFM/Jellyfish-8B)
15
+ [Jellyfish-13B](https://huggingface.co/NECOUDBFM/Jellyfish-13B)
16
 
17
  ## Model Details
18
  Jellyfish-7B is a large language model equipped with 7 billion parameters.
 
106
 
107
  ## Prompt Template
108
  ```
109
+ {system message}
110
+
111
  [INST]:
112
 
113
+ {prompt} (without the {})
114
 
115
  [\INST]]
116
  ```
117
+
118
+ ## Prompts
119
+
120
+ We provide the prompts used for both the model's fine-tuning and inference.
121
+ You can structure your data according to these prompts.
122
+
123
+ ### System Message
124
+ ```
125
+ You are an AI assistant that follows instruction extremely well.
126
+ User will give you a question. Your task is to answer as faithfully as you can.
127
+ ```
128
+
129
+ ### For Entity Matching
130
+ ```
131
+ You are tasked with determining whether two records listed below are the same based on the information provided.
132
+ Carefully compare the {attribute 1}, {attribute 2}... for each record before making your decision.
133
+ Note that missing values (N/A or \"nan\") should not be used as a basis for your decision.
134
+ Record A: [{attribute 1}: {attribute 1 value}, {attribute 2}: {attribute 2 value}, ...]
135
+ Record B: [{attribute 1}: {attribute 1 value}, {attribute 2}: {attribute 2 value}, ...]
136
+ Are record A and record B the same entity? Choose your answer from: [Yes, No].
137
+ ```
138
+
139
+ ### For Data Imputation
140
+ ```
141
+ You are presented with a {keyword} record that is missing a specific attribute: {attribute X}.
142
+ Your task is to deduce or infer the value of {attribute X} using the available information in the record.
143
+ You may be provided with fields like {attribute 1}, {attribute 2}, ... to help you in the inference.
144
+ Record: [{attribute 1}: {attribute 1 value}, {attribute 2}: {attribute 2 value}, ...]
145
+ Based on the provided record, what would you infer is the value for the missing attribute {attribute X}?
146
+ Answer only the value of {attribute X}.
147
+ ```
148
+
149
+ ### For Data Imputation
150
+ ```
151
+ You are presented with a {keyword} record that is missing a specific attribute: {attribute X}.
152
+ Your task is to deduce or infer the value of {attribute X} using the available information in the record.
153
+ You may be provided with fields like {attribute 1}, {attribute 2}, ... to help you in the inference.
154
+ Record: [{attribute 1}: {attribute 1 value}, {attribute 2}: {attribute 2 value}, ...]
155
+ Based on the provided record, what would you infer is the value for the missing attribute {attribute X}?
156
+ Answer only the value of {attribute X}.
157
+ ```
158
+
159
+ ### For Error Detection
160
+ _There are two forms of the error detection task.
161
+ In the first form, a complete record row is provided, and the task is to determine if a specific value is erroneous.
162
+ In the second form, only the value of a specific attribute is given, and the decision about its correctness is based solely on the attribute's name and value.
163
+ The subsequent prompt examples pertain to these two forms, respectively._
164
+ ```
165
+ Your task is to determine if there is an error in the value of a specific attribute within the whole record provided.
166
+ The attributes may include {attribute 1}, {attribute 2}, ...
167
+ Errors may include, but are not limited to, spelling errors, inconsistencies, or values that don't make sense given the context of the whole record.
168
+ Record [{attribute 1}: {attribute 1 value}, {attribute 2}: {attribute 2 value}, ...]
169
+ Attribute for Verification: [{attribute X}: {attribute X value}]
170
+ Question: Is there an error in the value of {attribute X}? Choose your answer from: [Yes, No].
171
+ ```
172
+ ```
173
+ Your task is to determine if there is an error in the value of a specific attribute.
174
+ The attributes may belong to a {keyword} record and could be one of the following: {attribute 1}, {attribute 2}, ...
175
+ Errors can include, but are not limited to, spelling errors, inconsistencies, or values that don't make sense for that attribute.
176
+ Note: Missing values (N/A or \"nan\") are not considered errors.
177
+ Attribute for Verification: [{attribute X}: {attribute X value}]
178
+ Question: Is there an error in the value of {attribute X}? Choose your answer from: [Yes, No].
179
+ ```
180
+
181
+ ### For Schema Matching
182
+ ```
183
+ Your task is to determine if the two attributes (columns) are semantically equivalent in the context of merging two tables.
184
+ Each attribute will be provided by its name and a brief description.
185
+ Your goal is to assess if they refer to the same information based on these names and descriptions provided.
186
+ Attribute A is [name: {value of name}, description: {value of description}].
187
+ Attribute B is [name: {value of name}, description: {value of description}].
188
+ Are Attribute A and Attribute B semantically equivalent? Choose your answer from: [Yes, No].
189
+ ```
190
+
191
+ ### For Column Type Annotation
192
+
193
+ We follow the prompt in [Column Type Annotation using ChatGPT](https://arxiv.org/abs/2306.00745) (text+inst+2-step).
194
+
195
+ ### For Attribute Value Extraction
196
+
197
+ We follow the prompt in [Product Attribute Value Extraction using Large Language Models](https://arxiv.org/abs/2310.12537) (textual, w/o examples).