Update README.md
README.md
@@ -9,11 +9,11 @@ language:
 <img src="https://i.imgur.com/d8Bl04i.png" alt="PicToModel" width="330"/>
 
 ## Model Details
-Jellyfish-13B is a large language model with 13 billion parameters
+Jellyfish-13B is a large language model with 13 billion parameters, tailored specifically for data preprocessing tasks, including entity matching, data imputation, error detection, and schema matching.
 
-We fine-tuned [Open-Orca/OpenOrca-Platypus2-13B](https://huggingface.co/Open-Orca/OpenOrca-Platypus2-13B) using the datasets
-Its performance is competitive,
-
+We fine-tuned [Open-Orca/OpenOrca-Platypus2-13B](https://huggingface.co/Open-Orca/OpenOrca-Platypus2-13B) on datasets pertinent to data preprocessing tasks.
+Its performance is competitive, rivaling previous state-of-the-art algorithms and LLMs such as OpenAI's GPT-3.5 and GPT-4 ([as demonstrated in our earlier studies](https://arxiv.org/abs/2308.16361)).
+Notably, as a 13B model, Jellyfish allows for cost-effective local execution without compromising data security.
 
 | Task | Dataset | Non-LLM SoTA<sup>1</sup> | GPT-3.5<sup>2</sup> | GPT-4<sup>2</sup> | Jellyfish-13B | Jellyfish-13B-Reasoning |
 | ---- | ---- | ---- | ---- | ---- | ---- | ---- |
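The data-preprocessing tasks named in the updated text (entity matching, data imputation, error detection, schema matching) are typically posed to an instruction-tuned model as natural-language prompts. A minimal sketch of framing an entity-matching pair as such a prompt; the template below is purely illustrative and is *not* the format Jellyfish-13B was actually fine-tuned on:

```python
def build_entity_matching_prompt(record_a: dict, record_b: dict) -> str:
    """Build an instruction-style prompt asking whether two records
    refer to the same real-world entity.

    NOTE: hypothetical template for illustration only; consult the
    model card for the prompt format Jellyfish-13B expects.
    """
    def fmt(record: dict) -> str:
        return ", ".join(f"{key}: {value}" for key, value in record.items())

    return (
        "You are given two records. Decide whether they refer to the "
        "same real-world entity. Answer with 'Yes' or 'No'.\n"
        f"Record A: [{fmt(record_a)}]\n"
        f"Record B: [{fmt(record_b)}]\n"
        "Answer:"
    )


prompt = build_entity_matching_prompt(
    {"name": "iPhone 13 Pro 128GB", "brand": "Apple"},
    {"name": "Apple iPhone 13 Pro (128 GB)", "brand": "Apple"},
)
print(prompt)
```

The resulting string would be passed to the model (e.g. via a standard causal-LM generation call) and the first generated token interpreted as the match decision.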