for testing #3
by AICloudOtabek - opened

README.md CHANGED
@@ -1,5 +1,5 @@
 ---
-license:
+license: other
 pipeline_tag: text-generation
 tags:
 - chemistry
@@ -8,20 +8,12 @@ language:
 - zh
 ---
 # ChemLLM-7B-Chat: LLM for Chemistry and Molecule Science
-
-> [!IMPORTANT]
-> Better using New version of ChemLLM!
-> [AI4Chem/ChemLLM-7B-Chat-1.5-DPO](https://huggingface.co/AI4Chem/ChemLLM-7B-Chat-1.5-DPO) or [AI4Chem/ChemLLM-7B-Chat-1.5-SFT](https://huggingface.co/AI4Chem/ChemLLM-7B-Chat-1.5-SFT)
-
-
 ChemLLM-7B-Chat, The First Open-source Large Language Model for Chemistry and Molecule Science, Build based on InternLM-2 with ❤
 [![Paper page](https://huggingface.co/datasets/huggingface/badges/resolve/main/paper-page-sm.svg)](https://huggingface.co/papers/2402.06852)

 <center><img src='https://cdn-uploads.huggingface.co/production/uploads/64bce15bafd1e46c5504ad38/wdFV6p3rTBCtskbeuVwNJ.png'></center>

 ## News
-- ChemLLM-1.5 released! Two versions are available [AI4Chem/ChemLLM-7B-Chat-1.5-DPO](https://huggingface.co/AI4Chem/ChemLLM-7B-Chat-1.5-DPO) or [AI4Chem/ChemLLM-7B-Chat-1.5-SFT](https://huggingface.co/AI4Chem/ChemLLM-7B-Chat-1.5-SFT).[2024-4-2]
-- ChemLLM-1.5 updated! Have a try on [Demo Site](https://chemllm.org/#/chat) or [API Reference](https://api.chemllm.org/docs).[2024-3-23]
 - ChemLLM has been featured by HuggingFace on [“Daily Papers” page](https://huggingface.co/papers/2402.06852).[2024-2-13]
 - ChemLLM arXiv preprint released.[ChemLLM: A Chemical Large Language Model](https://arxiv.org/abs/2402.06852)[2024-2-10]
 - News report from [Shanghai AI Lab](https://mp.weixin.qq.com/s/u-i7lQxJzrytipek4a87fw)[2024-1-26]
@@ -44,7 +36,7 @@ import torch
 model_name_or_id = "AI4Chem/ChemLLM-7B-Chat"

 model = AutoModelForCausalLM.from_pretrained(model_name_or_id, torch_dtype=torch.float16, device_map="auto",trust_remote_code=True)
-tokenizer = AutoTokenizer.from_pretrained(model_name_or_id)
+tokenizer = AutoTokenizer.from_pretrained(model_name_or_id, trust_remote_code=True)

 prompt = "What is Molecule of Ibuprofen?"

@@ -75,21 +67,17 @@ You can format it into this InternLM2 Dialogue format like,
 ```
 def InternLM2_format(instruction,prompt,answer,history):
     prefix_template=[
-        "<|im_start|>system\n",
-        "{}",
-        "<|im_end|>\n"
+        "<|system|>:",
+        "{}"
     ]
     prompt_template=[
-        "<|im_start|>user\n",
-        "{}",
-        "<|im_end|>\n",
-        "<|im_start|>assistant\n",
-        "{}",
-        "<|im_end|>\n"
+        "<|user|>:",
+        "{}\n",
+        "<|Bot|>:\n"
     ]
-    system = f'{prefix_template[0]}{prefix_template[1].format(instruction)}{prefix_template[2]}'
-    history = "".join([f'{prompt_template[0]}{prompt_template[1].format(qa[0])}{prompt_template[2]}{prompt_template[3]}{prompt_template[4].format(qa[1])}{prompt_template[5]}' for qa in history])
-    prompt = f'{prompt_template[0]}{prompt_template[1].format(prompt)}{prompt_template[2]}{prompt_template[3]}'
+    system = f'{prefix_template[0]}\n{prefix_template[-1].format(instruction)}\n'
+    history = "\n".join([f'{prompt_template[0]}\n{prompt_template[1].format(qa[0])}{prompt_template[-1]}{qa[1]}' for qa in history])
+    prompt = f'\n{prompt_template[0]}\n{prompt_template[1].format(prompt)}{prompt_template[-1]}'
     return f"{system}{history}{prompt}"
 ```
 And there is a good example for system prompt,
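The updated `InternLM2_format` on the `+` side of this diff can be sanity-checked standalone. The sketch below reproduces the patched function and prints the dialogue string it builds; the system instruction and question are placeholder values, not taken from the README:

```python
def InternLM2_format(instruction, prompt, answer, history):
    # Dialogue-piece templates from the patched README
    prefix_template = [
        "<|system|>:",
        "{}"
    ]
    prompt_template = [
        "<|user|>:",
        "{}\n",
        "<|Bot|>:\n"
    ]
    # System block, one user/assistant pair per history entry, then the new turn
    system = f'{prefix_template[0]}\n{prefix_template[-1].format(instruction)}\n'
    history = "\n".join([f'{prompt_template[0]}\n{prompt_template[1].format(qa[0])}{prompt_template[-1]}{qa[1]}' for qa in history])
    prompt = f'\n{prompt_template[0]}\n{prompt_template[1].format(prompt)}{prompt_template[-1]}'
    return f"{system}{history}{prompt}"

text = InternLM2_format("You are a helpful chemistry assistant.",
                        "What is Molecule of Ibuprofen?", None, [])
print(text)
```

With an empty history the result is the system block followed directly by the new `<|user|>:` turn, ending in `<|Bot|>:` so the model continues as the assistant.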