Trelis
/

Llama-2-7b-chat-hf-function-calling-v2

@@ -13,11 +13,12 @@ tags:
 - function calling
 - sharded
 ---
-# Function Calling Llama 2 + Mistral + Zephyr + Deepseek Coder Models (version 2)
 - Function calling Llama extends the hugging face Llama 2 models with function calling capabilities.
 - The model responds with a structured json argument with the function name and arguments.
 **Recent Updates**
 - Nov 8th 2023 -> added Zephyr beta, an improved version of Mistral 7B (achieved via DPO)
 - November 6th 2023 -> added Deepseek Coder 1.3B, 6.7B and 33B
 - October 11th 2023 -> added Mistral 7B with function calling
@@ -27,7 +28,9 @@ tags:
 1. Shortened syntax: Only function descriptions are needed for inference and no added instruction is required.
 2. Function descriptions are moved outside of the system prompt. This avoids the behaviour of function calling being affected by how the system prompt had been trained to influence the model.
-Most Popular Models:
 - Deepseek-Coder-1.3B-Instruct with function calling ([Base Model](https://huggingface.co/Trelis/deepseek-coder-1.3b-instruct-function-calling-v2)), ([PEFT Adapters](https://huggingface.co/Trelis/deepseek-coder-1.3b-instruct-function-calling-adapters-v2/settings)) - Paid, [purchase here](https://buy.stripe.com/9AQbJubSda9Z8EM00A)
 - Llama-7B-chat with function calling ([Base Model](https://huggingface.co/Trelis/Llama-2-7b-chat-hf-function-calling-v2)), ([PEFT Adapters](https://huggingface.co/Trelis/Llama-2-7b-chat-hf-function-calling-adapters-v2)), ([GGUF - files are in the main branch of the base model]) - Free
 - zephyr-7b-beta with function calling ([Base Model](https://huggingface.co/Trelis/zephyr-7b-beta-function-calling-v2)), ([PEFT Adapters](https://huggingface.co/Trelis/zephyr-7b-beta-function-calling-adapters-v2)), ([GGUF - files are in the main branch of the base model]) - Paid, [purchase here](https://buy.stripe.com/14k00M4pLeqf9IQbJk)
@@ -58,6 +61,8 @@ Mistral-7B, Llama-13B, Code-llama-34b, Llama-70B and Falcon-180B with function c
 Use of all Llama models with function calling is further subject to terms in the [Meta license](https://ai.meta.com/resources/models-and-libraries/llama-downloads/).
 Zephr models were generated using Ultrachat, which relies on openai. OpenAI does not permit the use of it's models to train competitive models. This makes it unclear as to whether Zephyr may be used commercial. Buyers/users do so at their sole risk.
 ## Dataset
@@ -92,6 +97,7 @@ import json
 B_FUNC, E_FUNC = "<FUNCTIONS>", "</FUNCTIONS>\n\n"
 B_INST, E_INST = "[INST] ", " [/INST]" #Llama style
 # B_INST, E_INST = "\n### Instruction:\n", "\n### Response:\n" #DeepSeek Coder Style
 # Define the function metadata
 function_metadata = {
@@ -135,6 +141,7 @@ Example without a system message:
   B_FUNC, E_FUNC = "<FUNCTIONS>", "</FUNCTIONS>\n\n"
   B_INST, E_INST = "[INST] ", " [/INST]" #Llama style
   # B_INST, E_INST = "\n### Instruction:\n", "\n### Response:\n" #DeepSeek Coder Style
   functionList = {function_1_metadata}{function_2_metadata}...
   user_prompt = '...'
@@ -149,6 +156,7 @@ Example with a system message:
   B_FUNC, E_FUNC = "<FUNCTIONS>", "</FUNCTIONS>\n\n"
   B_INST, E_INST = "[INST] ", " [/INST]" #Llama style
   # B_INST, E_INST = "\n### Instruction:\n", "\n### Response:\n" #DeepSeek Coder Style
   B_SYS, E_SYS = "<<SYS>>\n", "\n<</SYS>>\n\n"
   # assuming functionList is defined as above

 - function calling
 - sharded
 ---
+# Function Calling Llama 2 + Yi + Mistral + Zephyr + Deepseek Coder Models (version 2)
 - Function calling Llama extends the hugging face Llama 2 models with function calling capabilities.
 - The model responds with a structured json argument with the function name and arguments.
 **Recent Updates**
+- Nov 15th 2023 -> added Yi 200k context models in 6B and 34B form.
 - Nov 8th 2023 -> added Zephyr beta, an improved version of Mistral 7B (achieved via DPO)
 - November 6th 2023 -> added Deepseek Coder 1.3B, 6.7B and 33B
 - October 11th 2023 -> added Mistral 7B with function calling
 1. Shortened syntax: Only function descriptions are needed for inference and no added instruction is required.
 2. Function descriptions are moved outside of the system prompt. This avoids the behaviour of function calling being affected by how the system prompt had been trained to influence the model.
+Latest Models:
+- Yi-6B-200k context with function calling ([Base Model](Trelis/Yi-6B-200K-Llamafied-function-calling-v2)), ([PEFT Adapters](Trelis/Yi-6B-200K-Llamafied-function-calling-adapters-v2)) - Paid, [purchase here](https://buy.stripe.com/00gdRC7BX1Dt08gbJp)
+- Yi-34B-200k context with function calling ([Base Model](Trelis/Yi-34B-200K-Llamafied-function-calling-v2)), ([PEFT Adapters](Trelis/Yi-34B-200K-Llamafied-function-calling-adapters-v2)) - Paid, [purchase here](https://buy.stripe.com/8wM00M5tP81R6wE9Bi)
 - Deepseek-Coder-1.3B-Instruct with function calling ([Base Model](https://huggingface.co/Trelis/deepseek-coder-1.3b-instruct-function-calling-v2)), ([PEFT Adapters](https://huggingface.co/Trelis/deepseek-coder-1.3b-instruct-function-calling-adapters-v2/settings)) - Paid, [purchase here](https://buy.stripe.com/9AQbJubSda9Z8EM00A)
 - Llama-7B-chat with function calling ([Base Model](https://huggingface.co/Trelis/Llama-2-7b-chat-hf-function-calling-v2)), ([PEFT Adapters](https://huggingface.co/Trelis/Llama-2-7b-chat-hf-function-calling-adapters-v2)), ([GGUF - files are in the main branch of the base model]) - Free
 - zephyr-7b-beta with function calling ([Base Model](https://huggingface.co/Trelis/zephyr-7b-beta-function-calling-v2)), ([PEFT Adapters](https://huggingface.co/Trelis/zephyr-7b-beta-function-calling-adapters-v2)), ([GGUF - files are in the main branch of the base model]) - Paid, [purchase here](https://buy.stripe.com/14k00M4pLeqf9IQbJk)
 Use of all Llama models with function calling is further subject to terms in the [Meta license](https://ai.meta.com/resources/models-and-libraries/llama-downloads/).
+Yi models are subject to the Yi license, which permits commercial use as of Nov 15th 2023.
 Zephr models were generated using Ultrachat, which relies on openai. OpenAI does not permit the use of it's models to train competitive models. This makes it unclear as to whether Zephyr may be used commercial. Buyers/users do so at their sole risk.
 ## Dataset
 B_FUNC, E_FUNC = "<FUNCTIONS>", "</FUNCTIONS>\n\n"
 B_INST, E_INST = "[INST] ", " [/INST]" #Llama style
 # B_INST, E_INST = "\n### Instruction:\n", "\n### Response:\n" #DeepSeek Coder Style
+# B_INST, E_INST = "Human: ", " Assistant: " #Yi Style
 # Define the function metadata
 function_metadata = {
   B_FUNC, E_FUNC = "<FUNCTIONS>", "</FUNCTIONS>\n\n"
   B_INST, E_INST = "[INST] ", " [/INST]" #Llama style
   # B_INST, E_INST = "\n### Instruction:\n", "\n### Response:\n" #DeepSeek Coder Style
+  # B_INST, E_INST = "Human: ", " Assistant: " #Yi Style
   functionList = {function_1_metadata}{function_2_metadata}...
   user_prompt = '...'
   B_FUNC, E_FUNC = "<FUNCTIONS>", "</FUNCTIONS>\n\n"
   B_INST, E_INST = "[INST] ", " [/INST]" #Llama style
   # B_INST, E_INST = "\n### Instruction:\n", "\n### Response:\n" #DeepSeek Coder Style
+  # B_INST, E_INST = "Human: ", " Assistant: " #Yi Style
   B_SYS, E_SYS = "<<SYS>>\n", "\n<</SYS>>\n\n"
   # assuming functionList is defined as above