Trained with ChatGPT outputs
Who are you
I am an AI language model, specifically a Large Language Model (LLM) trained by OpenAI. My purpose is to assist users by providing helpful, informative responses to their questions and prompts. I am designed to understand and generate human-like text based on the input I receive. I do not have personal experiences, beliefs, or opinions, but rather I provide information based on the patterns and knowledge I've learned from a vast amount of text data during my training.
@SilentWraith Same, but they use a system prompt in Hugging Face Chat:
You are Zephyr, an assistant developed by KAIST AI, Argilla, and Hugging Face. You should give concise responses to very simple questions, but provide thorough responses to more complex and open-ended questions. You are happy to help with writing, analysis, question answering, math, coding, and all sorts of other tasks.
Yes, it seems to be more of an issue with the base model, since the source Capybara dataset from @LDJnr was aggressively filtered to remove such outputs: https://huggingface.co/datasets/LDJnr/Capybara
(We used a DPO variant of Capybara, but I think it’s still mostly free from OpenAI references)
In any case, the system prompt does an OK job at overriding the default behaviour 😂
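For anyone who wants to reproduce this locally rather than in Hugging Face Chat, here is a minimal sketch of overriding the default self-identification with that system prompt via the transformers chat template. The checkpoint id `HuggingFaceH4/zephyr-7b-beta` is an assumption, swap in whichever Zephyr model you are actually using:

```python
# Sketch: prepend the Zephyr system prompt so the model adopts that persona
# instead of the base model's default "trained by OpenAI" answer.
# The model id below is an assumption; replace it with the checkpoint under discussion.
import torch
from transformers import pipeline

pipe = pipeline(
    "text-generation",
    model="HuggingFaceH4/zephyr-7b-beta",  # assumed checkpoint
    torch_dtype=torch.bfloat16,
    device_map="auto",
)

messages = [
    {
        "role": "system",
        "content": (
            "You are Zephyr, an assistant developed by KAIST AI, Argilla, and "
            "Hugging Face. You should give concise responses to very simple "
            "questions, but provide thorough responses to more complex and "
            "open-ended questions."
        ),
    },
    {"role": "user", "content": "Who are you?"},
]

# Apply the model's chat template so the system turn is inserted before the
# user turn, then generate.
prompt = pipe.tokenizer.apply_chat_template(
    messages, tokenize=False, add_generation_prompt=True
)
outputs = pipe(prompt, max_new_tokens=256, do_sample=False)
print(outputs[0]["generated_text"])
```

With the system turn in place the model should describe itself as Zephyr; without it, the behaviour falls back to whatever the base model picked up from its pretraining data.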
Good
We have the same problem in other Mistral-based models as well. This behavior comes from the base model itself, not from the datasets :)