Jim Lai

grimjim

AI & ML interests

Experimenting primarily with 7B-12B parameter text completion models. Not all models are intended for direct use; some exist for educational and/or merge purposes.

Organizations

Posts 14

Post

1865

To demonstrate that it was possible, I performed a "trapezoid" gradient merge of a Llama 3 8B model onto Llama 3.1 8B Instruct, favoring the L3.1 model at the ends in order to preserve coherence and limiting the influence of the L3 model to at most 0.1 weight. Tested to 16k context length.
grimjim/Llama-Nephilim-Metamorphosis-v2-8B
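
The shape of a "trapezoid" gradient can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the recipe used for the model above: the post gives the 0.1 cap and the preference for L3.1 at the ends, while the ramp length of 8 layers, the linear ramps, and the plain linear interpolation in `blend` are hypothetical choices made here. The layer count of 32 matches Llama 3 8B.

```python
def trapezoid_weights(num_layers, peak, ramp):
    """Per-layer weight for the secondary model: 0 at both ends,
    linear ramps of `ramp` layers, and a flat plateau at `peak`."""
    if ramp < 1 or num_layers < 2 * ramp + 1:
        raise ValueError("need ramp >= 1 and room for two ramps plus a plateau")
    weights = []
    for i in range(num_layers):
        edge = min(i, num_layers - 1 - i)  # distance to the nearest end layer
        weights.append(peak * min(1.0, edge / ramp))
    return weights


def blend(base, other, t):
    """Linear interpolation of two equal-length parameter lists;
    t is the share given to `other` (assumed merge rule)."""
    return [(1.0 - t) * b + t * o for b, o in zip(base, other)]


# 32 layers, secondary model capped at 0.1; ramp=8 is a hypothetical value.
weights = trapezoid_weights(num_layers=32, peak=0.1, ramp=8)
```

With these settings the first and last layers come entirely from the base model, and the middle layers take at most a 0.1 share from the secondary model.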

Post

1840

I was reading through an abstract and found myself wondering how much LLM performance is being left on the table due to insufficient curation of training datasets: "Instruct-SkillMix: A Powerful Pipeline for LLM Instruction Tuning" by Kaur, Park, Goyal, Arora.
https://arxiv.org/abs/2408.14774
In particular, the observation that "Introducing low quality answers ("shirkers") in 20% of Instruct-SkillMix examples causes performance to plummet..." had me wondering how many ostensibly good datasets out there are in fact populated with a significant number of "shirkers".

Collections 5

models 109

datasets 2

grimjim/PAlign-PAPI-personality_prompt.json-cleaned

Viewer • Updated Sep 21 • 300 • 41

grimjim/adversarial-10-alpaca

Viewer • Updated Aug 16 • 10 • 42 • 1

Collections 5

grimjim/llama-3-Nephilim-v3-8B

grimjim/llama-3-Nephilim-v3-8B-GGUF

grimjim/Llama-3.1-8B-Instruct-abliterated_via_adapter

grimjim/Llama-3.1-8B-Instruct-abliterated_via_adapter-GGUF

grimjim/kuno-kunoichi-v1-DPO-v2-SLERP-7B

grimjim/kukulemon-7B

grimjim/kukulemon-spiked-9B

grimjim/kukulemon-32K-7B

models 109

grimjim/Magnolia-v1-12B-GGUF

grimjim/Magnolia-v1-12B

grimjim/Magot-v3-Gemma2-8k-9B

grimjim/Llama-Nephilim-Metamorphosis-v2-8B

grimjim/Magnolia-v2-Gemma2-8k-9B

grimjim/Magnolia-v1-Gemma2-8k-9B

grimjim/magnum-consolidatum-v1-12b

grimjim/mistralai-Mistral-Nemo-Base-2407

grimjim/mistralai-Mistral-Nemo-Instruct-2407

grimjim/mistralai-Mistral-7B-v0.3
