elinas (elinas)

Posts 1

We ran an experiment to revive LLaMA 1 33B, which had unique prose, lacked "GPT-isms" and "slop" in its pretraining data, and was one of the favorites at the time. Over multiple finetune runs, we extended the model's context from its pretrained base of 2,048 tokens to ~12,000 tokens, adding approx. 500M tokens of training data in the process. The effective length is 16,384 tokens, but it is better to stay toward the lower end of that range. It writes well and in multiple formats. For the future, we have some ideas, such as implementing GQA. Please take a look; we would love to hear your feedback!

ZeusLabs/Chronos-Divergence-33B
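
As a rough illustration of the context guidance in the post (stay near ~12,000 tokens, with 16,384 as the stated effective length), here is a minimal sketch of a prompt-trimming helper. The function name `fit_prompt`, the default of 512 reserved generation tokens, and the "drop the oldest tokens" strategy are all assumptions made for this example; none of them come from the model card.

```python
# Hypothetical helper: names and defaults are illustrative, not from the model card.
RECOMMENDED_CONTEXT = 12_000  # "~12,000 tokens" recommended in the post
MAX_CONTEXT = 16_384          # effective length stated in the post


def fit_prompt(token_ids, max_new_tokens=512, context_limit=RECOMMENDED_CONTEXT):
    """Trim the oldest tokens so prompt + generated tokens fit in context_limit."""
    if not 0 < context_limit <= MAX_CONTEXT:
        raise ValueError(f"context_limit must be in 1..{MAX_CONTEXT}")
    if not 0 <= max_new_tokens < context_limit:
        raise ValueError("max_new_tokens must be smaller than context_limit")

    # Tokens left for the prompt once room for generation is reserved.
    budget = context_limit - max_new_tokens
    if len(token_ids) <= budget:
        return list(token_ids)
    # Keep the most recent tokens; older context is dropped first.
    return list(token_ids[-budget:])


if __name__ == "__main__":
    long_prompt = list(range(20_000))  # stand-in for real token ids
    trimmed = fit_prompt(long_prompt)
    print(len(trimmed))
```

In practice the token ids would come from the model's own tokenizer, and a chat or story frontend might prefer to trim whole messages rather than raw tokens.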

Collections 5

models 26

datasets

None public yet

Collections 5

ZeusLabs/Chronos-Divergence-33B

elinas/Chronos-Gold-12B-1.0

ZeusLabs/L3-Aethora-15B-V2

elinas/chronos-33b

elinas/Llama-3-15B-Instruct-ft-v2

elinas/Llama-3-15B-Instruct-zeroed-ft

elinas/Llama-3-15B-Instruct-zeroed

elinas/Llama-3-13B-Instruct-ft

models 26

elinas/Chronos-Gold-12B-1.0

elinas/chronos-mistral-7b

elinas/chronos-33b

elinas/Qwen2-11.3B

elinas/Llama-3-15B-Instruct-ft-v2

elinas/Llama-3-15B-Instruct-zeroed-ft

elinas/Llama-3-13B-Instruct-ft

elinas/Llama-3-15B-Instruct-zeroed

elinas/Llama-3-13B-Instruct

elinas/Meta-Llama-3-120B-Instruct-4.0bpw-exl2
