STLM - a DylanASHillier Collection

STLM

updated Jun 24

Instruction Pre-Training: Language Models are Supervised Multitask Learners

Paper • 2406.14491 • Published Jun 20 • 85

HARE: HumAn pRiors, a key to small language model Efficiency

Paper • 2406.11410 • Published Jun 17 • 38