🪐 SmolLM Collection A series of smol LLMs: 135M, 360M and 1.7B. We release base and Instruct models as well as the training corpus and some WebGPU demos • 12 items • Updated Aug 18 • 169
LLaVa-NeXT-Video Collection LLaVa-NeXT-Video extends LLaVa-NeXT for video understanding. • 5 items • Updated Jun 10 • 4
view article Article seemore: Implement a Vision Language Model from Scratch By AviSoori1x • Jun 23 • 56
view article Article Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent Apr 22 • 78
Cobra: Extending Mamba to Multi-Modal Large Language Model for Efficient Inference Paper • 2403.14520 • Published Mar 21 • 32