Introducing ConTextual: How well can your Multimodal model jointly reason over text and image in text-rich scenes? Mar 5 • 4
Lumos: Learning Agents with Unified Data, Modular Design, and Open-Source LLMs Paper • 2311.05657 • Published Nov 9, 2023 • 27
RLCD: Reinforcement Learning from Contrast Distillation for Language Model Alignment Paper • 2307.12950 • Published Jul 24, 2023 • 9
DesCo: Learning Object Recognition with Rich Language Descriptions Paper • 2306.14060 • Published Jun 24, 2023 • 1