-
LMDX: Language Model-based Document Information Extraction and Localization
Paper • 2309.10952 • Published • 65 -
Attention Where It Matters: Rethinking Visual Document Understanding with Selective Region Concentration
Paper • 2309.01131 • Published • 1 -
On the Hidden Mystery of OCR in Large Multimodal Models
Paper • 2305.07895 • Published -
DocLLM: A layout-aware generative language model for multimodal document understanding
Paper • 2401.00908 • Published • 181
Onur Savas PRO
onursavas
AI & ML interests
None yet
Organizations
Collections
3
spaces
9
models
None public yet
datasets
None public yet