Unifying Vision, Text, and Layout for Universal Document Processing
Paper
•
2212.02623
•
Published
•
10
UDOP is a general multimodal model for document AI
Note This is the best performing model, as it uses the highest image resolution (512x512) and is trained the longest.