Join the conversation

Join the community of Machine Learners and AI enthusiasts.

Sign Up
Symbol-LLMΒ 
posted an update 8 days ago
Post
2051
πŸš€ Excited to introduce a new member of the OS-Copilot family: OS-Atlas - an open-sourced foundational action model for GUI agents

πŸ“˜ Paper: OS-ATLAS: A Foundation Action Model for Generalist GUI Agents (2410.23218)
πŸ”— Website: https://osatlas.github.io

πŸ˜‡ TL;DR: OS-Atlas offers:
1. State-of-the-Art GUI Grounding: Helps GUI agents accurately locate GUI elements.
2. Strong OOD Performance and Cross-platform Compatibility: Excels in out-of-domain agentic tasks across MacOS, Windows, Linux, Android, and Web.
3. Complete Infrastructure for GUI Data Synthesis:
You can easily build your own OS agent upon it!

In this post