arxiv:2410.01273

CANVAS: Commonsense-Aware Navigation System for Intuitive Human-Robot Interaction

Published on Oct 2

Submitted by lastdefiance20 on Oct 7

Authors: Suhwan Choi, Yongjun Cho, Jaeyoon Jung, Yubeen Park, Hwiseong Park, Jiwan Chung, Youngjae Yu

Abstract

Real-life robot navigation involves more than just reaching a destination; it requires optimizing movements while addressing scenario-specific goals. An intuitive way for humans to express these goals is through abstract cues like verbal commands or rough sketches. Such human guidance may lack details or be noisy. Nonetheless, we expect robots to navigate as intended. For robots to interpret and execute these abstract instructions in line with human expectations, they must share a common understanding of basic navigation concepts with humans. To this end, we introduce CANVAS, a novel framework that combines visual and linguistic instructions for commonsense-aware navigation. Its success is driven by imitation learning, enabling the robot to learn from human navigation behavior. We present COMMAND, a comprehensive dataset with human-annotated navigation results, spanning over 48 hours and 219 km, designed to train commonsense-aware navigation systems in simulated environments. Our experiments show that CANVAS outperforms the strong rule-based system ROS NavStack across all environments, demonstrating superior performance with noisy instructions. Notably, in the orchard environment, where ROS NavStack records a 0% total success rate, CANVAS achieves a total success rate of 67%. CANVAS also closely aligns with human demonstrations and commonsense constraints, even in unseen environments. Furthermore, real-world deployment of CANVAS showcases impressive Sim2Real transfer with a total success rate of 69%, highlighting the potential of learning from human demonstrations in simulated environments for real-world applications.

Community

lastdefiance20 (paper author, paper submitter)

We introduce CANVAS 🖼️, a novel framework that combines visual and linguistic instructions for commonsense-aware navigation. Its success is driven by imitation learning, enabling the robot to learn from human navigation behavior.

Additionally, we present COMMAND ⌨️, a comprehensive dataset with human-annotated navigation results, spanning over 48 hours and 219 km, designed to train commonsense-aware navigation systems in simulated environments.

In the orchard environment, where ROS NavStack records a 0% total success rate, CANVAS achieves a 67% 🔥 success rate. Furthermore, real-world deployment of CANVAS showcases impressive Sim2Real transfer with a total success rate of 69% 🔥, highlighting the potential of learning from human demonstrations in simulated environments for real-world applications.

https://worv-ai.github.io/canvas/
