Live
Black Hat USADark ReadingBlack Hat AsiaAI BusinessAnthropic Responsible Scaling Policy v3: Dive Into The Detailslesswrong.comI tried ChatGPT's new CarPlay integration: It's my new go-to for the questions Siri can't answerZDNet AIAvast Premium Isn’t Flashy — But It Might Be the Smartest Cheap Antivirus Right NowGizmodoScaling AI's Promise in Healthcare: The Time is Now - Pharmaceutical ExecutiveGNews AI healthcareLiving brain cells enable machine learning computationsPhys.org AILiving brain cells enable machine learning computations - Tech XploreGoogle News: Machine LearningARC Raiders Publisher Nexon Calls The Game A "Trojan Horse" To Normalize Generative AI - gameranx.comGoogle News: Generative AIInvestors Chasing AI Hardware Gains May Want to Rethink ARTY Before Adding More Exposure - 24/7 Wall St.GNews AI NVIDIAAI for investors - MLQ.aiGNews AI NVIDIAMicrosoft Commits $10 Billion to Expand AI Infrastructure, Cybersecurity in Japan - Morocco World NewsGNews AI JapanAragorn Is Officially Being Recast For The Hunt for GollumGizmodoBlack Hat USADark ReadingBlack Hat AsiaAI BusinessAnthropic Responsible Scaling Policy v3: Dive Into The Detailslesswrong.comI tried ChatGPT's new CarPlay integration: It's my new go-to for the questions Siri can't answerZDNet AIAvast Premium Isn’t Flashy — But It Might Be the Smartest Cheap Antivirus Right NowGizmodoScaling AI's Promise in Healthcare: The Time is Now - Pharmaceutical ExecutiveGNews AI healthcareLiving brain cells enable machine learning computationsPhys.org AILiving brain cells enable machine learning computations - Tech XploreGoogle News: Machine LearningARC Raiders Publisher Nexon Calls The Game A "Trojan Horse" To Normalize Generative AI - gameranx.comGoogle News: Generative AIInvestors Chasing AI Hardware Gains May Want to Rethink ARTY Before Adding More Exposure - 24/7 Wall St.GNews AI NVIDIAAI for investors - MLQ.aiGNews AI NVIDIAMicrosoft Commits $10 Billion to Expand AI Infrastructure, Cybersecurity in Japan - Morocco World NewsGNews AI JapanAragorn Is Officially Being Recast For The Hunt for GollumGizmodo
AI NEWS HUBbyEIGENVECTOREigenvector

AURA: Multimodal Shared Autonomy for Real-World Urban Navigation

arXiv cs.ROby [Submitted on 2 Apr 2026]April 3, 20262 min read1 views
Source Quiz

arXiv:2604.01659v1 Announce Type: new Abstract: Long-horizon navigation in complex urban environments relies heavily on continuous human operation, which leads to fatigue, reduced efficiency, and safety concerns. Shared autonomy, where a Vision-Language AI agent and a human operator collaborate on maneuvering the mobile machine, presents a promising solution to address these issues. However, existing shared autonomy methods often require humans and AI to operate within the same action space, leading to high cognitive overhead. We present Assistive Urban Robot Autonomy (AURA), a new multi-modal framework that decomposes urban navigation into high-level human instruction and low-level AI control. AURA incorporates a Spatial-Aware Instruction Encoder to align various human instructions with v

View PDF HTML (experimental)

Abstract:Long-horizon navigation in complex urban environments relies heavily on continuous human operation, which leads to fatigue, reduced efficiency, and safety concerns. Shared autonomy, where a Vision-Language AI agent and a human operator collaborate on maneuvering the mobile machine, presents a promising solution to address these issues. However, existing shared autonomy methods often require humans and AI to operate within the same action space, leading to high cognitive overhead. We present Assistive Urban Robot Autonomy (AURA), a new multi-modal framework that decomposes urban navigation into high-level human instruction and low-level AI control. AURA incorporates a Spatial-Aware Instruction Encoder to align various human instructions with visual and spatial context. To facilitate training, we construct MM-CoS, a large-scale dataset comprising teleoperation and vision-language descriptions. Experiments in simulation and the real world demonstrate that AURA effectively follows human instructions, reduces manual operation effort, and improves navigation stability, while enabling online adaptation. Moreover, under similar takeover conditions, our shared autonomy framework reduces the frequency of takeovers by more than 44%. Demo video and more detail are provided in the project page.

Comments: 17 pages, 18 figures, 4 tables, conference

Subjects:

Robotics (cs.RO)

Cite as: arXiv:2604.01659 [cs.RO]

(or arXiv:2604.01659v1 [cs.RO] for this version)

https://doi.org/10.48550/arXiv.2604.01659

arXiv-issued DOI via DataCite (pending registration)

Submission history

From: Yukai Ma [view email] [v1] Thu, 2 Apr 2026 05:59:31 UTC (24,086 KB)

Was this article helpful?

Sign in to highlight and annotate this article

AI
Ask AI about this article
Powered by Eigenvector · full article context loaded
Ready

Conversation starters

Ask anything about this article…

Daily AI Digest

Get the top 5 AI stories delivered to your inbox every morning.

More about

trainingannouncesafety

Knowledge Map

Knowledge Map
TopicsEntitiesSource
AURA: Multi…trainingannouncesafetymultimodalagentarxivarXiv cs.RO

Connected Articles — Knowledge Graph

This article is connected to other articles through shared AI topics and tags.

Knowledge Graph100 articles · 160 connections
Scroll to zoom · drag to pan · click to open

Discussion

Sign in to join the discussion

No comments yet — be the first to share your thoughts!

More in Frontier Research