Live
Black Hat USADark ReadingBlack Hat AsiaAI BusinessMercor says it was hit by cyberattack tied to compromise of open-source LiteLLM projectTechCrunch AIHow AI has suddenly become much more useful to open-source developers - ZDNETGNews AI open sourceIn the Iran war, it looks like AI helped with operations, not strategyGary Marcus BlogGoogle adds AI charging guidance to Maps for EV drivers - mezha.netGoogle News - AI UkraineDespite its $350 billion investment promise in the U.S., the U.S. has unprecedentedly raised trade p.. - 매일경제GNews AI KoreaI Was Told to Write My Thesis in LaTeX. Here's How I Actually Got Started.DEV CommunityBuilding a Multi-Tenant SaaS with Stripe Connect in 2026DEV CommunityPart 3 of 3 — Engineering Intent Series -- Inside the Machine: The ISL Build PipelineDEV CommunityChoosing and Integrating Mobile Video SDKs: FFmpeg, ExoPlayer, and Commercial OptionsDEV CommunityStudent hui speaks on AI in education — and how to handle it - Hawaii Public RadioGNews AI educationBuild an End-to-End RAG Pipeline for LLM ApplicationsDEV CommunityAgentX-Phase2: 49-Model Byzantine FBA Consensus — Building Cool Agents that Modernize COBOL to RustDEV CommunityBlack Hat USADark ReadingBlack Hat AsiaAI BusinessMercor says it was hit by cyberattack tied to compromise of open-source LiteLLM projectTechCrunch AIHow AI has suddenly become much more useful to open-source developers - ZDNETGNews AI open sourceIn the Iran war, it looks like AI helped with operations, not strategyGary Marcus BlogGoogle adds AI charging guidance to Maps for EV drivers - mezha.netGoogle News - AI UkraineDespite its $350 billion investment promise in the U.S., the U.S. has unprecedentedly raised trade p.. - 매일경제GNews AI KoreaI Was Told to Write My Thesis in LaTeX. Here's How I Actually Got Started.DEV CommunityBuilding a Multi-Tenant SaaS with Stripe Connect in 2026DEV CommunityPart 3 of 3 — Engineering Intent Series -- Inside the Machine: The ISL Build PipelineDEV CommunityChoosing and Integrating Mobile Video SDKs: FFmpeg, ExoPlayer, and Commercial OptionsDEV CommunityStudent hui speaks on AI in education — and how to handle it - Hawaii Public RadioGNews AI educationBuild an End-to-End RAG Pipeline for LLM ApplicationsDEV CommunityAgentX-Phase2: 49-Model Byzantine FBA Consensus — Building Cool Agents that Modernize COBOL to RustDEV Community

UMI-Underwater: Learning Underwater Manipulation without Underwater Teleoperation

arXivMarch 31, 202610 min read0 views
Source Quiz

arXiv:2603.27012v1 Announce Type: cross Abstract: Underwater robotic grasping is difficult due to degraded, highly variable imagery and the expense of collecting diverse underwater demonstrations. We introduce a system that (i) autonomously collects successful underwater grasp demonstrations via a self-supervised data collection pipeline and (ii) transfers grasp knowledge from on-land human demonstrations through a depth-based affordance representation that bridges the on-land-to-underwater domain gap and is robust to lighting and color shift. An affordance model trained on on-land handheld de — Hao Li, Long Yin Chung, Jack Goler, Ryan Zhang, Xiaochi Xie, Huy Ha, Shuran Song, Mark Cutkosky

View PDF HTML (experimental)

Abstract:Underwater robotic grasping is difficult due to degraded, highly variable imagery and the expense of collecting diverse underwater demonstrations. We introduce a system that (i) autonomously collects successful underwater grasp demonstrations via a self-supervised data collection pipeline and (ii) transfers grasp knowledge from on-land human demonstrations through a depth-based affordance representation that bridges the on-land-to-underwater domain gap and is robust to lighting and color shift. An affordance model trained on on-land handheld demonstrations is deployed underwater zero-shot via geometric alignment, and an affordance-conditioned diffusion policy is then trained on underwater demonstrations to generate control actions. In pool experiments, our approach improves grasping performance and robustness to background shifts, and enables generalization to objects seen only in on-land data, outperforming RGB-only baselines. Code, videos, and additional results are available at this https URL.

Subjects:

Robotics (cs.RO); Artificial Intelligence (cs.AI)

Cite as: arXiv:2603.27012 [cs.RO]

(or arXiv:2603.27012v1 [cs.RO] for this version)

https://doi.org/10.48550/arXiv.2603.27012

arXiv-issued DOI via DataCite (pending registration)

Submission history

From: Hao Li [view email] [v1] Fri, 27 Mar 2026 22:01:19 UTC (4,747 KB)

Was this article helpful?

Sign in to highlight and annotate this article

AI
Ask AI about this article
Powered by AI News Hub · full article context loaded
Ready

Conversation starters

Ask anything about this article…

Daily AI Digest

Get the top 5 AI stories delivered to your inbox every morning.

More about

researchpaperarxiv

Knowledge Map

Knowledge Map
TopicsEntitiesSource
UMI-Underwa…researchpaperarxivaiartificial-…arXiv

Connected Articles — Knowledge Graph

This article is connected to other articles through shared AI topics and tags.

Knowledge Graph100 articles · 72 connections
Scroll to zoom · drag to pan · click to open

Discussion

Sign in to join the discussion

No comments yet — be the first to share your thoughts!

More in Research Papers