Live
Black Hat USADark ReadingBlack Hat AsiaAI BusinessExclusive: Miravoice, Builder Of An AI ‘Interviewer’ To Conduct Phone Surveys, Raises $6.3MCrunchbase NewsMoltbook risks: The dangers of AI-to-AI interactions in health carePhys.org AIMaul: Shadow Lord Will Return for Season 2GizmodoMicrosoft Aims to Create Large Cutting-Edge AI Models By 2027Bloomberg TechnologyA jury says Meta and Google hurt a kid. What now?The Verge AIHow Disney Imagineers are using AI and robotics to reshape the company’s theme parksFast Company TechI have always seen myself as ‘progressive’ – but with AI it’s time to hit the brakes - The GuardianGoogle News: AIOpenAI Teams Up with Smartly to Create Chatty Ads Inside ChatGPT - TipRanksGoogle News: ChatGPTDOJ to Appeal Court Order Halting Trump’s Ban on Anthropic AIBloomberg TechnologyCapacity and speed: why TikTok shelved its second Irish data centreSilicon RepublicAI Uses Eye-Tracking Technology to Diagnose Autism - EMJGoogle News: Machine LearningBlack Hat USADark ReadingBlack Hat AsiaAI BusinessExclusive: Miravoice, Builder Of An AI ‘Interviewer’ To Conduct Phone Surveys, Raises $6.3MCrunchbase NewsMoltbook risks: The dangers of AI-to-AI interactions in health carePhys.org AIMaul: Shadow Lord Will Return for Season 2GizmodoMicrosoft Aims to Create Large Cutting-Edge AI Models By 2027Bloomberg TechnologyA jury says Meta and Google hurt a kid. What now?The Verge AIHow Disney Imagineers are using AI and robotics to reshape the company’s theme parksFast Company TechI have always seen myself as ‘progressive’ – but with AI it’s time to hit the brakes - The GuardianGoogle News: AIOpenAI Teams Up with Smartly to Create Chatty Ads Inside ChatGPT - TipRanksGoogle News: ChatGPTDOJ to Appeal Court Order Halting Trump’s Ban on Anthropic AIBloomberg TechnologyCapacity and speed: why TikTok shelved its second Irish data centreSilicon RepublicAI Uses Eye-Tracking Technology to Diagnose Autism - EMJGoogle News: Machine Learning
AI NEWS HUBbyEIGENVECTOREigenvector

RAAP: Retrieval-Augmented Affordance Prediction with Cross-Image Action Alignment

arXiv cs.ROby [Submitted on 31 Mar 2026]April 1, 20262 min read1 views
Source Quiz

arXiv:2603.29419v1 Announce Type: new Abstract: Understanding object affordances is essential for enabling robots to perform purposeful and fine-grained interactions in diverse and unstructured environments. However, existing approaches either rely on retrieval, which is fragile due to sparsity and coverage gaps, or on large-scale models, which frequently mislocalize contact points and mispredict post-contact actions when applied to unseen categories, thereby hindering robust generalization. We introduce Retrieval-Augmented Affordance Prediction (RAAP), a framework that unifies affordance retrieval with alignment-based learning. By decoupling static contact localization and dynamic action direction, RAAP transfers contact points via dense correspondence and predicts action directions throu

View PDF HTML (experimental)

Abstract:Understanding object affordances is essential for enabling robots to perform purposeful and fine-grained interactions in diverse and unstructured environments. However, existing approaches either rely on retrieval, which is fragile due to sparsity and coverage gaps, or on large-scale models, which frequently mislocalize contact points and mispredict post-contact actions when applied to unseen categories, thereby hindering robust generalization. We introduce Retrieval-Augmented Affordance Prediction (RAAP), a framework that unifies affordance retrieval with alignment-based learning. By decoupling static contact localization and dynamic action direction, RAAP transfers contact points via dense correspondence and predicts action directions through a retrieval-augmented alignment model that consolidates multiple references with dual-weighted attention. Trained on compact subsets of DROID and HOI4D with as few as tens of samples per task, RAAP achieves consistent performance across unseen objects and categories, and enables zero-shot robotic manipulation in both simulation and the real world. Project website: this https URL.

Comments: Accepted to ICRA 2026

Subjects:

Robotics (cs.RO); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV)

Cite as: arXiv:2603.29419 [cs.RO]

(or arXiv:2603.29419v1 [cs.RO] for this version)

https://doi.org/10.48550/arXiv.2603.29419

arXiv-issued DOI via DataCite (pending registration)

Submission history

From: Qiyuan Zhuang [view email] [v1] Tue, 31 Mar 2026 08:25:22 UTC (2,151 KB)

Was this article helpful?

Sign in to highlight and annotate this article

AI
Ask AI about this article
Powered by Eigenvector · full article context loaded
Ready

Conversation starters

Ask anything about this article…

Daily AI Digest

Get the top 5 AI stories delivered to your inbox every morning.

More about

modelannounceprediction

Knowledge Map

Knowledge Map
TopicsEntitiesSource
RAAP: Retri…modelannouncepredictionalignmentarxivgithubarXiv cs.RO

Connected Articles — Knowledge Graph

This article is connected to other articles through shared AI topics and tags.

Knowledge Graph100 articles · 185 connections
Scroll to zoom · drag to pan · click to open

Discussion

Sign in to join the discussion

No comments yet — be the first to share your thoughts!

More in Frontier Research