AffordGrasp: Cross-Modal Diffusion for Affordance-Aware Grasp Synthesis
arXiv:2603.08021v2 Announce Type: replace-cross Abstract: Generating human grasping poses that accurately reflect both object geometry and user-specified interaction semantics is essential for natural hand-object interactions in AR/VR and embodied AI. However, existing semantic grasping approaches struggle with the large modality gap between 3D object representations and textual instructions, and often lack explicit spatial or semantic constraints, leading to physically invalid or semantically inconsistent grasps. In this work, we present AffordGrasp, a diffusion-based framework that produces — Xiaofei Wu, Yi Zhang, Yumeng Liu, Yuexin Ma, Yujiao Shi, Xuming He
View PDF HTML (experimental)
Abstract:Generating human grasping poses that accurately reflect both object geometry and user-specified interaction semantics is essential for natural hand-object interactions in AR/VR and embodied AI. However, existing semantic grasping approaches struggle with the large modality gap between 3D object representations and textual instructions, and often lack explicit spatial or semantic constraints, leading to physically invalid or semantically inconsistent grasps. In this work, we present AffordGrasp, a diffusion-based framework that produces physically stable and semantically faithful human grasps with high precision. We first introduce a scalable annotation pipeline that automatically enriches hand-object interaction datasets with fine-grained structured language labels capturing interaction intent. Building upon these annotations, AffordGrasp integrates an affordance-aware latent representation of hand poses with a dual-conditioning diffusion process, enabling the model to jointly reason over object geometry, spatial affordances, and instruction semantics. A distribution adjustment module further enforces physical contact consistency and semantic alignment. We evaluate AffordGrasp across four instruction-augmented benchmarks derived from HO-3D, OakInk, GRAB, and AffordPose, and observe substantial improvements over state-of-the-art methods in grasp quality, semantic accuracy, and diversity.
Comments: CVPR 2026
Subjects:
Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
Cite as: arXiv:2603.08021 [cs.RO]
(or arXiv:2603.08021v2 [cs.RO] for this version)
https://doi.org/10.48550/arXiv.2603.08021
arXiv-issued DOI via DataCite
Submission history
From: Xiaofei Wu [view email] [v1] Mon, 9 Mar 2026 06:56:35 UTC (16,745 KB) [v2] Sat, 28 Mar 2026 15:33:35 UTC (16,617 KB)
Sign in to highlight and annotate this article

Conversation starters
Daily AI Digest
Get the top 5 AI stories delivered to your inbox every morning.
More about
researchpaperarxiv
Andrej Karpathy's new open source 'autoresearch' lets you run hundreds of AI experiments a night — with revolutionary implications - VentureBeat
Andrej Karpathy's new open source 'autoresearch' lets you run hundreds of AI experiments a night — with revolutionary implications VentureBeat

Vector researchers presented more than 50 papers at ICML 2024
Vector researchers presented more than 50 papers at the 2024 International Conference on Machine Learning (ICML). 35 papers co-authored by Vector Faculty Members were accepted to the conference, with a [ ] The post Vector researchers presented more than 50 papers at ICML 2024 appeared first on Vector Institute for Artificial Intelligence .
Knowledge Map
Connected Articles — Knowledge Graph
This article is connected to other articles through shared AI topics and tags.
More in Research Papers

Vector researchers presented more than 50 papers at ICML 2024
Vector researchers presented more than 50 papers at the 2024 International Conference on Machine Learning (ICML). 35 papers co-authored by Vector Faculty Members were accepted to the conference, with a [ ] The post Vector researchers presented more than 50 papers at ICML 2024 appeared first on Vector Institute for Artificial Intelligence .

Vector Researchers present papers at ACL 2024
Vector researchers will be well represented at the 62nd Annual Meeting of the Association for Computational Linguistics in Bangkok, Thailand this year. 14 papers co-authored by Vector-affiliated researchers are being [ ] The post Vector Researchers present papers at ACL 2024 appeared first on Vector Institute for Artificial Intelligence .



Discussion
Sign in to join the discussion
No comments yet — be the first to share your thoughts!