Live
Black Hat USADark ReadingBlack Hat AsiaAI BusinessThis International Fact-Checking Day, use these 5 tips to spot AI-generated contentFast Company TechDay 13: Why Good Models Fail in the Real World (Data Leakage)Medium AISmart solutions for sustainable energy: Machine learning powers biochar production from aquatic biomass - EurekAlert!Google News: Machine LearningIran Reportedly Executing Political Prisoners As War With Israel And U.S. Rages OnInternational Business TimesI Built a 6-Agent AI System in a WeekendMedium AIGenerative AI shifts from market boom to disruption risk - FinTech GlobalGoogle News: Generative AIChatGPT shopping: How it works, and how to get your products listed - AOL.comGoogle News: ChatGPTAgentic Coding: The Risks and Pitfalls Nobody Talks AboutMedium AIHow to Make Money with AI in 2026 (Even If You’re Starting from Zero)Medium AIYour Company Is Spending on AI. The Numbers Are Not Adding Up. Here Is What Is Actually Happening.Medium AIIn the AI Era, Just Get FitMedium AIMy Salary Doubled After I Added These 4 Skills to My Resume — All Free to LearnMedium AIBlack Hat USADark ReadingBlack Hat AsiaAI BusinessThis International Fact-Checking Day, use these 5 tips to spot AI-generated contentFast Company TechDay 13: Why Good Models Fail in the Real World (Data Leakage)Medium AISmart solutions for sustainable energy: Machine learning powers biochar production from aquatic biomass - EurekAlert!Google News: Machine LearningIran Reportedly Executing Political Prisoners As War With Israel And U.S. Rages OnInternational Business TimesI Built a 6-Agent AI System in a WeekendMedium AIGenerative AI shifts from market boom to disruption risk - FinTech GlobalGoogle News: Generative AIChatGPT shopping: How it works, and how to get your products listed - AOL.comGoogle News: ChatGPTAgentic Coding: The Risks and Pitfalls Nobody Talks AboutMedium AIHow to Make Money with AI in 2026 (Even If You’re Starting from Zero)Medium AIYour Company Is Spending on AI. The Numbers Are Not Adding Up. Here Is What Is Actually Happening.Medium AIIn the AI Era, Just Get FitMedium AIMy Salary Doubled After I Added These 4 Skills to My Resume — All Free to LearnMedium AI
AI NEWS HUBbyEIGENVECTOREigenvector

CLaD: Planning with Grounded Foresight via Cross-Modal Latent Dynamics

arXiv cs.ROby Andrew Jeong, Jaemin Kim, Sebin Lee, Sung-Eui YoonApril 1, 20261 min read0 views
Source Quiz

arXiv:2603.29409v1 Announce Type: new Abstract: Robotic manipulation involves kinematic and semantic transitions that are inherently coupled via underlying actions. However, existing approaches plan within either semantic or latent space without explicitly aligning these cross-modal transitions. To address this, we propose CLaD, a framework that models how proprioceptive and semantic states jointly evolve under actions through asymmetric cross-attention that allows kinematic transitions to query semantic ones. CLaD predicts grounded latent foresights via self-supervised objectives with EMA target encoders and auxiliary reconstruction losses, preventing representation collapse while anchoring predictions to observable states. Predicted foresights are modulated with observations to condition

View PDF HTML (experimental)

Abstract:Robotic manipulation involves kinematic and semantic transitions that are inherently coupled via underlying actions. However, existing approaches plan within either semantic or latent space without explicitly aligning these cross-modal transitions. To address this, we propose CLaD, a framework that models how proprioceptive and semantic states jointly evolve under actions through asymmetric cross-attention that allows kinematic transitions to query semantic ones. CLaD predicts grounded latent foresights via self-supervised objectives with EMA target encoders and auxiliary reconstruction losses, preventing representation collapse while anchoring predictions to observable states. Predicted foresights are modulated with observations to condition a diffusion policy for action generation. On LIBERO-LONG benchmark, CLaD achieves 94.7% success rate, competitive with large VLAs with significantly fewer parameters.

Comments: Project page: this https URL

Subjects:

Robotics (cs.RO)

Cite as: arXiv:2603.29409 [cs.RO]

(or arXiv:2603.29409v1 [cs.RO] for this version)

https://doi.org/10.48550/arXiv.2603.29409

arXiv-issued DOI via DataCite (pending registration)

Submission history

From: Andrew Jeong [view email] [v1] Tue, 31 Mar 2026 08:13:45 UTC (21,383 KB)

Was this article helpful?

Sign in to highlight and annotate this article

AI
Ask AI about this article
Powered by Eigenvector · full article context loaded
Ready

Conversation starters

Ask anything about this article…

Daily AI Digest

Get the top 5 AI stories delivered to your inbox every morning.

Knowledge Map

Knowledge Map
TopicsEntitiesSource
CLaD: Plann…modelbenchmarkannouncepredictionpolicyarxivarXiv cs.RO

Connected Articles — Knowledge Graph

This article is connected to other articles through shared AI topics and tags.

Knowledge Graph100 articles · 185 connections
Scroll to zoom · drag to pan · click to open

Discussion

Sign in to join the discussion

No comments yet — be the first to share your thoughts!