Live
Black Hat USADark ReadingBlack Hat AsiaAI BusinessMassachusetts Sen. Ed Markey is putting AV firms on blast for using human staffersFast Company TechST’s smart IMU bolsters Qualcomm’s monster AI chip for wearablesFierce ElectronicsRound three: More Rising Stars 2026Fierce ElectronicsQ/A: How engineers must design AVs to drive safelyFierce ElectronicsBosch’s pressure sensor is part of Qualcomm’s new wearables chipFierce ElectronicsQ/A: Lumotive CTO talks software-defined optical sensingFierce ElectronicsOpenAI contract with U.S. Cyber Command went unnoticed amid degradation of transparency and veracity of U.S. procurement database - All-Source Intelligence | Jack PoulsonGoogle News: OpenAIEDITORIAL: Benefits of generative AI do not outweigh drawbacks - The Daily TargumGoogle News: Generative AIHere's the severance package Oracle offered laid-off US employeesBusiness InsiderTeenager died after asking ChatGPT for ‘most successful’ way to take his life, inquest told - The GuardianGoogle News: ChatGPTBig Tech firms are accelerating AI investments and integration, while regulators and companies focus on safety and responsible adoption.Dev.to AIChild Soldiers in Tehran: Iran’s Security Crisis DeepensDev.to AIBlack Hat USADark ReadingBlack Hat AsiaAI BusinessMassachusetts Sen. Ed Markey is putting AV firms on blast for using human staffersFast Company TechST’s smart IMU bolsters Qualcomm’s monster AI chip for wearablesFierce ElectronicsRound three: More Rising Stars 2026Fierce ElectronicsQ/A: How engineers must design AVs to drive safelyFierce ElectronicsBosch’s pressure sensor is part of Qualcomm’s new wearables chipFierce ElectronicsQ/A: Lumotive CTO talks software-defined optical sensingFierce ElectronicsOpenAI contract with U.S. Cyber Command went unnoticed amid degradation of transparency and veracity of U.S. procurement database - All-Source Intelligence | Jack PoulsonGoogle News: OpenAIEDITORIAL: Benefits of generative AI do not outweigh drawbacks - The Daily TargumGoogle News: Generative AIHere's the severance package Oracle offered laid-off US employeesBusiness InsiderTeenager died after asking ChatGPT for ‘most successful’ way to take his life, inquest told - The GuardianGoogle News: ChatGPTBig Tech firms are accelerating AI investments and integration, while regulators and companies focus on safety and responsible adoption.Dev.to AIChild Soldiers in Tehran: Iran’s Security Crisis DeepensDev.to AI

PhysMem: Scaling Test-time Physical Memory for Robot Manipulation

arXivMarch 31, 202610 min read0 views
Source Quiz

arXiv:2602.20323v4 Announce Type: replace-cross Abstract: Reliable object manipulation requires understanding physical properties that vary across objects and environments. Vision-language model (VLM) planners can reason about friction and stability in general terms; however, they often cannot predict how a specific ball will roll on a particular surface or which stone will provide a stable foundation without direct experience. We present PhysMem, a memory framework that enables VLM robot planners to learn physical principles from interaction at test time, without updating model parameters. Th — Haoyang Li, Yang You, Hao Su, Leonidas Guibas

View PDF HTML (experimental)

Abstract:Reliable object manipulation requires understanding physical properties that vary across objects and environments. Vision-language model (VLM) planners can reason about friction and stability in general terms; however, they often cannot predict how a specific ball will roll on a particular surface or which stone will provide a stable foundation without direct experience. We present PhysMem, a memory framework that enables VLM robot planners to learn physical principles from interaction at test time, without updating model parameters. The system records experiences, generates candidate hypotheses, and verifies them through targeted interaction before promoting validated knowledge to guide future decisions. A central design choice is verification before application: the system tests hypotheses against new observations rather than applying retrieved experience directly, reducing rigid reliance on prior experience when physical conditions change. We evaluate PhysMem on three real-world manipulation tasks and simulation benchmarks across four VLM backbones. On a controlled brick insertion task, principled abstraction achieves 76% success compared to 23% for direct experience retrieval, and real-world experiments show consistent improvement over 30-minute deployment sessions.

Subjects:

Robotics (cs.RO); Artificial Intelligence (cs.AI)

Cite as: arXiv:2602.20323 [cs.RO]

(or arXiv:2602.20323v4 [cs.RO] for this version)

https://doi.org/10.48550/arXiv.2602.20323

arXiv-issued DOI via DataCite

Submission history

From: Haoyang Li [view email] [v1] Mon, 23 Feb 2026 20:18:35 UTC (18,772 KB) [v2] Wed, 4 Mar 2026 04:33:20 UTC (18,767 KB) [v3] Mon, 23 Mar 2026 00:23:00 UTC (18,767 KB) [v4] Sat, 28 Mar 2026 02:43:53 UTC (18,768 KB)

Original source

arXiv

Was this article helpful?

Sign in to highlight and annotate this article

AI
Ask AI about this article
Powered by AI News Hub · full article context loaded
Ready

Conversation starters

Ask anything about this article…

Daily AI Digest

Get the top 5 AI stories delivered to your inbox every morning.

More about

researchpaperarxiv

Knowledge Map

Knowledge Map
TopicsEntitiesSource
PhysMem: Sc…researchpaperarxivaiartificial-…arXiv

Connected Articles — Knowledge Graph

This article is connected to other articles through shared AI topics and tags.

Knowledge Graph100 articles · 175 connections
Scroll to zoom · drag to pan · click to open

Discussion

Sign in to join the discussion

No comments yet — be the first to share your thoughts!

More in Research Papers