
Environment Maps: Structured Environmental Representations for Long-Horizon Agents

arXiv, March 30, 2026

arXiv:2603.23610v3 (announce type: replace)

Authors: Yenchia Feng, Chirag Sharma, Karime Maamari


Abstract: Although large language models (LLMs) have advanced rapidly, robust automation of complex software workflows remains an open problem. In long-horizon settings, agents frequently suffer from cascading errors and environmental stochasticity; a single misstep in a dynamic interface can lead to task failure, resulting in hallucinations or trial-and-error. This paper introduces Environment Maps: a persistent, agent-agnostic representation that mitigates these failures by consolidating heterogeneous evidence, such as screen recordings and execution traces, into a structured graph. The representation consists of four core components: (1) Contexts (abstracted locations), (2) Actions (parameterized affordances), (3) Workflows (observed trajectories), and (4) Tacit Knowledge (domain definitions and reusable procedures). We evaluate this framework on the WebArena benchmark across five domains. Agents equipped with environment maps achieve a 28.2% success rate, nearly doubling the performance of baselines limited to session-bound context (14.2%) and outperforming agents that have access to the raw trajectory data used to generate the environment maps (23.3%). By providing a structured interface between the model and the environment, Environment Maps establish a persistent foundation for long-horizon planning that is human-interpretable, editable, and incrementally refinable.
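To make the four components in the abstract concrete, here is a minimal Python sketch of how such a graph might be held in memory. It is not the authors' implementation: every class, field, and method name below (`Action`, `Context`, `Workflow`, `EnvironmentMap`, `add_action`, `reachable_from`) is a hypothetical choice made for illustration, and the example contexts are invented.

```python
from __future__ import annotations

from collections import deque
from dataclasses import dataclass, field
from typing import Optional


@dataclass(frozen=True)
class Action:
    """A parameterized affordance (hypothetical fields, not from the paper)."""
    name: str
    parameters: tuple = ()
    target_context: Optional[str] = None  # context reached afterwards, if known


@dataclass
class Context:
    """An abstracted location, e.g. a page type rather than a concrete URL."""
    name: str
    actions: dict = field(default_factory=dict)  # action name -> Action


@dataclass
class Workflow:
    """An observed trajectory: ordered (context name, action name) steps."""
    goal: str
    steps: list


@dataclass
class EnvironmentMap:
    """Container for the four components named in the abstract."""
    contexts: dict = field(default_factory=dict)         # name -> Context
    workflows: list = field(default_factory=list)        # Workflow objects
    tacit_knowledge: dict = field(default_factory=dict)  # term -> definition

    def add_action(self, context_name: str, action: Action) -> None:
        """Attach an action to a context, creating missing contexts on demand."""
        ctx = self.contexts.setdefault(context_name, Context(context_name))
        ctx.actions[action.name] = action
        if action.target_context is not None:
            self.contexts.setdefault(
                action.target_context, Context(action.target_context)
            )

    def reachable_from(self, start: str) -> list:
        """Contexts reachable from `start` via action edges, in BFS order."""
        seen = [start]
        queue = deque([start])
        while queue:
            ctx = self.contexts.get(queue.popleft())
            if ctx is None:
                continue
            for action in ctx.actions.values():
                nxt = action.target_context
                if nxt is not None and nxt not in seen:
                    seen.append(nxt)
                    queue.append(nxt)
        return seen


# Invented example: a tiny map of an order-management site.
demo = EnvironmentMap()
demo.add_action("order_list", Action("open_order", ("order_id",), "order_detail"))
demo.add_action("order_detail", Action("start_refund", (), "refund_form"))
demo.workflows.append(
    Workflow("refund an order", [("order_list", "open_order"),
                                 ("order_detail", "start_refund")])
)
demo.tacit_knowledge["refund window"] = "hypothetical domain definition"
```

Because the map lives outside any single agent session, a structure like this can be inspected, edited, and extended incrementally, which is the property the abstract emphasizes.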

Comments: 9 pages, 5 figures; accepted to the 2nd Workshop on World Models at ICLR 2026; formatting updated

Subjects: Artificial Intelligence (cs.AI)

Cite as: arXiv:2603.23610 [cs.AI]

(or arXiv:2603.23610v3 [cs.AI] for this version)

DOI: https://doi.org/10.48550/arXiv.2603.23610 (arXiv-issued DOI via DataCite)

Submission history

From: Chirag Sharma
[v1] Tue, 24 Mar 2026 18:00:56 UTC (1,042 KB)
[v2] Thu, 26 Mar 2026 01:28:12 UTC (1,042 KB)
[v3] Fri, 27 Mar 2026 03:53:46 UTC (1,042 KB)
