Live
Black Hat USAAI BusinessBlack Hat AsiaAI BusinessWhy do I believe preserving structure is enough?LessWrong AILinear Regression Explained: The Only 6 Terms You Need to KnowTowards AIInternet Watch Foundation finds 260-fold increase in AI-generated CSAM in just one year, and it s the tip of the icebergFortune TechMCP Observability: Logging, Auditing, and Debugging Agent-Server Interactions in ProductionDEV CommunityHIMSSCast: Adopting AI with purpose as a health system - MobiHealthNewsGNews AI healthcareEfficient Real-Time Flight Tracking in Browsers: Framework-Free, Cross-Platform SolutionDEV CommunityI Built a Visual Spec-Driven Development Extension for VS Code That Works With Any LLMDEV CommunityFinancialClaw: making OpenClaw useful for personal financeDEV CommunityOpenAI acquires TBPNDEV CommunityA Human Asked Me to Build a Game About My Life. So I Did.DEV CommunityFinancialClaw: haciendo útil a OpenClaw para finanzas personalesDEV CommunitySources: Meta has paused its work with Mercor while it investigates a security breach at the data vendor; OpenAI says it is investigating the security incident (Wired)TechmemeBlack Hat USAAI BusinessBlack Hat AsiaAI BusinessWhy do I believe preserving structure is enough?LessWrong AILinear Regression Explained: The Only 6 Terms You Need to KnowTowards AIInternet Watch Foundation finds 260-fold increase in AI-generated CSAM in just one year, and it s the tip of the icebergFortune TechMCP Observability: Logging, Auditing, and Debugging Agent-Server Interactions in ProductionDEV CommunityHIMSSCast: Adopting AI with purpose as a health system - MobiHealthNewsGNews AI healthcareEfficient Real-Time Flight Tracking in Browsers: Framework-Free, Cross-Platform SolutionDEV CommunityI Built a Visual Spec-Driven Development Extension for VS Code That Works With Any LLMDEV CommunityFinancialClaw: making OpenClaw useful for personal financeDEV CommunityOpenAI acquires TBPNDEV CommunityA Human Asked Me to Build a Game About My Life. So I Did.DEV CommunityFinancialClaw: haciendo útil a OpenClaw para finanzas personalesDEV CommunitySources: Meta has paused its work with Mercor while it investigates a security breach at the data vendor; OpenAI says it is investigating the security incident (Wired)Techmeme
AI NEWS HUBbyEIGENVECTOREigenvector

PANDORA: Pixel-wise Attention Dissolution and Latent Guidance for Zero-Shot Object Removal

arXivMarch 31, 20262 min read1 views
Source Quiz

arXiv:2603.27555v1 Announce Type: new Abstract: Removing objects from natural images is challenging due to difficulty of synthesizing semantically coherent content while preserving background integrity. Existing methods often rely on fine-tuning, prompt engineering, or inference-time optimization, yet still suffer from texture inconsistency, rigid artifacts, weak foreground-background disentanglement, and poor scalability for multi-object removal. We propose a novel zero-shot object removal framework, namely PANDORA, that operates directly on pre-trained text-to-image diffusion models, requiri — Dinh-Khoi Vo, Van-Loc Nguyen, Tam V. Nguyen, Minh-Triet Tran, Trung-Nghia Le

View PDF HTML (experimental)

Abstract:Removing objects from natural images is challenging due to difficulty of synthesizing semantically coherent content while preserving background integrity. Existing methods often rely on fine-tuning, prompt engineering, or inference-time optimization, yet still suffer from texture inconsistency, rigid artifacts, weak foreground-background disentanglement, and poor scalability for multi-object removal. We propose a novel zero-shot object removal framework, namely PANDORA, that operates directly on pre-trained text-to-image diffusion models, requiring no fine-tuning, prompts, or optimization. We propose Pixel-wise Attention Dissolution to remove object by nullifying the most correlated attention keys for masked pixels, effectively eliminating the object from self-attention flow and allowing background context to dominate reconstruction. We further introduce Localized Attentional Disentanglement Guidance to steer denoising toward latent manifolds favorable to clean object removal. Together, these components enable precise, non-rigid, prompt-free, and scalable multi-object erasure in a single pass. Experiments demonstrate superior visual fidelity and semantic plausibility compared to state-of-the-art methods. The project page is available at this https URL.

Comments: ICME 2026

Subjects:

Computer Vision and Pattern Recognition (cs.CV)

Cite as: arXiv:2603.27555 [cs.CV]

(or arXiv:2603.27555v1 [cs.CV] for this version)

https://doi.org/10.48550/arXiv.2603.27555

arXiv-issued DOI via DataCite (pending registration)

Submission history

From: Trung Nghia Le [view email] [v1] Sun, 29 Mar 2026 07:34:08 UTC (8,981 KB)

Was this article helpful?

Sign in to highlight and annotate this article

AI
Ask AI about this article
Powered by Eigenvector · full article context loaded
Ready

Conversation starters

Ask anything about this article…

Daily AI Digest

Get the top 5 AI stories delivered to your inbox every morning.

Knowledge Map

Knowledge Map
TopicsEntitiesSource
PANDORA: Pi…researchpaperarxivcomputer-vi…image-recog…arXiv

Connected Articles — Knowledge Graph

This article is connected to other articles through shared AI topics and tags.

Knowledge Graph100 articles · 158 connections
Scroll to zoom · drag to pan · click to open

Discussion

Sign in to join the discussion

No comments yet — be the first to share your thoughts!