
UniSER: A Foundation Model for Unified Soft Effects Removal

arXiv [Submitted on 18 Nov 2025 (v1), last revised 27 Mar 2026 (this version, v2)]
March 30, 2026


Authors: Jingdong Zhang, Lingzhi Zhang, Qing Liu, Mang Tik Chiu, Connelly Barnes, Yizhou Wang, Haoran You, Xiaoyang Liu, Yuqian Zhou, Zhe Lin, Eli Shechtman, Sohrab Amirghodsi, Xin Li, Wenping Wang, Xiaohang Zhan


Abstract: Digital images are often degraded by soft effects such as lens flare, haze, shadows, and reflections, which reduce aesthetics even though the underlying pixels remain partially visible. Prevailing works address these degradations in isolation, developing highly specialized models that lack scalability and fail to exploit what these restoration problems have in common. Meanwhile, although recent large-scale generalist models (e.g., GPT-4o, Flux Kontext, Nano Banana) offer powerful text-driven editing capabilities, they rely heavily on detailed prompts and often fail to achieve robust removal on such fine-grained tasks while preserving the scene's identity. Leveraging the common essence of soft effects, i.e., semi-transparent occlusions, we introduce UniSER, a versatile foundation model capable of addressing diverse degradations caused by soft effects within a single framework. Our methodology centers on two components: a curated 3.8M-pair dataset that ensures robustness and generalization and includes novel, physically plausible data to fill critical gaps in public benchmarks; and a tailored training pipeline that fine-tunes a Diffusion Transformer to learn robust restoration priors from this diverse data, integrating fine-grained mask and strength controls. Together, these allow UniSER to significantly outperform both specialist and generalist models, achieving robust, high-fidelity restoration in the wild.
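The abstract describes soft effects as semi-transparent occlusions over pixels that remain partially visible. One common way to picture this is generic alpha compositing, sketched below in plain Python. This is an illustrative assumption, not the formulation or pipeline used by UniSER: the functions `composite`, `recover`, and `remove_with_strength`, the blend equation, and the interpretation of "strength" as a linear interpolation are all hypothetical.

```python
# Hypothetical illustration of a semi-transparent occlusion using generic
# alpha compositing. This is NOT taken from the UniSER paper.

def composite(background, effect, alpha):
    """Blend an effect layer over a clean pixel: I = (1 - alpha) * B + alpha * E."""
    return [(1.0 - alpha) * b + alpha * e for b, e in zip(background, effect)]


def recover(observed, effect, alpha):
    """Invert the blend when the effect layer and alpha are known (alpha < 1)."""
    if not 0.0 <= alpha < 1.0:
        raise ValueError("alpha must be in [0, 1) for the background to be recoverable")
    return [(i - alpha * e) / (1.0 - alpha) for i, e in zip(observed, effect)]


def remove_with_strength(observed, restored, strength):
    """Assumed strength control: interpolate from degraded (0) to restored (1)."""
    return [(1.0 - strength) * o + strength * r for o, r in zip(observed, restored)]


clean = [0.2, 0.5, 0.8]   # RGB of the underlying scene pixel
flare = [1.0, 1.0, 0.9]   # RGB of a bright, flare-like effect layer

degraded = composite(clean, flare, alpha=0.4)
restored = recover(degraded, flare, alpha=0.4)
half_removed = remove_with_strength(degraded, restored, strength=0.5)
```

In this toy setting the effect layer and its opacity are known, so the clean pixel can be recovered exactly. In real photographs neither is known, which is why a learned restoration prior of the kind the abstract describes is needed.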

Subjects: Computer Vision and Pattern Recognition (cs.CV)

Cite as: arXiv:2511.14183 [cs.CV]

(or arXiv:2511.14183v2 [cs.CV] for this version)

https://doi.org/10.48550/arXiv.2511.14183 (arXiv-issued DOI via DataCite)

Submission history

From: Jingdong Zhang
[v1] Tue, 18 Nov 2025 06:39:39 UTC (43,492 KB)
[v2] Fri, 27 Mar 2026 07:15:47 UTC (55,091 KB)
