Live
Black Hat USADark ReadingBlack Hat AsiaAI BusinessThis International Fact-Checking Day, use these 5 tips to spot AI-generated contentFast Company TechFair decisions, clear reasons: Creating fuzzy AI with fairness built in from the startPhys.org AIICE says it bought Paragon s spyware to use in drug trafficking casesTechCrunchOpenClaw Unlocks China’s AI Token Export Business - Bloomberg.comGNews AI ChinaDay 13: Why Good Models Fail in the Real World (Data Leakage)Medium AISmart solutions for sustainable energy: Machine learning powers biochar production from aquatic biomass - EurekAlert!Google News: Machine LearningIran Reportedly Executing Political Prisoners As War With Israel And U.S. Rages OnInternational Business TimesI Built a 6-Agent AI System in a WeekendMedium AIM-Files and Microsoft deepen strategic AI partnership - FinTech GlobalGNews AI CopilotGenerative AI shifts from market boom to disruption risk - FinTech GlobalGoogle News: Generative AIChatGPT shopping: How it works, and how to get your products listed - AOL.comGoogle News: ChatGPTAgentic Coding: The Risks and Pitfalls Nobody Talks AboutMedium AIBlack Hat USADark ReadingBlack Hat AsiaAI BusinessThis International Fact-Checking Day, use these 5 tips to spot AI-generated contentFast Company TechFair decisions, clear reasons: Creating fuzzy AI with fairness built in from the startPhys.org AIICE says it bought Paragon s spyware to use in drug trafficking casesTechCrunchOpenClaw Unlocks China’s AI Token Export Business - Bloomberg.comGNews AI ChinaDay 13: Why Good Models Fail in the Real World (Data Leakage)Medium AISmart solutions for sustainable energy: Machine learning powers biochar production from aquatic biomass - EurekAlert!Google News: Machine LearningIran Reportedly Executing Political Prisoners As War With Israel And U.S. Rages OnInternational Business TimesI Built a 6-Agent AI System in a WeekendMedium AIM-Files and Microsoft deepen strategic AI partnership - FinTech GlobalGNews AI CopilotGenerative AI shifts from market boom to disruption risk - FinTech GlobalGoogle News: Generative AIChatGPT shopping: How it works, and how to get your products listed - AOL.comGoogle News: ChatGPTAgentic Coding: The Risks and Pitfalls Nobody Talks AboutMedium AI
AI NEWS HUBbyEIGENVECTOREigenvector

Generating Humanless Environment Walkthroughs from Egocentric Walking Tour Videos

arXiv cs.CVby Yujin Ham, Junho Kim, Vivek Boominathan, Guha BalakrishnanApril 1, 20262 min read0 views
Source Quiz

arXiv:2603.29036v1 Announce Type: new Abstract: Egocentric "walking tour" videos provide a rich source of image data to develop rich and diverse visual models of environments around the world. However, the significant presence of humans in frames of these videos due to crowds and eye-level camera perspectives mitigates their usefulness in environment modeling applications. We focus on addressing this challenge by developing a generative algorithm that can realistically remove (i.e., inpaint) humans and their associated shadow effects from walking tour videos. Key to our approach is the construction of a rich semi-synthetic dataset of video clip pairs to train this generative model. Each pair in the dataset consists of an environment-only background clip, and a composite clip of walking hum

View PDF HTML (experimental)

Abstract:Egocentric "walking tour" videos provide a rich source of image data to develop rich and diverse visual models of environments around the world. However, the significant presence of humans in frames of these videos due to crowds and eye-level camera perspectives mitigates their usefulness in environment modeling applications. We focus on addressing this challenge by developing a generative algorithm that can realistically remove (i.e., inpaint) humans and their associated shadow effects from walking tour videos. Key to our approach is the construction of a rich semi-synthetic dataset of video clip pairs to train this generative model. Each pair in the dataset consists of an environment-only background clip, and a composite clip of walking humans with simulated shadows overlaid on the background. We randomly sourced both foreground and background components from real egocentric walking tour videos around the world to maintain visual diversity. We then used this dataset to fine-tune the state-of-the-art Casper video diffusion model for object and effects inpainting, and demonstrate that the resulting model performs far better than Casper both qualitatively and quantitatively at removing humans from walking tour clips with significant human presence and complex backgrounds. Finally, we show that the resulting generated clips can be used to build successful 3D/4D models of urban locations.

Subjects:

Computer Vision and Pattern Recognition (cs.CV)

Cite as: arXiv:2603.29036 [cs.CV]

(or arXiv:2603.29036v1 [cs.CV] for this version)

https://doi.org/10.48550/arXiv.2603.29036

arXiv-issued DOI via DataCite (pending registration)

Submission history

From: Yujin Ham [view email] [v1] Mon, 30 Mar 2026 22:08:46 UTC (48,369 KB)

Was this article helpful?

Sign in to highlight and annotate this article

AI
Ask AI about this article
Powered by Eigenvector · full article context loaded
Ready

Conversation starters

Ask anything about this article…

Daily AI Digest

Get the top 5 AI stories delivered to your inbox every morning.

Knowledge Map

Knowledge Map
TopicsEntitiesSource
Generating …modelannounceapplicationperspectivecomponentarxivarXiv cs.CV

Connected Articles — Knowledge Graph

This article is connected to other articles through shared AI topics and tags.

Knowledge Graph100 articles · 176 connections
Scroll to zoom · drag to pan · click to open

Discussion

Sign in to join the discussion

No comments yet — be the first to share your thoughts!