InstaVSR: Taming Diffusion for Efficient and Temporally Consistent Video Super-Resolution

arXiv, March 30, 2026

Authors: Jintong Hu, Bin Chen, Zhenyu Hu, Jiayue Liu, Guo Wang, Lu Qi


Abstract: Video super-resolution (VSR) seeks to reconstruct high-resolution frames from low-resolution inputs. While diffusion-based methods have substantially improved perceptual quality, extending them to video remains challenging for two reasons: strong generative priors can introduce temporal instability, and multi-frame diffusion pipelines are often too expensive for practical deployment. To address both challenges simultaneously, we propose InstaVSR, a lightweight diffusion framework for efficient video super-resolution. InstaVSR combines three ingredients: (1) a pruned one-step diffusion backbone that removes several costly components from conventional diffusion-based VSR pipelines, (2) recurrent training with flow-guided temporal regularization to improve frame-to-frame stability, and (3) dual-space adversarial learning in latent and pixel spaces to preserve perceptual quality after backbone simplification. On an NVIDIA RTX 4090, InstaVSR processes a 30-frame video at 2K$\times$2K resolution in under one minute with only 7 GB of memory usage, substantially reducing the computational cost compared to existing diffusion-based methods while maintaining favorable perceptual quality with significantly smoother temporal transitions.
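The abstract does not give the form of the flow-guided temporal regularization, so the following is only a minimal sketch of one common way such a term is built: warp the previous output frame toward the current one using optical flow, then penalize the remaining difference. The L1 penalty, nearest-neighbour sampling, border clamping, and the names `warp_previous` and `temporal_consistency_loss` are all assumptions made for illustration and are not taken from the paper.

```python
from typing import List, Tuple

Frame = List[List[float]]            # grayscale frame, indexed frame[y][x]
Flow = List[List[Tuple[int, int]]]   # flow[y][x] = (dx, dy), current -> previous


def warp_previous(prev: Frame, flow: Flow) -> Frame:
    """Backward-warp `prev` into the current frame's coordinates.

    Each output pixel (x, y) samples prev at (x + dx, y + dy) with
    nearest-neighbour lookup; coordinates are clamped at the borders.
    (Hypothetical helper, not the authors' implementation.)
    """
    h, w = len(prev), len(prev[0])
    warped = []
    for y in range(h):
        row = []
        for x in range(w):
            dx, dy = flow[y][x]
            sx = min(max(int(round(x + dx)), 0), w - 1)
            sy = min(max(int(round(y + dy)), 0), h - 1)
            row.append(prev[sy][sx])
        warped.append(row)
    return warped


def temporal_consistency_loss(curr: Frame, prev: Frame, flow: Flow) -> float:
    """Mean absolute difference between `curr` and the flow-warped `prev`.

    Assumed L1 form; a low value means the two frames agree once motion
    has been compensated.
    """
    warped = warp_previous(prev, flow)
    h, w = len(curr), len(curr[0])
    total = sum(abs(curr[y][x] - warped[y][x]) for y in range(h) for x in range(w))
    return total / (h * w)


# Toy example: a single bright pixel moves one step to the right.
prev_frame = [[0.0, 0.0, 0.0, 0.0],
              [0.0, 1.0, 0.0, 0.0],
              [0.0, 0.0, 0.0, 0.0]]
curr_frame = [[0.0, 0.0, 0.0, 0.0],
              [0.0, 0.0, 1.0, 0.0],
              [0.0, 0.0, 0.0, 0.0]]

true_flow = [[(-1, 0)] * 4 for _ in range(3)]   # look one pixel to the left
zero_flow = [[(0, 0)] * 4 for _ in range(3)]    # ignore motion

loss_with_flow = temporal_consistency_loss(curr_frame, prev_frame, true_flow)
loss_without_flow = temporal_consistency_loss(curr_frame, prev_frame, zero_flow)
```

In the toy example the loss vanishes when the flow matches the motion and is positive when motion is ignored, which is the behaviour a regularizer of this kind relies on. A training setup would typically use a differentiable bilinear warp on tensors and an occlusion mask instead of this list-based version.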

Comments: 12 pages, 7 figures

Subjects: Computer Vision and Pattern Recognition (cs.CV)

Cite as: arXiv:2603.26134 [cs.CV]

(or arXiv:2603.26134v1 [cs.CV] for this version)

https://doi.org/10.48550/arXiv.2603.26134

arXiv-issued DOI via DataCite (pending registration)

Submission history

From: Jintong Hu
[v1] Fri, 27 Mar 2026 07:33:13 UTC (3,641 KB)
