Live
Black Hat USAAI BusinessBlack Hat AsiaAI BusinessGenesis Agent – A self-modifying AI agent that runs local (Electron, Ollama)Hacker News AI TopShow HN: Currant – Anonymus social media for NON-AI agentsHacker News AI TopTourism Tech Revolution in Japan is Changing Everything: Aurora Mobile Unleashes AI That Talks to Tourists Like a Local! - Travel And Tour WorldGNews AI JapanUniversity of Chicago's "self-driving" lab automates experiments in quantum computing research - CBS NewsGoogle News: AIMajority of college students use AI for their coursework, poll finds - upi.comGNews AI USAI Tried Building My Own AI… Here’s What Actually HappenedDEV CommunityShow HN: OpenVole – VoleNet Distributed AI Agent NetworkingHacker News AI TopFilesystem for AI Agents: What I Learned Building OneDEV CommunityGoogle debuts Gemma 4 open AI models for local use - TestingCatalogGNews AI multimodalAI’s Uncertain Cost Effects in Health Care - American Enterprise Institute - AEIGNews AI healthcareWhat happened when they installed ChatGPT on a nuclear supercomputer - OutriderGoogle News: ChatGPTSony's gaming division just bought an AI startup that turns photos into 3D volumesEngadgetBlack Hat USAAI BusinessBlack Hat AsiaAI BusinessGenesis Agent – A self-modifying AI agent that runs local (Electron, Ollama)Hacker News AI TopShow HN: Currant – Anonymus social media for NON-AI agentsHacker News AI TopTourism Tech Revolution in Japan is Changing Everything: Aurora Mobile Unleashes AI That Talks to Tourists Like a Local! - Travel And Tour WorldGNews AI JapanUniversity of Chicago's "self-driving" lab automates experiments in quantum computing research - CBS NewsGoogle News: AIMajority of college students use AI for their coursework, poll finds - upi.comGNews AI USAI Tried Building My Own AI… Here’s What Actually HappenedDEV CommunityShow HN: OpenVole – VoleNet Distributed AI Agent NetworkingHacker News AI TopFilesystem for AI Agents: What I Learned Building OneDEV CommunityGoogle debuts Gemma 4 open AI models for local use - TestingCatalogGNews AI multimodalAI’s Uncertain Cost Effects in Health Care - American Enterprise Institute - AEIGNews AI healthcareWhat happened when they installed ChatGPT on a nuclear supercomputer - OutriderGoogle News: ChatGPTSony's gaming division just bought an AI startup that turns photos into 3D volumesEngadget
AI NEWS HUBbyEIGENVECTOREigenvector

ANVIL: Accelerator-Native Video Interpolation via Codec Motion Vector Priors

arXivMarch 31, 20262 min read0 views
Source Quiz

arXiv:2603.26835v1 Announce Type: cross Abstract: Mobile displays refresh at 90-120 Hz, yet most video is encoded at 24-30 frames per second; real-time frame-rate doubling requires each synthesized frame within 33.3 ms on mobile neural processing units. We show that mainstream flow-based video frame interpolation faces three structural deployment barriers on mobile accelerators: spatial sampling operators exceed the frame budget or lack hardware support, iterative flow refinement collapses under 8-bit post-training quantization, and memory-bound operators dominate the inference graph. ANVIL ad — Shibo Liu

View PDF HTML (experimental)

Abstract:Mobile displays refresh at 90-120 Hz, yet most video is encoded at 24-30 frames per second; real-time frame-rate doubling requires each synthesized frame within 33.3 ms on mobile neural processing units. We show that mainstream flow-based video frame interpolation faces three structural deployment barriers on mobile accelerators: spatial sampling operators exceed the frame budget or lack hardware support, iterative flow refinement collapses under 8-bit post-training quantization, and memory-bound operators dominate the inference graph. ANVIL addresses these barriers by reusing motion vectors already computed by the H.264 decoder to prealign input frames, removing learned optical flow, spatial sampling, and iterative accumulation from the accelerator graph. The remaining residual is refined by a convolution-dominated network whose inference graph is composed almost entirely of compute-bound operators. On a Snapdragon 8 Gen 3 device, ANVIL achieves 12.8 ms 1080p network inference in 8-bit integer precision; an open-source Android player sustains 28.4 ms median end-to-end latency per interpolated frame pair over 54,623 consecutively logged samples during 30-minute continuous playback. Per-operator causal analysis identifies quantized accumulation on recurrent flow states as a key mechanism behind integer quantization failure in iterative methods. The current design targets H.264 playback scenarios with decoder-exposed motion vectors.

Comments: 12 pages, 4 figures, 9 tables

Subjects:

Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV)

ACM classes: I.4.2; I.2.10

Cite as: arXiv:2603.26835 [eess.IV]

(or arXiv:2603.26835v1 [eess.IV] for this version)

https://doi.org/10.48550/arXiv.2603.26835

arXiv-issued DOI via DataCite (pending registration)

Submission history

From: Shibo Liu [view email] [v1] Fri, 27 Mar 2026 05:32:16 UTC (2,437 KB)

Was this article helpful?

Sign in to highlight and annotate this article

AI
Ask AI about this article
Powered by Eigenvector · full article context loaded
Ready

Conversation starters

Ask anything about this article…

Daily AI Digest

Get the top 5 AI stories delivered to your inbox every morning.

Knowledge Map

Knowledge Map
TopicsEntitiesSource
ANVIL: Acce…researchpaperarxivcomputer-vi…image-recog…arXiv

Connected Articles — Knowledge Graph

This article is connected to other articles through shared AI topics and tags.

Knowledge Graph100 articles · 108 connections
Scroll to zoom · drag to pan · click to open

Discussion

Sign in to join the discussion

No comments yet — be the first to share your thoughts!