Live
Black Hat USADark ReadingBlack Hat AsiaAI BusinessAnthropic is learning that there are no take-backs on the internetBusiness InsiderOpenClaw launches an official China mirror, with ByteDance providing the servers to host the Chinese-language service, as OpenClaw explodes in the country (Juro Osawa/The Information)TechmemeArtificial Intelligence in Process Control - The Chemical EngineerGoogle News: AIOpenAI doesn’t just want to answer your questions — it wants to run your digital life - TechRadarGoogle News: OpenAIWhy Nvidia just poured $2 billion into AI ASIC competitor Marvell — NVLink Fusion turns into soft ecosystem lock-intomshardware.comIs AI the new “Manhattan Project”? Vox went to Los Alamos to find out. - VoxGoogle News: ChatGPT'Users Should Own Their AI Agents, Not Rent Them' — Valory CEO David Minarsch Explains the Future of AI Control - CCN.comGoogle News: Generative AIBest Video Conferencing Solution for Enterprises in 2026Dev.to AIFunctional Testing vs Reality: What Actually Breaks in ProductionDev.to AIGenerative AI In Manufacturing Market to hit USD 10,540.1 Million by 2033 - vocal.mediaGoogle News: Generative AISources: Chinese optics company and Nvidia supplier Innolight confidentially filed for a Hong Kong IPO that could raise $3B+; Innolight is listed in Shenzhen (Bloomberg)TechmemeData Observability 2.0: The Backbone of Trusted Enterprise AnalyticsDev.to AIBlack Hat USADark ReadingBlack Hat AsiaAI BusinessAnthropic is learning that there are no take-backs on the internetBusiness InsiderOpenClaw launches an official China mirror, with ByteDance providing the servers to host the Chinese-language service, as OpenClaw explodes in the country (Juro Osawa/The Information)TechmemeArtificial Intelligence in Process Control - The Chemical EngineerGoogle News: AIOpenAI doesn’t just want to answer your questions — it wants to run your digital life - TechRadarGoogle News: OpenAIWhy Nvidia just poured $2 billion into AI ASIC competitor Marvell — NVLink Fusion turns into soft ecosystem lock-intomshardware.comIs AI the new “Manhattan Project”? Vox went to Los Alamos to find out. - VoxGoogle News: ChatGPT'Users Should Own Their AI Agents, Not Rent Them' — Valory CEO David Minarsch Explains the Future of AI Control - CCN.comGoogle News: Generative AIBest Video Conferencing Solution for Enterprises in 2026Dev.to AIFunctional Testing vs Reality: What Actually Breaks in ProductionDev.to AIGenerative AI In Manufacturing Market to hit USD 10,540.1 Million by 2033 - vocal.mediaGoogle News: Generative AISources: Chinese optics company and Nvidia supplier Innolight confidentially filed for a Hong Kong IPO that could raise $3B+; Innolight is listed in Shenzhen (Bloomberg)TechmemeData Observability 2.0: The Backbone of Trusted Enterprise AnalyticsDev.to AI
AI NEWS HUBbyEIGENVECTOREigenvector

MegaFlow: Zero-Shot Large Displacement Optical Flow

arXivMarch 26, 202610 min read0 views
Source Quiz

Accurate estimation of large displacement optical flow remains a critical challenge. Existing methods typically rely on iterative local search or/and domain-specific fine-tuning, which severely limits their performance in large displacement and zero-shot generalization scenarios. To overcome this, we introduce MegaFlow, a simple yet powerful model for zero-shot large displacement optical flow. Rather than relying on highly complex, task-specific architectural designs, MegaFlow adapts powerful pre-trained vision priors to produce temporally consistent motion fields. In particular, we formulate — Dingxi Zhang, Fangjinhua Wang, Marc Pollefeys

View PDF HTML (experimental)

Abstract:Accurate estimation of large displacement optical flow remains a critical challenge. Existing methods typically rely on iterative local search or/and domain-specific fine-tuning, which severely limits their performance in large displacement and zero-shot generalization scenarios. To overcome this, we introduce MegaFlow, a simple yet powerful model for zero-shot large displacement optical flow. Rather than relying on highly complex, task-specific architectural designs, MegaFlow adapts powerful pre-trained vision priors to produce temporally consistent motion fields. In particular, we formulate flow estimation as a global matching problem by leveraging pre-trained global Vision Transformer features, which naturally capture large displacements. This is followed by a few lightweight iterative refinements to further improve the sub-pixel accuracy. Extensive experiments demonstrate that MegaFlow achieves state-of-the-art zero-shot performance across multiple optical flow benchmarks. Moreover, our model also delivers highly competitive zero-shot performance on long-range point tracking benchmarks, demonstrating its robust transferability and suggesting a unified paradigm for generalizable motion estimation. Our project page is at: this https URL.

Comments: Project Page: this https URL Code: this https URL

Subjects:

Computer Vision and Pattern Recognition (cs.CV)

Cite as: arXiv:2603.25739 [cs.CV]

(or arXiv:2603.25739v1 [cs.CV] for this version)

https://doi.org/10.48550/arXiv.2603.25739

arXiv-issued DOI via DataCite (pending registration)

Submission history

From: Dingxi Zhang [view email] [v1] Thu, 26 Mar 2026 17:59:51 UTC (8,972 KB)

Was this article helpful?

Sign in to highlight and annotate this article

AI
Ask AI about this article
Powered by Eigenvector · full article context loaded
Ready

Conversation starters

Ask anything about this article…

Daily AI Digest

Get the top 5 AI stories delivered to your inbox every morning.

Knowledge Map

Knowledge Map
TopicsEntitiesSource
MegaFlow: Z…researchpaperarxivcomputer-vi…image-recog…arXiv

Connected Articles — Knowledge Graph

This article is connected to other articles through shared AI topics and tags.

Knowledge Graph100 articles · 169 connections
Scroll to zoom · drag to pan · click to open

Discussion

Sign in to join the discussion

No comments yet — be the first to share your thoughts!