Research Papers research paper arxiv computer-vision image-recognition

MegaFlow: Zero-Shot Large Displacement Optical Flow

arXivMarch 26, 202610 min read0 views

Accurate estimation of large displacement optical flow remains a critical challenge. Existing methods typically rely on iterative local search or/and domain-specific fine-tuning, which severely limits their performance in large displacement and zero-shot generalization scenarios. To overcome this, we introduce MegaFlow, a simple yet powerful model for zero-shot large displacement optical flow. Rather than relying on highly complex, task-specific architectural designs, MegaFlow adapts powerful pre-trained vision priors to produce temporally consistent motion fields. In particular, we formulate — Dingxi Zhang, Fangjinhua Wang, Marc Pollefeys

View PDF HTML (experimental)

Abstract:Accurate estimation of large displacement optical flow remains a critical challenge. Existing methods typically rely on iterative local search or/and domain-specific fine-tuning, which severely limits their performance in large displacement and zero-shot generalization scenarios. To overcome this, we introduce MegaFlow, a simple yet powerful model for zero-shot large displacement optical flow. Rather than relying on highly complex, task-specific architectural designs, MegaFlow adapts powerful pre-trained vision priors to produce temporally consistent motion fields. In particular, we formulate flow estimation as a global matching problem by leveraging pre-trained global Vision Transformer features, which naturally capture large displacements. This is followed by a few lightweight iterative refinements to further improve the sub-pixel accuracy. Extensive experiments demonstrate that MegaFlow achieves state-of-the-art zero-shot performance across multiple optical flow benchmarks. Moreover, our model also delivers highly competitive zero-shot performance on long-range point tracking benchmarks, demonstrating its robust transferability and suggesting a unified paradigm for generalizable motion estimation. Our project page is at: this https URL.

Comments: Project Page: this https URL Code: this https URL

Subjects:

Computer Vision and Pattern Recognition (cs.CV)

Cite as: arXiv:2603.25739 [cs.CV]

(or arXiv:2603.25739v1 [cs.CV] for this version)

https://doi.org/10.48550/arXiv.2603.25739

arXiv-issued DOI via DataCite (pending registration)

Submission history

From: Dingxi Zhang [view email] [v1] Thu, 26 Mar 2026 17:59:51 UTC (8,972 KB)

Original source

arXiv

https://arxiv.org/abs/2603.25739v1

Was this article helpful?

Ask AI about this article

Ready

Conversation starters

Ask anything about this article…

Daily AI Digest

Get the top 5 AI stories delivered to your inbox every morning.

More about

researchpaperarxiv

Research PapersLive

I ran by instinct for years. Then I built an AI running coach.

How a 50 km trail race, a broken ChatGPT workflow, and 60+ research papers led me to create Coach Leo. Continue reading on Medium »

Medium AI

1mabout 1 hour ago

ModelsRecent

Exclusive | Caltech Researchers Claim Radical Compression of High-Fidelity AI Models - WSJ

Exclusive | Caltech Researchers Claim Radical Compression of High-Fidelity AI Models WSJ

Google News: LLM

1m1 day ago

ModelsFresh

How AI Fails: An Interactive Pedagogical Tool for Demonstrating Dialectal Bias in Automated Toxicity Models

arXiv:2511.06676v2 Announce Type: replace-cross Abstract: Now that AI-driven moderation has become pervasive in everyday life, we often hear claims that "the AI is biased". While this is often said jokingly, the light-hearted remark reflects a deeper concern. How can we be certain that an online post flagged as "inappropriate" was not simply the victim of a biased algorithm? This paper investigates this problem using a dual approach. First, I conduct a quantitative benchmark of a widely used toxicity model (unitary/toxic-bert) to measure performance disparity between text in African-American English (AAE) and Standard American English (SAE). The benchmark reveals a clear, systematic bias: on average, the model scores AAE text as 1.8 times more toxic and 8.8 times higher for "identity hate"

arXiv cs.HC

1mabout 7 hours ago

Knowledge Map

TopicsEntitiesSource

Connected Articles — Knowledge Graph

This article is connected to other articles through shared AI topics and tags.

Knowledge Graph100 articles · 169 connections

Scroll to zoom · drag to pan · click to open

Discussion

No comments yet — be the first to share your thoughts!

MegaFlow: Zero-Shot Large Displacement Optical Flow

Submission history

Daily AI Digest

More about

I ran by instinct for years. Then I built an AI running coach.

Exclusive | Caltech Researchers Claim Radical Compression of High-Fidelity AI Models - WSJ

How AI Fails: An Interactive Pedagogical Tool for Demonstrating Dialectal Bias in Automated Toxicity Models

Knowledge Map

Connected Articles — Knowledge Graph

Discussion

More in Research Papers

I ran by instinct for years. Then I built an AI running coach.

“It's not about gatekeeping."

Adversaries have under-protected APIs in their sights

Exclusive | OpenAI’s Former Research Chief Aims to Automate Manufacturing With AI - WSJ