
Beyond the Golden Data: Resolving the Motion-Vision Quality Dilemma via Timestep Selective Training

arXiv, March 26, 2026

Authors: Xiangyang Luo, Qingyu Li, Yuming Li


Abstract: Recent video generation models have achieved impressive results, but they rely heavily on high-quality data that combines high visual quality with high motion quality. In this paper, we identify a key challenge in video data curation: the Motion-Vision Quality Dilemma. We find that visual quality and motion intensity are inherently negatively correlated, making it hard to obtain golden data that excels in both aspects. To address this challenge, we first examine the hierarchical learning dynamics of video diffusion models and conduct gradient-based analysis on quality-degraded samples. We discover that quality-imbalanced data can produce gradients similar to those of golden data at appropriate timesteps. Based on this, we introduce the concept of timestep selection in the training process. We propose Timestep-aware Quality Decoupling (TQD), which modifies the data sampling distribution to better match the model's learning process: the sampling distribution is skewed toward higher timesteps for motion-rich data, while data with high visual quality is more likely to be sampled at lower timesteps. Through extensive experiments, we demonstrate that training with TQD exclusively on separated imbalanced data surpasses conventional training on better data, challenging the necessity of perfect data in video generation. Moreover, our method also boosts model performance when trained on high-quality data, showing its effectiveness across different data scenarios.
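The abstract does not specify the sampling distributions TQD uses, so the following is only a minimal sketch of the general idea of conditioning the training timestep on the data type. It assumes a 1000-step diffusion schedule in which higher timesteps are noisier, and it uses Beta distributions with made-up shape parameters; the names `BETA_PARAMS`, `sample_timestep`, and `mean_timestep` are hypothetical and not taken from the paper.

```python
import random

NUM_TIMESTEPS = 1000  # assumed schedule length; higher index = more noise

# Hypothetical Beta(a, b) shape parameters per data type (not from the paper).
BETA_PARAMS = {
    "motion_rich": (3.0, 1.0),  # mass near 1, so mostly high timesteps
    "high_visual": (1.0, 3.0),  # mass near 0, so mostly low timesteps
    "balanced": (1.0, 1.0),     # uniform, the conventional baseline
}


def sample_timestep(data_type, rng):
    """Draw one integer training timestep in [0, NUM_TIMESTEPS - 1]."""
    a, b = BETA_PARAMS[data_type]
    u = rng.betavariate(a, b)  # float in [0, 1]
    # Scale to the schedule and clamp the u == 1.0 edge case.
    return min(int(u * NUM_TIMESTEPS), NUM_TIMESTEPS - 1)


def mean_timestep(data_type, n=10000, seed=0):
    """Average sampled timestep, used here only to show the skew."""
    rng = random.Random(seed)
    return sum(sample_timestep(data_type, rng) for _ in range(n)) / n


if __name__ == "__main__":
    for kind in BETA_PARAMS:
        print(kind, round(mean_timestep(kind)))
```

In a training loop, a sampler like this would replace the usual uniform draw of the timestep for each clip, with the data type coming from whatever quality labels the curation step provides. The Beta family is one convenient choice for a skewed distribution on a bounded range; the paper may use a different form.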

Comments: Accepted to CVPR 2026

Subjects: Computer Vision and Pattern Recognition (cs.CV)

Cite as: arXiv:2603.25527 [cs.CV] (or arXiv:2603.25527v1 [cs.CV] for this version)

https://doi.org/10.48550/arXiv.2603.25527

arXiv-issued DOI via DataCite (pending registration)

Submission history

From: Xiangyang Luo
[v1] Thu, 26 Mar 2026 14:59:57 UTC (7,381 KB)
