Live
Black Hat USAAI BusinessBlack Hat AsiaAI BusinessHow Disney Imagineers are using AI and robotics to reshape the company’s theme parksFast Company TechArtemis II: Why our return to the moon took so longFast Company TechPost Quantum Cryptography - ComputerphileComputerphile YTPickNik Robotics gives MoveIt Pro 9.0 enhanced perception-to-motion, teleop capabilitiesRobotics Business ReviewScientists Build Living Robots With Nervous SystemsIEEE RoboticsHow AI agents are changing journalismFast Company TechQ&A: Killian Brackey on Building AI That Holds UpInternational Business TimesWhy AI health chatbots won’t make you better at diagnosing yourself – new research - Gavi, the Vaccine AllianceGoogle News: AIRun OpenCode in Docker - Clean machine, same convenienceDEV CommunityMCP Dev Summit [Day 1] ft. Anthropic, Hugging Face, Open AI & MicrosoftAI YouTube Channel 24Good UI Is Just Invisible EngineeringDEV CommunityFace Tracking for Vertical Video: Why It's Harder Than It Looks (And How It Works)DEV CommunityBlack Hat USAAI BusinessBlack Hat AsiaAI BusinessHow Disney Imagineers are using AI and robotics to reshape the company’s theme parksFast Company TechArtemis II: Why our return to the moon took so longFast Company TechPost Quantum Cryptography - ComputerphileComputerphile YTPickNik Robotics gives MoveIt Pro 9.0 enhanced perception-to-motion, teleop capabilitiesRobotics Business ReviewScientists Build Living Robots With Nervous SystemsIEEE RoboticsHow AI agents are changing journalismFast Company TechQ&A: Killian Brackey on Building AI That Holds UpInternational Business TimesWhy AI health chatbots won’t make you better at diagnosing yourself – new research - Gavi, the Vaccine AllianceGoogle News: AIRun OpenCode in Docker - Clean machine, same convenienceDEV CommunityMCP Dev Summit [Day 1] ft. Anthropic, Hugging Face, Open AI & MicrosoftAI YouTube Channel 24Good UI Is Just Invisible EngineeringDEV CommunityFace Tracking for Vertical Video: Why It's Harder Than It Looks (And How It Works)DEV Community
AI NEWS HUBbyEIGENVECTOREigenvector

LAMP: Language-Assisted Motion Planning for Controllable Video Generation

arXivMarch 31, 20262 min read0 views
Source Quiz

arXiv:2512.03619v3 Announce Type: replace Abstract: Video generation has achieved remarkable progress in visual fidelity and controllability, enabling conditioning on text, layout, or motion. Among these, motion control - specifying object dynamics and camera trajectories - is essential for composing complex, cinematic scenes, yet existing interfaces remain limited. We introduce LAMP that leverages large language models (LLMs) as motion planners to translate natural language descriptions into explicit 3D trajectories for dynamic objects and (relatively defined) cameras. LAMP defines a motion d — Muhammed Burak Kizil, Enes Sanli, Niloy J. Mitra, Erkut Erdem, Aykut Erdem, Duygu Ceylan

View PDF HTML (experimental)

Abstract:Video generation has achieved remarkable progress in visual fidelity and controllability, enabling conditioning on text, layout, or motion. Among these, motion control - specifying object dynamics and camera trajectories - is essential for composing complex, cinematic scenes, yet existing interfaces remain limited. We introduce LAMP that leverages large language models (LLMs) as motion planners to translate natural language descriptions into explicit 3D trajectories for dynamic objects and (relatively defined) cameras. LAMP defines a motion domain-specific language (DSL), inspired by cinematography conventions. By harnessing program synthesis capabilities of LLMs, LAMP generates structured motion programs from natural language, which are deterministically mapped to 3D trajectories. We construct a large-scale procedural dataset pairing natural text descriptions with corresponding motion programs and 3D trajectories. Experiments demonstrate LAMP's improved performance in motion controllability and alignment with user intent compared to state-of-the-art alternatives establishing the first framework for generating both object and camera motions directly from natural language specifications. Code, models and data are available on our project page.

Comments: CVPR 2026. Project Page: this https URL

Subjects:

Computer Vision and Pattern Recognition (cs.CV)

Cite as: arXiv:2512.03619 [cs.CV]

(or arXiv:2512.03619v3 [cs.CV] for this version)

https://doi.org/10.48550/arXiv.2512.03619

arXiv-issued DOI via DataCite

Submission history

From: Muhammed Burak Kızıl [view email] [v1] Wed, 3 Dec 2025 09:51:13 UTC (39,114 KB) [v2] Mon, 8 Dec 2025 11:47:40 UTC (39,114 KB) [v3] Sun, 29 Mar 2026 17:38:57 UTC (39,583 KB)

Was this article helpful?

Sign in to highlight and annotate this article

AI
Ask AI about this article
Powered by Eigenvector · full article context loaded
Ready

Conversation starters

Ask anything about this article…

Daily AI Digest

Get the top 5 AI stories delivered to your inbox every morning.

More about

researchpaperarxiv

Knowledge Map

Knowledge Map
TopicsEntitiesSource
LAMP: Langu…researchpaperarxivcomputer-vi…image-recog…arXiv

Connected Articles — Knowledge Graph

This article is connected to other articles through shared AI topics and tags.

Knowledge Graph100 articles · 191 connections
Scroll to zoom · drag to pan · click to open

Discussion

Sign in to join the discussion

No comments yet — be the first to share your thoughts!

More in Research Papers