
AI video is having its Stable Diffusion moment

Replicate Blog · December 16, 2024

There are lots of models that are as good as OpenAI's Sora now.

Posted December 16, 2024 by fofr

AI video used to not be very good:

Will Smith eating spaghetti, u/chaindrop, March 2023

Then, 10 months later, OpenAI announced Sora:

Creating video from text, OpenAI, February 2024

Sora reset expectations about what a video model could be. The output was high resolution, smooth, and coherent. The examples looked like real video. It felt like we’d jumped into the future.

The problem was, nobody could use it! It was just a preview.

This was like when OpenAI announced the DALL-E image generation model back in 2021. It was one of the most extraordinary pieces of software that had been seen for years, but nobody could use it.

This created pent-up demand that eventually led to Stable Diffusion, which we wrote about last year.

Now the same thing is happening with video. Sora made everyone realize what is possible.

There are lots of models that are as good as Sora now

Some are high-quality, some are fast, some focus on realism, and others focus on style and creativity.

Some are open source, and the community is modifying, optimizing, and building upon them. You can fine-tune them with new styles, objects and characters, and more.

| Model | ELO score | Speed | Duration | Resolution | Open source |
|---|---|---|---|---|---|
| OpenAI Sora | 1147 | 40s | 5s | 720p | No |
| Minimax Video-01 | 1101 | 3min | 5s | 720p | No |
| Tencent Hunyuan Video | 1071 | 8min | 5s | 720p | Yes |
| Genmo Mochi 1 | 1064 | 4min | 5s | 848 × 480 | Yes |
| Runway Gen3 | 1048 | 20s | 5s | 720p | No |
| Haiper 2.0 | 1037 | 5min | 4 or 6s | 720p | No |
| Luma Ray | 1029 | 40s | 5s | 720p | No |
| Lightricks LTX-Video | 680 | 10s | 3s | 864 × 480 | Yes |

ELO ratings are from Artificial Analysis. Speed and duration are based on generation times for a five second 720p video, unless otherwise specified.
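For intuition about what an ELO gap means, ratings map to head-to-head win probabilities via the standard Elo expected-score formula. This is a general property of Elo ratings, not anything specific to these leaderboards; a minimal sketch:

```python
def elo_win_probability(rating_a: float, rating_b: float) -> float:
    """Expected probability that model A's output beats model B's
    in a pairwise comparison, per the standard Elo formula."""
    return 1.0 / (1.0 + 10 ** ((rating_b - rating_a) / 400))

# Sora (1147) vs. Minimax Video-01 (1101): a modest edge, not a landslide
print(round(elo_win_probability(1147, 1101), 2))  # → 0.57
```

So a ~50 point gap at the top of the table means the leader wins a head-to-head vote only slightly more than half the time.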

Most of these models are on Replicate. You can try them out in your browser and build with them using APIs. Here are the ones you should try:

Minimax Video-01

View Minimax Video-01 on Replicate

Video-01 (also known as Hailuo) is the best at realism and coherency. It is, in many ways, Sora quality. It’s just as smooth, the subjects are coherent, and it’s high resolution. It handles out-of-distribution subjects well. It doesn’t have all the features that Sora has, though.

You can generate five second 720p videos with it, using a text description or an image as the starting frame. It is closed-source, and takes about three minutes to generate.

Run it on Replicate: minimax/video-01

Tencent Hunyuan Video

View Tencent Hunyuan Video on Replicate

HunyuanVideo is up there with Sora and Minimax’s Video-01, and it’s open-source!

Because it’s open source, you can do almost anything with it: you can fine-tune it, the community has built video-to-video workflows, and it’s much more configurable (resolution, duration, steps, guidance scale, and lots more). It can make five second 720p videos, as well as smaller, faster 540p ones. You can reduce the steps and resolution to try different things quickly.

The downside is it’s slower than Video-01, but we’re working on making it faster. We’ll open source the optimizations, of course.

Run it on Replicate: tencent/hunyuan-video

Luma Ray

View Luma Ray on Replicate

Luma Ray (also known as Dream Machine) is not as realistic as Minimax Video-01 or Hunyuan Video, but it’s much faster and more creative. Released in June, it was one of the first of this new generation of capable video models.

It takes 40 seconds to generate a 5 second video at 720p resolution. It’s got more tools for controlling the output than some of the other models:

  • Start and end frames

  • Interpolation between start and end videos

  • Looped videos

Ray 2 is coming soon.

Run it on Replicate: luma/ray

Haiper 2.0

View Haiper 2.0 on Replicate

Haiper 2.0 was released in October. It can generate four and six second 720p videos. Six second videos take about five minutes to generate. You can use text or images to generate videos at a variety of aspect ratios.

A 4K version is coming soon.

Run it on Replicate: haiper-ai/haiper-video-2

Genmo Mochi 1

View Genmo Mochi 1 on Replicate

Mochi 1 was the first high-quality open-source video model to be released. At launch it needed 4× H100s to run, but the community swiftly optimized it to run on a single 4090.

Run it on Replicate: genmoai/mochi-1

You can also fine-tune Mochi 1 on Replicate. Use genmoai/mochi-1-lora-trainer to train it and genmoai/mochi-1-lora to run your trained models.
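A hedged sketch of what kicking off a fine-tune might look like with the trainings API in Replicate's Python client. The trainer name comes from this post, but the input field names (`input_videos`, `trigger_word`, `steps`), the version string, and the `destination` below are all hypothetical; the real schema is on the trainer's Replicate page, and trainings are pinned to an exact model version.

```python
import os

def build_lora_training_input(videos_url: str, trigger_word: str,
                              steps: int = 1000) -> dict:
    """Assemble a training payload. All field names here are hypothetical;
    consult genmoai/mochi-1-lora-trainer's schema on Replicate."""
    return {
        "input_videos": videos_url,    # e.g. a zip of short clips in your style
        "trigger_word": trigger_word,  # token that activates the LoRA at inference
        "steps": steps,
    }

if os.environ.get("REPLICATE_API_TOKEN"):
    import replicate
    training = replicate.trainings.create(
        # In practice you pin an exact trainer version; this is a placeholder.
        version="genmoai/mochi-1-lora-trainer",
        input=build_lora_training_input("https://example.com/clips.zip", "MYSTYLE"),
        destination="your-username/mochi-mystyle",  # hypothetical destination model
    )
    print(training.status)
```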

Lightricks LTX-Video

View Lightricks LTX-Video on Replicate

LTX-Video is a low-memory open-source video model. It’s so fast: it makes three second videos in just 10 seconds on an L40S GPU (compared with minutes on an H100 for other models).

While it’s super fast, you should expect the quality to be lower than other models.

Run it on Replicate: lightricks/ltx-video

There’s more

There are a few more excellent models that aren’t on Replicate yet:

  • Kling AI

  • OpenAI Sora

  • Pika 2.0 with powerful “scene ingredients” feature

  • Runway Gen3

And of course, we’re all still waiting for Black Forest Labs (creators of FLUX) to release their hotly anticipated video model.

Follow us on X to stay up to speed.
