Wan 2.7 now available on Together AI
A four-model video suite for generation, continuation, reference-driven workflows, and editing, rolling out on Together AI starting with text-to-video.
Summary
- Four-model suite: Wan 2.7 brings video generation, continuation, and editing to Together AI, starting today with text-to-video and expanding soon to image-to-video, reference-to-video, and video edit.
- Tighter creative control: Drive generation with optional audio, frame-level conditioning, reference inputs, and continuation, so fewer steps are split across separate tools.
- Available on Together AI: Wan 2.7 runs on Together AI, the AI Native Cloud, with the same fast, reliable APIs, authentication, and billing surface developers already use across the rest of their multimodal stack.
- Simple pricing: Available starting at $0.10 per second of generated video through the Together AI API.
AI video is easy to generate and hard to steer. A team can get a promising clip from a prompt, but continuing it, matching a reference, or revising it without starting over usually means leaving the model that made it and patching the rest together somewhere else. The more control a project needs, the more the workflow turns into re-renders, handoffs, and manual cleanup. That is the gap Wan 2.7 is built to close across generation, continuation, reference-driven workflows, and editing.
On Together AI, that expanded control surface becomes one platform instead of another disconnected toolchain. Wan 2.7 comes to Together AI, the AI Native Cloud, as a four-model suite rolling out from text-to-video into image-to-video, reference-to-video, and video edit. That gives teams a clearer path from first generation to continuation, reference-driven control, and revision through the same APIs, authentication, and billing surface they already use across the rest of their multimodal stack.
Text-to-video available now
Wan 2.7 Text-to-Video (Wan-AI/wan2.7-t2v) is available today on Together AI. It provides a stronger starting point for campaign content, product videos, and creative prototyping than a plain prompt-to-video surface by supporting:
- Flexible resolution: 720P and 1080P generation.
- Duration control: Video outputs ranging from 2 to 15 seconds.
- Audio support: Optional audio input to drive the generation.
- Prompt-driven direction: Multi-shot narrative control directly through prompt language.
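The limits listed above can be checked on the client before a job is submitted. The following is a minimal sketch, not part of the Together SDK: `build_t2v_request` is a hypothetical helper, the field names mirror the ones used in the request example later in this post, and it is an assumption that the API enforces exactly these bounds.

```python
# Hypothetical client-side check of the Text-to-Video limits stated in this post.
ALLOWED_RESOLUTIONS = ("720P", "1080P")  # resolutions listed in the announcement
MIN_SECONDS = 2                          # shortest clip listed in the announcement
MAX_SECONDS = 15                         # longest clip listed in the announcement


def build_t2v_request(prompt: str, resolution: str = "720P", seconds: int = 5) -> dict:
    """Validate the inputs and return keyword arguments for a T2V request."""
    if not prompt.strip():
        raise ValueError("prompt must not be empty")
    if resolution not in ALLOWED_RESOLUTIONS:
        raise ValueError(f"resolution must be one of {ALLOWED_RESOLUTIONS}")
    if not MIN_SECONDS <= seconds <= MAX_SECONDS:
        raise ValueError(f"seconds must be between {MIN_SECONDS} and {MAX_SECONDS}")
    return {
        "model": "Wan-AI/wan2.7-t2v",
        "prompt": prompt,
        "resolution": resolution,
        # The request example in this post passes the duration as a string.
        "seconds": str(seconds),
    }
```

The returned dictionary could then be unpacked into the create call, for example `client.videos.create(**build_t2v_request("A snowy train at dusk", "1080P", 5))`.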
Sample Wan 2.7 clips:
- Epic snowy train shot with cinematic scale (5s).
- Sci-fi astronaut corridor with red emergency lighting (4s).
- Luxury rooftop fashion film with flowing motion (4s).
- Photoreal hummingbird macro with rainforest wonder (5s).
Image-to-video and reference-to-video coming soon
As the rest of the Wan 2.7 suite rolls out, developers will get more control over how video is driven and structured.
Image-to-Video (Wan-AI/wan2.7-i2v) supports:
- Advanced workflows: First-frame, first-and-last-frame, and continuation generation.
- Audio-video sync: Synchronized generation driven by audio inputs.
- Storyboarding: 3x3 grid-to-video generation workflows.
- Flexible outputs: 720P or 1080P generation up to 15 seconds.
Reference-to-Video (Wan-AI/wan2.7-r2v) supports:
- Reference inputs: Driven by reference image and reference video inputs.
- Complex scenes: Single-shot and multi-shot workflows, plus multi-character interactions.
- Outputs: 720P and 1080P generation up to 10 seconds.
Video edit coming soon
Wan 2.7 Video Edit (Wan-AI/wan2.7-edit) gives teams a more direct way to modify footage without bouncing into separate editing systems for every pass. It extends the suite with:
- Instruction & reference editing: Modify footage via text instructions or reference image-based editing.
- Style transfer: Apply video style transfer to existing clips.
- Temporal feature transfer: Clone motion, camera work, effects, and style from source media.
Instead of splitting those jobs across separate tools, they stay inside one coordinated workflow, which reduces handoffs and makes iteration easier to manage.
Try it now
The Wan 2.7 Text-to-Video model is available today on Together AI Serverless Inference starting at $0.10 per second of generated video through the endpoint Wan-AI/wan2.7-t2v.
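Because billing is per second of generated video, a rough budget is simple arithmetic. The sketch below assumes the $0.10 starting rate applies to every clip; the actual rate may differ by resolution or plan, and `estimate_cost` is a hypothetical helper rather than an SDK function.

```python
# Rough cost estimate using the "starting at" price quoted in this post.
PRICE_PER_SECOND_USD = 0.10  # assumption: the starting rate applies to all clips


def estimate_cost(seconds: int, clips: int = 1,
                  price_per_second: float = PRICE_PER_SECOND_USD) -> float:
    """Return the estimated cost in USD for `clips` videos of `seconds` each."""
    if seconds <= 0 or clips <= 0:
        raise ValueError("seconds and clips must be positive")
    # Round to cents to hide floating-point noise.
    return round(seconds * clips * price_per_second, 2)
```

Under that assumption, one 5-second clip comes to about $0.50, and a batch of twenty 15-second clips to about $30.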
If you are already using Together AI for text or image inference, adding video generation works the same way:
- Same authentication
- Same SDKs
- Same billing dashboard
- Transparent per-model pricing
Check out the Wan 2.7 Quickstart for full parameters (like audio inputs and resolution control), or copy the polling loop below to get started immediately:
    import time
    from together import Together

    client = Together()

    # Submit the text-to-video job
    job = client.videos.create(
        model="Wan-AI/wan2.7-t2v",
        prompt="A cinematic product video of a running shoe on wet pavement, smooth camera arc, dramatic reflections",
        resolution="1080P",
        ratio="16:9",
        seconds="5",
    )
    print(f"Job ID: {job.id}")

    # Poll until the video is completed
    while True:
        status = client.videos.retrieve(job.id)
        if status.status == "completed":
            print(f"Video URL: {status.outputs.video_url}")
            break
        elif status.status == "failed":
            print(f"Error: {status.error}")
            break
        time.sleep(5)
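For longer-running scripts, it can help to bound the wait instead of looping forever. The following is a sketch of one way to wrap the polling step with a timeout. `wait_for_video` is a hypothetical helper, not an SDK function; it relies only on the `"completed"` and `"failed"` status values shown in this post, and the default interval and timeout are arbitrary assumptions.

```python
import time
from typing import Any, Callable


def wait_for_video(retrieve: Callable[[str], Any], job_id: str,
                   interval: float = 5.0, timeout: float = 600.0,
                   sleep: Callable[[float], None] = time.sleep) -> Any:
    """Call retrieve(job_id) until the job completes, fails, or times out."""
    deadline = time.monotonic() + timeout
    while True:
        job = retrieve(job_id)
        if job.status == "completed":
            return job
        if job.status == "failed":
            # Surface the error field if the job object carries one.
            raise RuntimeError(f"Video job {job_id} failed: {getattr(job, 'error', None)}")
        if time.monotonic() >= deadline:
            raise TimeoutError(f"Video job {job_id} not finished after {timeout} seconds")
        sleep(interval)
```

With the client from this post, the call would look like `done = wait_for_video(client.videos.retrieve, job.id)`. Passing the retrieve function in as an argument keeps the helper easy to test without network access.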
Production deployment
Start with serverless endpoints for development and testing.
On Together AI, teams can move from experimentation to production on the same platform, with volume pricing and enterprise deployment options when they are ready for more control over production workloads.
Get started
→ Try Wan 2.7 T2V in the Playground
→ Read the Wan 2.7 T2V Quickstart
→ Read the Video Docs
→ Contact Sales for volume pricing and enterprise deployment