Seedance 2.0
ByteDance's flagship AI video model — #1 on Artificial Analysis Video Arena across all categories. 2K resolution, stereo audio, quad-modal input.
What's New in 2.0
Every major dimension upgraded — from resolution and duration to a fundamentally new multimodal architecture.
| FEATURE | 1.5 PRO | 2.0 |
|---|---|---|
| Max Resolution | 1080p | 2K |
| Max Duration | 12s | 15s |
| Audio | Basic sync | Stereo spatial |
| Image Refs | Limited | Up to 9 |
| Video Input | None | Up to 3 clips |
| Audio Input | None | Up to 3 files |
| Motion Quality | Baseline | 2x better |
| Speed | Baseline | 30% faster |
Seedance 2.0 introduces a unified Dual-Branch Diffusion Transformer that generates video and audio in a single pass — unlike 1.5 Pro's separate processing pipelines. This architectural shift enables stereo spatial audio, physics-aware motion, and seamless multi-shot narratives.
Key Features
Quad-Modal Input
Combine text, images, video clips, and audio files in a single generation. The @ Reference System lets you tag up to 12 files for precise creative control.
Stereo Spatial Audio
Native dual-channel audio with spatial positioning. Material-specific sounds, environmental acoustics, and phoneme-level lip sync in 8+ languages.
Director-Level Camera
Control dolly zooms, rack focuses, tracking shots, POV switches, Hitchcock zooms, and orbit movements through natural language or reference video.
Physics-Aware Motion
Training penalizes impossible motion for realistic weight transfer, momentum conservation, and natural body dynamics — even in complex choreography.
Multi-Shot Narratives
Generate coherent multi-shot sequences with natural cuts and transitions. Use the "lens switch" keyword to signal cuts while preserving character continuity.
Video Editing & Extension
Make targeted modifications to existing clips without full regeneration. Extend videos forward or backward, replace characters, and add or remove elements.
#1 Across All Categories
Seedance 2.0 leads the Artificial Analysis Video Arena — the industry's most comprehensive human-preference leaderboard for AI video models.
Prompt adherence and visual fidelity from text descriptions
Video quality with native audio co-generation
Animation quality from reference image input
Image animation with synchronized audio
success rate
generation time
1.5 Pro
Technical Specs
| Developer | ByteDance (Seed Team) |
| Release Date | February 10, 2026 |
| Architecture | Dual-Branch Diffusion Transformer |
| Max Resolution | 2K |
| Max Duration | 4-15 seconds |
| Frame Rate | 24 fps |
| Aspect Ratios | 16:9, 9:16, 1:1, 4:3, 3:4, 21:9 |
| Audio | Dual-channel stereo, spatial positioning |
| Lip Sync | 8+ languages (EN, ZH, JA, KO, ES, FR, DE, PT) |
| Image Inputs | Up to 9 per generation |
| Video Inputs | Up to 3 clips (max 15s each) |
| Audio Inputs | Up to 3 files (max 15s, MP3/WAV) |
| Total References | Up to 12 files simultaneously |
| Generation Time | ~60s standard, ~10min for complex |
| Success Rate | 90%+ on first attempt |
Frequently Asked Questions
Start Creating with Seedance 1.5 Pro
While Seedance 2.0 integration is on its way, generate stunning videos today with our current models.
Try Seedance 1.5 Pro