ByteDance's next-generation AI video model with the revolutionary @-reference system. Combine text, images, video clips, and audio in a single prompt. Native audio-video synchronization, V2V editing, and up to 2K resolution at 30fps — all in one unified generation.
Seedance 2.0 is ByteDance's most advanced AI video generation model, unveiled in February 2026. It adopts a unified multimodal audio-video joint-generation architecture that accepts four input modalities simultaneously: text, up to 9 images, up to 3 video clips, and up to 3 audio tracks. The groundbreaking @-reference system lets you tag specific elements in your prompt and bind them to uploaded references for granular control over camera movement, character appearance, audio rhythm, and visual style. Outputs reach up to 2K resolution with native synchronized audio, including multilingual lip-sync, sound effects, and background music.
Revolutionary reference tagging using @Image, @Video, and @Audio labels in your prompt. Bind specific elements to uploaded files for precise control over camera movement, character actions, audio rhythm, and visual style.
Combine text, up to 9 images, up to 3 video clips, and up to 3 audio tracks in a single generation request. Seedance 2.0 is the first model to process all four input types simultaneously.
Joint audio-video synthesis produces lip-sync dialogue, sound effects, and background music synchronized with the visual output. Supports multilingual lip-sync with phoneme-level precision.
Edit existing videos through reference-to-video mode. Transfer motion patterns, camera paths, and pacing from uploaded clips. Change outfits, modify actions, or replace elements while preserving the original structure.
Native 2K (2048x1080) output at 30fps, with additional quality tiers at 480p, 720p, and 1080p. Video duration ranges from 4 to 15 seconds per generation.
Upload multiple reference images of the same character from different angles. Seedance 2.0 maintains consistent faces, clothing, body proportions, and accessories across multiple generated clips.
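The per-modality limits above (text plus up to 9 images, 3 video clips, and 3 audio tracks) can be checked client-side before submitting a job. Seedance 2.0's actual API is not documented on this page, so the payload shape and field names below are purely illustrative assumptions; only the modality limits come from the description above.

```python
# Hypothetical sketch of assembling a Seedance 2.0 multimodal request.
# The payload structure is an assumption; only the limits (<= 9 images,
# <= 3 video clips, <= 3 audio tracks) come from the feature description.

MAX_IMAGES, MAX_VIDEOS, MAX_AUDIO = 9, 3, 3

def build_request(prompt, images=(), videos=(), audio=()):
    """Validate modality limits and return an illustrative payload dict."""
    if len(images) > MAX_IMAGES:
        raise ValueError(f"at most {MAX_IMAGES} reference images allowed")
    if len(videos) > MAX_VIDEOS:
        raise ValueError(f"at most {MAX_VIDEOS} reference video clips allowed")
    if len(audio) > MAX_AUDIO:
        raise ValueError(f"at most {MAX_AUDIO} reference audio tracks allowed")
    return {
        "prompt": prompt,
        # References are indexed from 1 to match @Image1, @Video1, @Audio1 tags.
        "images": {f"Image{i}": path for i, path in enumerate(images, 1)},
        "videos": {f"Video{i}": path for i, path in enumerate(videos, 1)},
        "audio": {f"Audio{i}": path for i, path in enumerate(audio, 1)},
    }

req = build_request(
    "@Image1 walks through @Image2 with camera movement from @Video1",
    images=["hero.png", "alley.png"],
    videos=["dolly_shot.mp4"],
)
```

Indexing the uploads from 1 keeps the keys aligned with the @-tags used in the prompt, so each tag resolves to exactly one uploaded file.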
Explore Seedance 2.0's capabilities in multimodal reference control, native audio generation, and video editing

“@Image1 walks through @Image2 with camera movement from @Video1 and background music from @Audio1”
Multi-reference prompt combining all modalities

“@Image1 character dances with rhythm from @Audio1 in @Image3 environment”
Character motion guided by audio beat reference

“A person giving a presentation with synchronized English speech and slide transitions”
Lip-sync dialogue with visual content

“Cooking tutorial with step-by-step narration and ambient kitchen sounds”
Narration synchronized with cooking actions
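One way to see how prompts like the ones above bind to uploads is to extract the @-tags before submission, so each tag can be matched against a reference file. The tag pattern (@Image, @Video, or @Audio followed by an index) is inferred from the example prompts; the helper itself is an illustrative sketch, not an official Seedance 2.0 parser.

```python
import re

# Illustrative sketch: pull @-reference tags (e.g. @Image1, @Video1,
# @Audio1) out of a prompt. The tag grammar is inferred from the
# example prompts shown above.

TAG_RE = re.compile(r"@(Image|Video|Audio)(\d+)")

def extract_references(prompt):
    """Return @-reference tags in order of first appearance, deduplicated."""
    seen, tags = set(), []
    for kind, idx in TAG_RE.findall(prompt):
        tag = f"{kind}{idx}"
        if tag not in seen:
            seen.add(tag)
            tags.append(tag)
    return tags

refs = extract_references(
    "@Image1 walks through @Image2 with camera movement from @Video1 "
    "and background music from @Audio1"
)
```

Running the check client-side makes it easy to warn the user when a prompt mentions a tag with no corresponding upload.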
Alex Kim: “The @-reference system is genuinely revolutionary. I can extract camera movements from a reference clip and apply them instantly — it's a completely new creative workflow.”
Priya Sharma: “Native audio sync saves hours of post-production. The lip-sync quality is surprisingly precise even with non-English dialogue.”
Lucas Müller: “V2V editing lets me enhance existing footage without reshooting. Seedance 2.0 is now a core tool in our production pipeline.”
Yuki Tanaka: “The 4-modality input is a game-changer. I can bring a character design, a camera movement reference, and background music all into one prompt and get exactly what I envisioned.”
Experience Seedance 2.0 — the most advanced video generator from ByteDance, free online
10,000+ users