New

Generate Stunning Video Scenes with AI

Kling O1 lets you create, edit and extend video scenes with a conversational workflow, bringing your imagination to life.

Unified Multi-Modal Model
Consistent Characters
Conversational Editing
Kling O1 Video Scene Generation Interface

Unleash Your Creative Vision

Kling O1’s unified video engine empowers you with unprecedented control and consistency in video scene generation.

Consistent Characters

Consistent Characters

Maintain character consistency across multiple shots, even with changing camera angles. Use up to 5 reference images to ensure consistent faces, clothing and props.

Conversational Scene Editing

Conversational Scene Editing

Edit scenes with natural language. Remove unwanted elements, change the time of day, or apply stylistic changes without manual masking or keyframing.

Advanced Shot Extension

Advanced Shot Extension

Seamlessly generate previous or next shots based on existing clips. Effortlessly transfer camera motion for dynamic and engaging scenes.

How It Works

Generate incredible video scenes in three simple steps.

1

Provide a Reference

Upload reference images, videos, or text prompts to define the scene's content and style.

2

Describe the Action

Use natural language to describe the desired action, camera movement, and any scene-specific details.

3

Generate and Refine

Kling O1 automatically generates a video scene based on your input. Refine using conversational commands.

Frequently Asked Questions

Everything you need to know about video scene generation with Kling O1.

Kling O1 supports text prompts, image references, video references, and subject-based references.
Kling O1 uses a universal reference system with up to 5 reference images to build 'subjects' and maintain consistent faces, clothing, and props.
You can perform pixel-level semantic reconstruction using natural language commands like 'remove the passerby', 'change daytime to dusk', or 'change it to pixel art style'.
Kling O1 supports 3-10 seconds of video generation per clip, enabling fine-grained control over scene rhythm and narrative pacing. It can generate up to 2 minutes of continuous video with synchronized audio.

Ready to Reimagine Video Scene Generation?

Sign up and start creating stunning video scenes today!