The Future of Video Creation is Here: Multi-Modal Video Generation
Generate videos from text, images, and more, all within a unified multi-modal video engine. Experience unmatched creative flexibility with Kling O1.

Unleash Your Creativity with Our Multi-Modal Video Generator
Kling O1's unified video foundation model allows you to create videos in ways you never thought possible.

Unified Multi-Modal Video Engine
Generate videos from text, images, subjects, and more, all within one unified semantic space. No more switching between separate tools and plugins. Powered by Kuaishou's innovative Kling O1.

Conversational Video Editing
Edit your videos using natural language commands. Remove unwanted objects, change the time of day, or apply different styles without manual masking or keyframing. Effortless video post-production with Kling O1.

Consistent Characters Across Shots
Maintain consistent character faces, clothing, and props across multiple shots, even with changing camera angles. Use up to 5 reference images to build your subject. Perfect for creating engaging storylines using Kling O1.
Create Stunning Videos in Three Easy Steps
Leverage the power of Kling O1 to bring your visions to life.
Input Your Media
Upload text, images, or video clips. Use up to 5 reference photos to ensure consistent character design using the subject based reference system.
Customize Your Video
Use natural language commands to edit, style, and extend your video. Control duration up to 10 seconds per clip and easily create transitions.
Generate and Share
Generate your final video with synchronized audio. Share your creations with the world.
Frequently Asked Questions
Learn more about creating multi-modal videos with Kling O1.
Related Tools
Explore more AI tools in Core Product and beyond
More in Core Product
You May Also Like
Ready to Create Stunning Videos with Kling O1?
Experience the power of unified multi-modal video generation.