Vidu 2.0 Models Live on fal

fal is partnering with Vidu, a cutting-edge video generation platform designed to help enterprises and developers seamlessly create and scale video production. With advanced multimodal capabilities – such as start-end frames and reference images – and robust performance, Vidu makes it easy to generate high-quality, visually stunning videos tailored to diverse use cases. To try out Vidu models, visit the model gallery on fal.
Introducing Vidu video and editing models
Vidu’s capabilities streamline the creative process so that users can produce professional-grade videos with little manual effort. That makes it a good fit for a wide range of applications, from marketing campaigns to app development.
- Model Types: Text-to-Video, Image-to-Video, Start-End-to-Video, Reference-to-Video
- Templates: Industry-optimized templates ensure stable and high-quality video generation.
Key Capabilities
- Advanced Semantic Understanding – Vidu accurately follows input prompts. The Start-End-to-Video feature simplifies creating visuals that align with your vision, while complex tasks like gacha animations become faster and more precise.
- Exceptional Anime-Style Performance – Vidu excels in generating anime-style videos, maintaining consistency in text-to-video and image-to-video transformations without unexpected style changes.
- Stable Subject Consistency – No more inconsistent characters or objects. Vidu ensures the main subject remains stable throughout the video, eliminating the need for manual keyframes.
- Custom Templates – Designed for industry-specific needs, Vidu’s templates are optimized for highly stable generation results. Developers can quickly integrate and customize for different business scenarios.
Example Outputs
Image-to-Video: Turn static images into dynamic videos with creative storytelling and animations.
Uploaded image of the David statue in chrome. Prompt: David statue melting.
Start-End-to-Video: Generate a video that begins on one uploaded image (the start frame) and ends on another (the end frame).


Uploaded image of a car chassis (start frame) and finished car body (end frame).
Video generated with above start and end images.
Reference-to-Video: Generate videos from a reference image and text description. Supports various subjects like characters and objects. Upload multiple perspectives of a subject to create videos that maintain consistency.



Uploaded reference images of two characters and a background scene.
Video generated with above reference images.
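For developers wiring these model types into an application, the inputs shown in the examples above can be summarized in a small request-building helper. This is a minimal sketch, not the fal or Vidu API: the field names (`prompt`, `image_url`, `start_image_url`, `end_image_url`, `reference_image_urls`), the `build_request` function, and the assumption that a prompt is optional for Start-End-to-Video are all hypothetical and used only for illustration. Check each model's page in the fal model gallery for the actual input schema.

```python
# Hypothetical mapping of each Vidu model type to the inputs it needs.
# The field names are assumptions for illustration, not the real schema.
REQUIRED_INPUTS = {
    "text-to-video": {"prompt"},
    "image-to-video": {"prompt", "image_url"},
    "start-end-to-video": {"start_image_url", "end_image_url"},
    "reference-to-video": {"prompt", "reference_image_urls"},
}


def build_request(model_type: str, **inputs) -> dict:
    """Validate inputs for a model type and return a request payload."""
    if model_type not in REQUIRED_INPUTS:
        raise ValueError(f"unknown model type: {model_type!r}")

    # Treat empty strings and empty lists as missing.
    provided = {name for name, value in inputs.items() if value}
    missing = REQUIRED_INPUTS[model_type] - provided
    if missing:
        raise ValueError(f"{model_type} is missing inputs: {sorted(missing)}")

    return {"model_type": model_type, "arguments": dict(inputs)}


# Mirrors the Image-to-Video example above (placeholder image URL).
request = build_request(
    "image-to-video",
    prompt="David statue melting",
    image_url="https://example.com/david-chrome.png",
)
print(request["model_type"], sorted(request["arguments"]))
```

Validating inputs before submitting a job keeps failures local and cheap: a missing end frame is caught before any generation time is spent.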
Head over to the fal model gallery to explore the Vidu integration. Keep an eye on our blog, Twitter, or Discord for more exciting updates, new model launches, and product improvements.
– The fal Team