MiniMax Hailuo 2.3 is now available on fal
We’re thrilled to announce that MiniMax Hailuo 2.3 is now available on fal on day 0. This release delivers a major leap in video realism, camera control, and motion physics, making it MinMax's most capable generative video model to date. The model is available through Pro, Standard, and Fast endpoints for both text-to-video and image-to-video generation.
Understanding Hailuo's Unique Strengths
Cinematic Realism
Hailuo 2.3 stands out for its ability to reproduce cinematic realism across a wide range of lighting and compositional conditions.
The masked man approaches the camera
In the masked man and night highway videos, you can see the model’s mastery of exposure balance and light behavior, headlights, reflections, and motion blur all integrate naturally without overexposure or color banding. The geometry of the scene remains stable even under complex lighting, a clear improvement over earlier versions that struggled with temporal consistency.
A cinematic top-down aerial view of a wide modern highway at night, illuminated by glowing streetlights and the streaks of headlights and taillights from fast-moving cars. The surrounding landscape fades into darkness, with subtle reflections of light on the asphalt. The atmosphere feels calm yet dynamic, with a high-contrast composition emphasizing the geometry of the road lines and traffic patterns. Capture a realistic, high-resolution night-time scene with moody blue and amber tones.
For creators, that realism translates directly to creative freedom. Filmmakers and storytellers can now stage complex lighting setups without fighting model instability. Advertisers can generate luxury interiors and product scenes with photometric precision.
A semi-lit, upscale whiskey bar with a warm amber glow emanating from vintage pendant lights. The polished wooden bar counter reflects the light from bottles of premium whiskey neatly arranged on glass shelves behind the bartender. Several well-dressed men in tailored suits sit and stand around the bar, engaged in quiet conversation, their expressions relaxed and confident. The atmosphere exudes sophistication and exclusivity, with hints of cigar smoke and jazz music in the air. The lighting is moody and cinematic, emphasizing shadows and golden highlights for a rich, intimate ambiance.
Advanced Camera Control
One of Hailuo 2.3’s biggest upgrades is camera control. It maintains spatial coherence and motion stability even in high-speed, continuous shots.
A dramatic drone tracking shot capturing a ski jumper mid-flight during a ski jumping run. The camera follows closely from behind and above as the athlete launches off the inrun ramp, soaring through the crisp mountain air with perfect aerodynamic form. The skier’s body leans forward, skis held parallel and slightly apart, slicing through the cold air above a pristine snow-covered landscape. The drone maintains a smooth, cinematic motion, revealing the vast landing hill (outrun) below and the dramatic alpine backdrop of pine forests and snow-capped peaks.
In the ski jumper sequence, the drone-like tracking remains locked and smooth from takeoff to landing, with no jitter or frame warping — a challenge for most text-to-video systems. Similarly, in the snowboarder and mountain biker clips, the model handles fast motion and changing perspectives without breaking continuity. Background parallax and object motion are consistent, giving the feel of a real tracking camera rather than a stitched sequence.
The camera follows the snowboarder as they carve down the mountain through deep powder, each turn sending up huge rooster tails of snow. They navigate between trees, floating through the powder with smooth, flowing movements. The rider launches off a natural jump, grabbing the board mid-air before landing softly in deep snow and continuing down. Powder sprays continuously as they link turns together.
That control is crucial for production workflows. It enables realistic action sequences for narrative work, drone-style tracking shots for sports and outdoor advertising, and rhythmically synced motion for music videos. For anyone building b-roll libraries, Hailuo 2.3’s camera coherence means every clip looks like part of the same visual language.
The camera follows the mountain biker as they navigate a technical forest trail at high speed, wheels bouncing over roots and rocks. The rider approaches a jump, launching into the air with the bike, both rider and machine perfectly synchronized. They land smoothly and continue through tight turns, splashing through a stream crossing. Mud and water spray as the bike powers through challenging terrain. The atmosphere is wild and adventurous.
Improved Physics
Hailuo 2.3 demonstrates a major improvements in physical simulation and temporal coherence. In the gymnast backflip video, body movement and momentum are preserved naturally, with proper follow-through on limbs and landing. Prior models often produced artifacts in rotational motion, this one eliminates nearly all of them.
a gymnast doing a backflip in the park
The sailboat example shows improved water physics and reflection modeling. Surface distortion and wave response to the boat’s movement are consistent frame-to-frame, and light reflections remain physically plausible. This is a clear improvement in how the model handles complex inter-object dynamics, something crucial for realism.
A cinematic shot of a sailboat gliding smoothly across a sunlit ocean. The scene is captured from a low angle near the waterline, emphasizing the sleek white hull cutting through gentle waves. The bright sails are fully open, filled with wind, and glowing softly under the warm afternoon sunlight. A man wearing a captain’s hat and light casual attire stands confidently at the helm, steering the vessel.
Expressive Performances
Hailuo 2.3 also elevates human performance and emotion. In the kitchen argument scene, body language reads as intentional — gestures, micro-movements, and facial tension all align with the emotional tone. The model’s ability to maintain expressive accuracy across multiple actors shows significant progress in behavioral realism and emotion control.
For directors and visual storytellers, this makes a huge difference. Dialogue-driven pieces can now carry genuine emotion. Advertising scenes with on-camera talent can deliver nuanced reactions instead of static faces. Even short-form music or social content benefits from characters who move and feel human.
A tense cinematic shot inside a modern kitchen, bathed in soft evening light. A couple in their 30s stands across from each other: the woman near the counter, the man by the table. Their body language is sharp and expressive—raised hands, furrowed brows, and tense postures—conveying the intensity of their argument. The polished surfaces of the kitchen—stainless steel appliances, glossy cabinets, and scattered dishes—reflect the dramatic atmosphere. The camera slowly pushes in from a medium-wide shot to a closer view, capturing their faces as emotions flare.
the men perforrm a hiphop dance
Getting Started with Minimax Hailuo 2.3
The easiest way to explore Hailuo's capabilities is through Fal's Playground, where you can experiment with prompts and see immediate results. A detailed guide on how to integrate Hailuo 2.3 into your platform is available in our API documentation.
Endpoints:
Stay tuned to our Reddit, blog, Twitter, or Discord for the latest updates on generative media and the new model releases!





