MiniMax Hailuo 02 — MiniMax's text-to-video and image-to-video at 768p — strong physics and motion.

MiniMax · From 43 credits / 6s on Dahab Studio.

MiniMax Hailuo 02 is a video model from the team behind ABAB and the Talkie chatbot. It excels at natural motion physics and atmospheric scenes, generating either 6s clips at 1080p or 10s clips at 768p (the 10s mode requires the lower resolution by Replicate's schema). Available on Dahab Studio at curated 6s and 10s durations with 16:9 or 9:16 aspect ratios.

Specs

  • Max duration: 10s
  • Resolution: 1080p (6s) or 768p (10s)
  • Aspect ratios: 16:9, 9:16
  • Native audio: No
  • Multi-reference: No
  • Pricing: 43 cr / 6s, 71 cr / 10s

Use cases

  • Atmospheric mood reels: Hailuo 02 nails environmental motion — water, weather, light shifts — making it ideal for landing-page hero videos.
  • Realistic human motion: When the prompt names specific actions (walking, gesturing, turning), Hailuo handles physics more naturally than most peers.
  • Image-to-video with controlled animation: Upload a still and prompt the camera or subject motion. Hailuo respects the source frame strictly.
  • Cost-conscious 10-second videos: At 71 credits per 10s clip, Hailuo is one of the cheapest ways to get a full 10-second 768p video on Dahab Studio.

Hailuo 02 vs alternatives

  • vs Grok Imagine Video: Hailuo wins on motion physics realism. Grok wins on native audio (Hailuo is silent — needs separate SFX/TTS pass).
  • vs Kling V3 Omni: Lower per-second cost. Kling Omni wins on multi-reference workflows.
  • vs Veo 3.1 Fast: Cheaper for 10-second clips and slightly better at unconventional camera moves. Veo wins on photorealism and native audio.

Frequently asked questions

Why is the 10-second mode 768p instead of 1080p?
Replicate's Hailuo 02 schema requires 768p resolution when duration is 10s — a model-side limit, not a Dahab cap. 6-second clips run at 1080p.
Does Hailuo 02 produce audio?
No. The model output is silent. To add audio, use Dahab's SFX pipeline (mmaudio) or layer in a voiceover via Talking Head if dialogue is needed.
Can I use image-to-video?
Yes. Hailuo accepts a starting image and animates motion based on your prompt. Upload via the Generate or Studio page.
What aspect ratios are supported?
16:9 and 9:16 only on Hailuo 02. 1:1 and 4:3 are not in the schema.
Is Hailuo 02 good for Arabic prompts?
Visual prompts in Arabic work fine. The output is silent so dialogue isn't a concern. For Arabic spoken content, route through Talking Head which mux Hailuo video with ElevenLabs Egyptian TTS.

Related models

  • Grok Imagine Video — xAI's native-audio video model — fast, cheap, and 1080p out of the box.
  • Google Veo 3.1 Fast — Google's fast-tier Veo 3.1 — 1080p with native synchronised audio.
  • Kling V2.6 — Kling's latest text-to-video and image-to-video at 1080p with audio.

Generate with Hailuo 02 →

← All AI video models on Dahab Studio