| Video Agent | Direct Video | Cinematic Avatar | |
|---|---|---|---|
| Endpoint | POST /v3/video-agents | POST /v3/videos | POST /v3/videos (type: "cinematic_avatar") |
| Input | Natural language prompt | Structured JSON | Prompt + 1–3 avatar looks |
| Script writing | Agent writes it | You write it | None — motion driven by the prompt |
| Avatar selection | Agent picks (or you override) | You specify | You specify 1–3 looks |
| Voice selection | Agent picks (or you override) | You specify | None — no spoken voice |
| Interactive iteration | ✅ Via chat mode | ❌ | ❌ |
| Webhook support | ✅ callback_url | ✅ callback_url | ✅ callback_url |
| Control level | Low (prompt-driven) | High (explicit) | Medium (prompt + your looks) |
Video Agent — best for speed
Send a text prompt, get a video. The agent handles scripting, avatar selection, and scene composition automatically.- You want a video fast without managing avatars or scripts
- You’re building a product where end users describe videos in natural language
- You want to iterate interactively — use
mode: "chat"to review the storyboard before rendering
Direct Video — best for control
Explicitly specify the avatar, voice, and script. Predictable, repeatable output for automated pipelines.- Building automated pipelines (personalized sales videos, daily reports)
- You need exact control over avatar, voice, and script
- Generating videos programmatically from data (CRM records, form submissions)
Cinematic Avatar — best for cinematic shots
A prompt-driven variant ofPOST /v3/videos. Hand HeyGen 1–3 avatar looks plus a natural-language prompt and the Seedance pipeline composes the scene, motion, and framing — no script or voice. See the full Cinematic Avatar guide.
- You want cinematic b-roll or motion of an avatar rather than a talking-head script
- You want to feature up to three looks in one composed shot
- You want to steer style and motion with your own reference videos and images
Not sure which to pick?
Start with Video Agent. If you need precise control over the script, avatar, or timing, switch toPOST /v3/videos. If you want a prompt-composed cinematic shot with no script, reach for Cinematic Avatar.
You can also combine them — use Video Agent to explore ideas and find the right style, then recreate with explicit parameters for the final production version.
