One-shot video generation from a prompt — agent handles scripting, avatar selection, scene composition, and rendering. Supports generate (fire-and-forget) and chat (multi-turn) modes.
HeyGen API key. Obtain from your HeyGen dashboard.
Request body for creating a video from a prompt using Video Agent v3.
All configuration is flat (no nested config object). Files use the type-discriminated AssetInput union for flexible asset inputs.
Supports two modes:
generate (default): one-shot — auto-proceeds through storyboard, produces one video.chat: multi-turn — may pause for user input on real decisions (e.g. pick a voice),
auto-proceeds on confirmations. Allows revisions and follow-up videos.The message/prompt for video generation (1-10000 characters)
1 - 10000Session mode. 'generate' produces one video (fire-and-forget). 'chat' enables multi-turn interaction — the agent may pause for decisions and allows revisions.
generate, chat Specific avatar ID to use
Specific voice ID to use for narration
Style ID from GET /v3/video-agents/styles. Applies a curated visual template to the generated video.
Video orientation. If not provided, auto-detected from content.
landscape, portrait Optional file attachments (max 20 files)
20Asset input via publicly accessible HTTPS URL.
Webhook URL for completion/failure notifications
Optional callback ID included in webhook payload
When enabled, disables memory injection and extraction for this session
If true, skip interactive review and go straight to video generation (no storyboard approval step)
Set true to disable agentic stop (not recommended)
Successful response
Response from creating a video agent session.
Designed as a session resource for future extensibility — GET /v3/video-agents/{session_id} can return the same shape enriched with more fields (messages, video URL, etc.).