HeyGen offers two Enterprise billing models depending on your contract. Your account team will help you choose the right fit.
Model
How It Works
Best For
Usage-Based Billing
Monthly Minimum Commitment (MMC) with a per-second rate and overage billing
Teams with predictable, recurring API usage
Credit Packages
Purchase an annual pool of credits upfront under a contract
Teams that prefer a fixed annual spend with flexible drawdown
Both models authenticate with an API Key (x-api-key header). Check your balance at any time with GET /v3/user/me → wallet.
OAuth vs API Key: If you authenticate with an OAuth bearer token, usage is billed against your web plan — not your Enterprise API balance.API key authentication provides higher concurrency limits and is more flexible and powerful for automation and integration workflows.
Usage-based billing pairs a flat Monthly Minimum Commitment (MMC) with a per-second credit rate. If you exceed the included credits in a given month, overage is billed at a slightly higher rate.
$5,000 MMC tier: Your contract includes 600,000 credit-seconds per month at $0.00750/sec. If you use 700,000 credit-seconds in a month, the first 600,000 are covered by your MMC. The remaining 100,000 seconds are billed at the overage rate of $0.00833/sec, adding $833.00 to your invoice.
Annual Contract: You agree to a total credit amount for the contract term (typically 12 months).
Drawdown: Credits are consumed per second as you use HeyGen’s API. The per-second cost of each product is listed in the pricing table below.
Balance Tracking: Monitor your remaining credit balance via GET /v3/user/me → wallet.
Credit package terms — including total credits, price per credit, and contract duration — are set in your Enterprise agreement. Contact your account team to discuss packaging options.
The table below applies to both billing models. For usage-based accounts, credits are metered per second against your included pool and overage. For credit package accounts, credits are deducted from your annual balance.
Concurrent jobs include any asynchronous generation in progress: Video Agent sessions, avatar video renders, and video translations. Exceeding the limit returns 429 Too Many Requests with a Retry-After header.
All endpoints enforce rate limits. When exceeded, the API returns 429 Too Many Requests with a Retry-After header indicating the number of seconds to wait before retrying.