Platform

Rate Limits

Rate limits are applied per account, across all API keys and all models. The default is 60 requests per minute, enforced as a sliding 60-second window. There is no token-per-minute cap at launch.

Header	Meaning
X-RateLimit-Limit	Max requests per minute on this account
X-RateLimit-Remaining	Requests left in the current 60-second window
X-RateLimit-Reset	Unix timestamp when the window resets
Retry-After	Seconds to wait before retrying (only on 429 responses)

When you exceed the limit, the API returns a 429 with a Retry-After header. Higher limits are available on Enterprise — contact sales to negotiate.