Platform
Rate Limits
Rate limits are applied per account, across all API keys and all models. The default is 60 requests per minute, enforced as a sliding 60-second window. There is no token-per-minute cap at launch.
| Header | Meaning |
|---|---|
| X-RateLimit-Limit | Max requests per minute on this account |
| X-RateLimit-Remaining | Requests left in the current 60-second window |
| X-RateLimit-Reset | Unix timestamp when the window resets |
| Retry-After | Seconds to wait before retrying (only on 429 responses) |
When you exceed the limit, the API returns a 429 with a Retry-After header. Higher limits are available on Enterprise — contact sales to negotiate.