Rate Limits
API requests are rate-limited per tenant to ensure fair usage and platform stability.
Limits by tier
| Tier | Requests/min | Requests/hour | Monthly agent runs |
|---|---|---|---|
| Basic | 20 | 600 | 5,000 |
| Professional | 100 | 3,000 | 50,000 |
| Enterprise | 500 | 15,000 | 500,000 |
Rate limit headers
Every response includes these headers:
| Header | Description |
|---|---|
X-RateLimit-Limit | Maximum requests allowed per minute |
X-RateLimit-Remaining | Requests remaining in the current window |
X-RateLimit-Reset | Unix timestamp when the rate limit resets |
Exceeding limits
When you exceed your rate limit, the API returns 429 Too Many Requests:
{
"detail": "Rate limit exceeded. Retry after 1712500800."
}
Wait until the X-RateLimit-Reset timestamp before retrying. Use exponential backoff if you receive repeated 429 responses.
Per-endpoint limits
Some endpoints have additional limits:
| Endpoint | Additional limit | Reason |
|---|---|---|
POST /ai/agents/{agent_id}/trigger | 10/min per agent | Prevents runaway manual triggers |
GET /ai/agents/{agent_id}/decisions | 60/min | Pagination queries can be expensive |
GET /ai/agents/{agent_id}/metrics | 30/min | Metrics aggregation is compute-intensive |
Scheduled and event-driven triggers
Scheduled and event-driven triggers consume the same rate limit pool as API requests. If your agents run frequently on schedules or events, factor this into your rate limit planning.
Increasing limits
Contact engineering@korastratum.com or your account manager to discuss Enterprise tier limits or custom rate limit configurations.