Skip to main content
Feedback

Usage limits

note

Orchestrate is an Early Access release. Refer to the Early Access disclaimer for details.

The Orchestrate conversational interface processes text in small chunks called tokens, roughly one token per word, though punctuation and spaces also count. Orchestrate tracks token usage to manage how much information you can send to in the chat and how much it can send back.

  • Input tokens: Units to measure the amounts of text in your prompt, any data you provide, and the agent's instructions.
  • Output tokens: Units to measure the amount of text in the agent's response.

The following table lists token limits that apply to each prompt and response:

Per-Request token limits

DimensionLimitDescription
Input tokens~ 200,000 tokensTotal capacity of the user request including goal, clarifications, mode specifications, and Meta Hub glossaries
Response tokens~ 16,000 tokensMaximum length of a single Orchestrate reply

Per-Account rate quotas

Account limitations on tokens refresh after the rolling time window.

WindowCumulative Token CountNotes
Hourly400,000 tokensEach design session is approximately 200k tokens
Daily1,000,000 tokensApproximately 50 solution designs per day
Monthly20,000,000 tokensApproximately 1,000 full solution design sessions per month
On this Page