Important
- Billing for premium requests began on June 18, 2025 for all paid Copilot plans, and the request counters were only set to zero for paid plans.
- Premium request counters reset on the 1st of each month. See Monitoring your Copilot usage and entitlements.
- Certain requests may experience rate limits to accommodate high demand. Rate limits restrict the number of requests that can be made within a specific time period.
A request is any interaction where you ask Copilot to do something for you—whether it’s generating code, answering a question, or helping you through an extension. Each time you send a prompt in a chat window or trigger a response from Copilot, you’re making a request.
Some Copilot features use more advanced processing power and count as premium requests. The number of premium requests a feature consumes can vary depending on the feature and the AI model used.
The following Copilot features can use premium requests:
- Copilot Chat
- Copilot coding agent 1
- Agent mode in Copilot Chat 2
- Copilot code review
- Copilot Extensions
- Copilot Spaces
If you use Copilot Free, your plan comes with up to 2,000 code completion requests and up to 50 premium requests per month. All chat interactions count as premium requests.
If you're on a paid plan, you get unlimited code completions and unlimited chat interactions using the included models (GPT-4.1 and GPT-4o). Rate limiting is in place to accommodate for high demand. See Rate limits for Copilot.
Paid plans also receive a monthly allowance of premium requests, which can be used for advanced chat interactions, code completions using premium models, and other premium features. For an overview of the amount of premium requests included in each plan, see Plans for Copilot.
Unused requests for the previous month do not carry over to the following month.
Note
Additional premium requests are not available to:
- Users on Copilot Free. To access more premium requests, upgrade to a paid plan.
- Users who subscribe, or have subscribed, to Copilot Pro or Copilot Pro+ through Mobile on iOS or Android.
If you're on a paid plan and use all of your premium requests, you can still use Copilot with one of the included models for the rest of the month. This is subject to change. Response times for the included models may vary during periods of high usage. Requests to the included models may be subject to rate limiting. See Rate limits for Copilot.
If you need more premium requests beyond your monthly allowance, you can:
- Set a spending limit for additional premium requests. See Using budgets to control spending on metered products.
- Upgrade to a higher plan.
These actions can be taken by organization owners, billing managers, and personal account users.
Important
By default, all budgets are set to zero and premium requests over the allowance are rejected unless a budget has been created. Additional premium requests beyond your plan’s included amount are billed at $0.04 USD per request.
The available models vary depending on your Copilot plan. See Plans for Copilot.
Note
The models included with Copilot plans are subject to change.
Each model has a premium request multiplier, based on its complexity and resource usage. If you are on a paid Copilot plan, your premium request allowance is deducted according to this multiplier.
GPT-4.1 and GPT-4o are the included models, and do not consume any premium requests if you are on a paid plan.
If you use Copilot Free, you have access to a limited number of models, and each model will consume one premium request when used. For example, if you make a request using the o3-mini model, your interaction will consume one premium request, not 0.33 premium requests.
Model | Multiplier for paid plans | Multiplier for Copilot Free |
---|---|---|
GPT-4.1 | 0 | 1 |
GPT-4o | 0 | 1 |
GPT-4.5 | 50 | Not applicable |
Claude Sonnet 3.5 | 1 | 1 |
Claude Sonnet 3.7 | 1 | Not applicable |
Claude Sonnet 3.7 Thinking | 1.25 | Not applicable |
Claude Sonnet 4 | 1 | Not applicable |
Claude Opus 4 | 10 | Not applicable |
Gemini 2.0 Flash | 0.25 | 1 |
Gemini 2.5 Pro | 1 | Not applicable |
o1 | 10 | Not applicable |
o3 | 1 | Not applicable |
o3-mini | 0.33 | 1 |
o4-mini | 0.33 | Not applicable |
Premium request usage is based on the model’s multiplier and the feature you’re using. For example:
- Using GPT-4.5 in Copilot Chat: With a 50× multiplier, one interaction counts as 50 premium requests.
- Using GPT-4.1 on Copilot Free: Each interaction counts as 1 premium request.
- Using GPT-4.1 on a paid plan: No premium requests are consumed.