Query consumption for high-performance models

Overview

Some models in the Intelligence tab consume more than one query per message. This is indicated directly in the model dropdown when you select one of these models.

This article explains what that means, how it affects your query usage, and how to monitor consumption.

Which models are affected

Currently, Claude Sonnet 4.6 Reasoning, Claude Opus 4.5, Claude Opus 4.6, and Gemini 3.1 consume 2 queries per message.

Model	Queries per Message
Standard models (e.g., GPT-4o, Claude Sonnet)	1
Claude Opus 4.5	2
Claude Opus 4.6	2
Sonnet 4.6 Reasoning	2
GPT-5.4	2
Gemini 3.1	2

Note: The model selection dropdown is available to Enterprise plan users only.

How this affects your usage

Every message processed by your agent — including messages from end users interacting with your deployed agent — counts against your query balance at the rate shown for the selected model.

For example, if your agent receives 100 messages while an Opus model and Sonnet 4.6 Reasoning are selected, 200 queries will be deducted from your balance.

Monitoring your query usage

You can monitor your query consumption at any time by going to your usage dashboard in account settings. This is especially important in high-traffic deployments.

👉
Learn more about How to view limits and usage

Switching to a different model

You can change the selected model in the Intelligence tab at any time. Switching to a standard model reverts query consumption to 1 query per message.

Other high-performance models

Additional models may be added under this framework in the future. Each will display its own query multiplier in the dropdown.

Overview

Which models are affected

How this affects your usage

Monitoring your query usage

Switching to a different model

Other high-performance models

Related articles