Query consumption for high-performance models

Overview

Some models in the Intelligence tab consume more than one query per message. This is indicated directly in the model dropdown when you select one of these models.

This article explains what that means, how it affects your query usage, and how to monitor consumption.


Which models are affected

Currently, Claude Opus 4.5 and Claude Opus 4.6 consume 2 queries per message.

ModelQueries per Message
Standard models (e.g., GPT-4o, Claude Sonnet)1
Claude Opus 4.52
Claude Opus 4.62

Note: The model selection dropdown is available to Enterprise plan users only.

screenshot of AI model dropdown list

How this affects your usage

Every message processed by your agent — including messages from end users interacting with your deployed agent — counts against your query balance at the rate shown for the selected model.

For example, if your agent receives 100 messages while an Opus model is selected, 200 queries will be deducted from your balance.


Monitoring your query usage

You can monitor your query consumption at any time by going to your usage dashboard in account settings. This is especially important in high-traffic deployments.

👉

Learn more about How to view limits and usage


Switching to a different model

You can change the selected model in the Intelligence tab at any time. Switching to a standard model reverts query consumption to 1 query per message.


Other high-performance models

Additional models may be added under this framework in the future. Each will display its own query multiplier in the dropdown.


Related articles