Use fast responses mode for your AI agent
This step-by-step guide explains how to enable fast responses mode and customize your agent’s response behavior.
What is fast responses mode?
Fast Responses mode uses a lightweight and optimized version of GPT-4o, called GPT-4o mini, along with internal performance enhancements. Although its reasoning capabilities may be slightly reduced compared to larger models, it leverages Retrieval-Augmented Generation (RAG) to ensure responses are still based on accurate and relevant information from your data.
This mode is ideal for use cases where speed matters—such as live chats, high-volume queries, or time-sensitive support.
How to enable fast responses mode:
- Click Agent Settings.

- In Agent Settings, click AI Intelligence tab.

- Select Fastest Responses to enable the mode.

- Scroll to the Generate Responses From section.

- Choose your preferred response source:
- My Data Only (Recommended): Uses only your uploaded content for responses.
- My Data + LLM: Combines your data with the model’s general knowledge base for more diverse replies.

- Click Save Settings at the bottom of the page to apply your changes.

Updated 9 days ago