Enable AI vision for uploaded images
This guide explains how to enable AI Vision when uploading images to your agent. With AI Vision, your agent can automatically analyze and summarize images—transforming visual content into useful knowledge.
What is AI Vision
AI Vision allows your agent to intelligently process uploaded images using advanced computer vision models. It identifies what’s happening in the image, generates a natural-language description, and adds that summary directly to the agent’s knowledge base.
This feature supports:
- Photos
- Charts and diagrams
- Infographics
- Screenshots
- Any other image type
👉 Learn more about how this works in the How AI Vision works.
Note:AI Vision can also function as an OCR (optical character recognition) tool for reading text within images. However, performance is best with English text.
Why use AI Vision
- Converts visual data into readable summaries.
- Enhances your agent’s understanding of context-rich visuals.
- Makes data from images searchable and usable in chat.
- Auto-enables image citations for your agent (Premium and Enterprise only).
👉 Learn more about Activate image citations
How to enable AI Vision for uploaded images
You can enable AI Vision when uploading images to an existing agent or while creating a new one.
For existing agents
- Click the three dots next to your agent’s name, then select Build.

- Click Add Source.

- Select File Upload.

- Select one or more image files (e.g., JPG, PNG, or diagram screenshots) and click on Advanced Settings.

- Scroll down and toggle AI Vision in the upload window to enable intelligent image analysis.

Note:This setting must be turned on for each file upload batch.
- Click Add Files to confirm the upload.

When creating a new agent
- On your CustomGPT.ai dashboard, click New Agent.

- Click Upload.

- Upload your image and click Advanced Settings.

- Scroll down to AI Vision Image Processing

- Click On to enable it.

- Finish setting up your agent and click Create.

Related articles:
Updated about 4 hours ago