Visual Retrieval with PDFs FAQ

What is this feature?

ChatGPT Enterprise now supports reading and understanding visuals (images, graphs, diagrams, etc.) embedded in PDF files included in prompts. Users can upload a PDF, and ChatGPT can interpret the text and any visual elements within that file.

How does it work?

Click the paperclip (attachment) icon in the chat to upload your PDF.
ChatGPT will read both the text and any embedded images or diagrams within the PDF.
You can then ask questions or request summaries—anything from extracting the main points of a report to explaining complex charts.

Is it compatible with GPTs and Projects?

Partially. PDFs uploaded as GPT Knowledge or Project Files are processed using text-only retrieval. PDFs uploaded by users during interactions with a published GPT or within a Project conversation are processed using visual retrieval.

Who can use it?

This capability is available only to ChatGPT Enterprise customers. It is not supported for ChatGPT Free, Pro, Team, or Edu accounts.

What problem does it solve?

Previously, ChatGPT could only process images when uploaded separately (e.g., as PNGs/JPEGs). Embedded visuals within a PDF were overlooked. Now, ChatGPT can provide a more holistic analysis—combining the text and visuals in one go—leading to more accurate and context-rich responses.

Will this feature eventually extend to other plans?

It’s currently exclusive to Enterprise and may be expanded in the future. We’re monitoring customer feedback to determine when and how to broaden support.