ChatGPT Agent: complete guide to features, setup, and what it can do

What is ChatGPT Agent, and how is it different from regular ChatGPT?

If you have used ChatGPT before, you know the drill. You type a question, it gives you an answer, and then you type another question. That back-and-forth is useful, but it is still just a conversation. You are doing all the thinking about what to do next, and ChatGPT is just responding one message at a time.

ChatGPT Agent is something different. Instead of answering one question and waiting for the next, it can actually go out and do things for you. You describe what you want to accomplish, and ChatGPT figures out the steps, takes action, and delivers results. It can pull data from your connected apps, write and run code, create documents, and handle multi-step workflows on its own.

Think of it this way: regular ChatGPT is like texting a really smart friend for advice. ChatGPT Agent is like handing that friend your laptop and saying "just take care of it for me." For a broader look at how agents and chatbots compare, check out our guide on what AI agents are and how they differ from chatbots.

How to access ChatGPT Agent

Agent mode is a separate mode inside ChatGPT that you activate yourself. It does not turn on automatically based on what you ask. When you open ChatGPT, you start in regular chat mode by default.

To switch to agent mode, you have two options. You can click the Tools dropdown (the "+" icon near the message box) and select "Agent mode" from the list. Or you can type /agent followed by your instructions directly in the chat.

Once you activate it, the interface changes. Instead of just getting a text response, you will see ChatGPT working in real time. It shows you the steps it takes and the decisions it makes along the way. It feels more like watching someone work on your behalf than having a conversation.

Who can access ChatGPT Agent?

ChatGPT Agent is only available on paid plans. If you are using the free version of ChatGPT, you will not see the option at all.

Here is what each plan gets:

ChatGPT Plus ($20/month) includes 40 agent tasks per month
ChatGPT Pro ($200/month) includes 400 agent tasks per month
ChatGPT Team ($30/month per user) includes agent access with usage limits
Business and Enterprise plans include agent access, with flexible usage options

One helpful detail: only the initial task prompt counts toward your monthly limit. Follow-up steps, clarifications, and the agent's own internal actions during a task do not eat into your quota.

If you hit your limit on a Plus plan, you can purchase additional credits at $0.75 each, or you can wait for the monthly reset.

Where does ChatGPT Agent work?

One of the best things about ChatGPT Agent is that it works across all platforms. You do not need a specific app or device to use it.

Web browser at chatgpt.com
Desktop app on Mac and Windows
Mobile apps on iOS and Android

The experience is consistent across all of these. You activate agent mode the same way, and it has the same capabilities regardless of which platform you are on.

That said, if you want the most seamless experience (especially for things like keyboard shortcuts and always-on access), the desktop app is worth installing. You can summon ChatGPT instantly with Option + Space on Mac or Alt + Space on Windows, which is useful when you are in the middle of other work.

How to install the ChatGPT desktop app

While the desktop app is not required for agent mode, it does make the overall experience smoother. Here is how to get it set up.

On Mac

Visit openai.com/chatgpt/download and click the Mac download button
Open the downloaded .dmg file and drag ChatGPT into your Applications folder
Launch ChatGPT from Applications and sign in with your OpenAI account
Grant any permissions the app requests (these are needed for features like the keyboard shortcut)

Requirements: macOS 11.0 (Big Sur) or newer, 4GB RAM, 500MB free storage space.

On Windows

Visit openai.com/chatgpt/download and click the Windows download button
Run the ChatGPTSetup.exe installer (run as administrator if prompted)
Choose your installation options (desktop shortcut, Start menu entry, launch at startup)
Complete the installation and sign in with your OpenAI account

Requirements: Windows 10 version 1903 or later, or Windows 11. 4GB RAM minimum and 500MB free storage space.

How ChatGPT Agent actually works under the hood

When you give ChatGPT Agent a task, it does not just generate text. It boots up its own virtual computer in the cloud and uses that machine to get work done.

This virtual computer comes equipped with several tools:

A terminal for writing and running code
Direct API access for connecting to external services

Here is what a typical task looks like from start to finish:

You describe what you want accomplished in plain language
ChatGPT analyzes your request and starts planning its approach
The agent begins executing, and you can watch its progress in real time
If it needs to take a significant action (like submitting a form or sending a message), it pauses and asks your permission first
When finished, it presents the results for your review

Connecting your apps

One of ChatGPT Agent's most practical features is its ability to connect directly to the tools you already use. OpenAI calls these "Apps" (they were previously called "connectors"), and they let ChatGPT pull information from and take actions in your existing services without you needing to copy and paste anything.

Currently supported apps

The list keeps growing, but some of the most useful integrations include:

Email: Gmail, Outlook
Communication: Slack, Microsoft Teams, Intercom
File storage: Google Drive, Dropbox, Box, SharePoint
Project management: Notion, ClickUp, Basecamp, Linear
Development: GitHub
Other: Airtable, Help Scout, and more

How to connect an app

Click your profile icon in ChatGPT, then go to Settings and select Apps
Browse the app directory and find the service you want to connect
Click Connect and follow the authorization prompts on that service's website
Once connected, ChatGPT can access that app whenever it is relevant to your task

You can also connect apps on the fly by using @mentions in your prompt (like typing @Gmail) or by clicking the + icon and selecting More.

Once an app is connected, ChatGPT Agent can do things like search your Gmail for a specific email thread, pull files from your Google Drive, check your Slack messages for context, or reference GitHub issues while working on a task. It only accesses data that is relevant to your current request and within the permissions you granted during setup.

A note on plan requirements

Most app integrations are available on Team, Business, Enterprise, and Edu plans. Some basic integrations work on Plus, but the full range of connected apps is geared toward team and business users.

Working with files and documents

ChatGPT Agent can create, read, and work with files in several ways.

Uploading files directly

You can upload files to any ChatGPT conversation, and the agent can read and analyze them. This works with PDFs, spreadsheets (Excel and CSV), Word documents, text files, images, and more. Upload a financial report and ask the agent to summarize the key findings. Drop in a dataset and ask it to build a chart. Hand it a contract and ask it to flag potential issues.

Creating files for you

The agent can also generate files from scratch. Ask it to build a spreadsheet with specific data, create a slide deck outline, draft a report, or write a document, and it will produce downloadable files you can save and edit on your own machine.

Connecting to cloud storage

Through the Apps integrations mentioned above, ChatGPT Agent can reach into your Google Drive, Dropbox, SharePoint, or Box to find and reference files without you needing to upload them manually. This is particularly useful when the agent needs to cross-reference multiple documents or pull from a large collection of files.

What it cannot do with local files

This is an important distinction. ChatGPT Agent does not have direct access to your computer's file system. It cannot browse your local folders, open files on your desktop, or save things directly to your hard drive the way some other AI agents can. Everything goes through either file uploads or connected cloud storage. If you need an AI that works directly with your local files and folders, see our guide on Claude Cowork.

Projects: keeping context without re-uploading

One of the most practical features in ChatGPT is the Projects tab. If you find yourself working on the same topic repeatedly and constantly re-uploading the same files to give ChatGPT context, then use the Projects feature.

A project is a dedicated workspace where you upload your relevant files once, and every conversation you start inside that project automatically has access to them. Your files stay put. You just open the project and start a new conversation.

This is especially useful for ongoing work. Say you are managing a client account and regularly need to draft updates, answer questions, or produce reports based on their background documents. Create a project for that client, upload their briefs, contracts, and reference materials, and every future conversation in that project can draw on all of it without you lifting a finger.

A few things worth knowing about Projects:

Files persist across conversations. Anything you upload to a project is available in every conversation you have inside it.
You can set custom instructions per project. Each project has its own system prompt, so you can tell ChatGPT to always respond in a certain format, adopt a specific tone, or keep particular constraints in mind for that work.
Projects work in both chat and agent mode. You can have a regular conversation inside a project or kick off an agent task, and both have access to the same files.
Storage limits apply. The amount of data you can store in a project depends on your plan, so very large file collections may hit limits on lower tiers.

Think of a project as giving ChatGPT a persistent briefing. Instead of re-explaining the situation every time, you set it up once and it is ready every time you return.

What can you actually do with ChatGPT Agent?

Now that you have ChatGPT Agent set up, what should you use it for? Agents excel at tasks that involve multiple steps, pulling from different sources, and synthesizing information. Here are the major categories where ChatGPT Agent makes the biggest impact:

Research and analysis — Give the agent a topic, a set of uploaded documents, or files from your connected cloud storage and it will read, cross-reference, and synthesize findings into a structured summary. Great for competitive research, literature reviews, or building a brief before a meeting.

Email management — Connect your Gmail and the agent can read your inbox, draft replies, summarize threads, and flag the messages that need your attention. It works directly with your connected email rather than requiring you to paste content into a separate chat window. If you use Outlook, see our guide on the Copilot agent.

Document creation and editing — Describe what you need and the agent will pull from your uploaded files or connected cloud storage to write a polished first draft. Whether it is a report, a proposal, or edits to a spreadsheet, the output is a downloadable file you can open and edit immediately.

Content writing and repurposing — The agent can write content from scratch, whether that is a blog post, a newsletter, a case study, or an internal report. It can also take content you already have and reformat it into something new: a LinkedIn post, a slide deck outline, a summary email, or a set of FAQs.

Workflow automation — Chain multiple steps into a single task so the agent can pull data from your connected apps, write a document, and deliver a summary all from one prompt. The goal is offloading full workflows rather than individual actions.

For detailed examples and specific prompts you can use in each category, check out our comprehensive guide: Top 8 Use Cases of AI Agents.

Scheduled tasks: put ChatGPT on autopilot

ChatGPT also supports scheduled tasks, which let you set up recurring prompts that run automatically on a schedule you define. Instead of typing the same request every morning or remembering to check in on a recurring workflow, you describe what you want done and ChatGPT handles it at the time you specify. You can create a scheduled task by typing something like "every Monday morning, summarize the top AI news from the past week and save it as a note" directly in the chat.

Scheduled tasks are available on Plus, Pro, Team, and Enterprise plans, currently on web and macOS only (not yet on Windows). They are a good fit for things like daily briefings, regular research summaries, or any recurring task that currently relies on you remembering to kick it off. One thing to keep in mind: the task runs inside the ChatGPT interface, so it does not have access to your local machine or files outside of what you have connected through Apps.

Best practices for getting the most out of ChatGPT Agent

Be specific about what you want

Vague prompts lead to vague results. Instead of "find me some flights," try "find round-trip flights from Austin to Tokyo for June 15-22, economy class, and compare the three cheapest options." The more detail you give, the better the output. See our guide on smart prompting for more on how to give effective instructions.

Start with simple tasks

Before handing over anything high-stakes, run a few low-risk tasks to get a feel for how the agent works. Try something like "look through my connected Google Drive folder and summarize the three most recent project documents." Watch how it approaches the task, what it does well, and where it might need clearer instructions.

Watch the agent work, at least initially

You can observe exactly what ChatGPT Agent is doing in real time. During your first few tasks, pay attention to how it navigates, what decisions it makes, and where it gets stuck. This helps you learn how to frame future requests more effectively.

Review everything before acting on it

The agent is capable but it does make mistakes. Always review the output before sharing it with others, acting on the information, or letting it take irreversible actions. This is especially important for anything involving money, communications, or sensitive data.

Use connectors to reduce manual work

The more apps you connect, the more useful the agent becomes. If you find yourself constantly uploading the same types of files or copying information from one tool to another, connecting that tool directly will save you significant time.

ChatGPT Atlas: a browser built around the agent

In October 2025, OpenAI released ChatGPT Atlas, a standalone web browser with ChatGPT baked directly into it. It is a separate product from ChatGPT Agent mode, but they complement each other well.

Atlas puts a ChatGPT sidebar right next to whatever webpage you are viewing. You can highlight text on any page and ask questions about it, get summaries while you read, compare products across tabs, and use agent mode right inside the browser for more complex tasks.

A few things worth knowing about Atlas:

Currently Mac-only. It requires Apple Silicon (M1 or later) and macOS 12 (Monterey) or newer. Windows and mobile versions are expected in the future.
It remembers context. Atlas has a "browser memories" feature that lets ChatGPT remember details from your browsing sessions and recall them later when relevant.
You control privacy. You can choose which sites ChatGPT can see, toggle memory on or off, and use incognito mode when you want full privacy. By default, OpenAI does not use your browsing data for model training.

To try Atlas, visit chatgpt.com/atlas to download the installer.

Atlas is not required to use ChatGPT Agent. Agent mode works perfectly fine in regular Chrome, Edge, Safari, or any other browser through the ChatGPT website. But if you want AI assistance woven into your actual browsing experience, Atlas is worth checking out.

What comes next

You now have a solid understanding of what ChatGPT Agent can do and how to start using it. The next step is to try it yourself.

Pick a task you do regularly that involves multiple steps or tools and hand it to the agent. Watch how it handles the work, adjust your instructions based on what you observe, and gradually build up to more complex workflows as you get comfortable.

The AI agent landscape is evolving quickly, and ChatGPT Agent is one of the most accessible entry points available right now. Whether you are using it for research, task automation, or just to offload some of the repetitive work that eats up your day, the best way to understand its value is to put it to work.

For more guidance on AI agents, explore our Intro to AI agents collection, where we cover everything from foundational concepts to advanced workflows.