Gemini Deep Research is now available in preview with collaborative planning, visualization, MCP support, and more.

Send feedback

Gemini Deep Research Agent

The Gemini Deep Research Agent autonomously plans, executes, and synthesizes multi-step research tasks. Powered by Gemini, it navigates complex information landscapes to produce detailed, cited reports. New capabilities allow you to collaboratively plan with the agent, connect to external tools using MCP servers, include visualizations (like charts and graphs), and provide documents directly as input.

Research tasks involve iterative searching and reading and can take several minutes to complete. You must use background execution (set background=true) to run the agent asynchronously and poll for results or stream updates. See Handling long running tasks for more details.

The following example shows how to start a research task in the background and poll for results.

Tool	Type value	Description
Google Search	`google_search`	Search the public web. Enabled by default.
URL Context	`url_context`	Read and summarize web page content. Enabled by default.
Code Execution	`code_execution`	Execute code to perform calculations and data analysis. Enabled by default.
MCP Server	`mcp_server`	Connect to remote MCP servers for external tool access.
File Search	`file_search`	Search your uploaded document corpora.

Field	Type	Required	Description
`type`	`string`	Yes	Must be `"mcp_server"`.
`name`	`string`	No	A display name for the MCP server.
`url`	`string`	No	The full URL for the MCP server endpoint.
`headers`	`object`	No	Key-value pairs sent as HTTP headers with every request to the server (for example, authentication tokens).
`allowed_tools`	`array`	No	Restrict which tools from the server the agent may call.

Event type	Delta type	Description
`content.delta`	`thought_summary`	Intermediate reasoning step from the agent.
`content.delta`	`text`	Part of the final text output.
`content.delta`	`image`	A generated image (base64-encoded).

Feature	Standard Gemini Models	Gemini Deep Research Agent
Latency	Seconds	Minutes (Async/Background)
Process	Generate -> Output	Plan -> Search -> Read -> Iterate -> Output
Output	Conversational text, code, short summaries	Detailed reports, long-form analysis, comparative tables
Best For	Chatbots, extraction, creative writing	Market analysis, due diligence, literature reviews, competitive landscaping

Field	Type	Default	Description
`type`	`string`	Required	Must be `"deep-research"`.
`thinking_summaries`	`string`	`"none"`	Set to `"auto"` to receive intermediate reasoning steps during streaming. Set to `"none"` to disable.
`visualization`	`string`	`"auto"`	Set to `"auto"` to enable agent-generated charts and images. Set to `"off"` to disable.
`collaborative_planning`	`boolean`	`false`	Set to `true` to enable multi-turn plan review before research begins.

Gemini Deep Research Agent

Supported Versions

Collaborative planning

Step 1: Request a plan

Step 2: Refine the plan (optional)

Step 3: Approve and execute

Visualization

Supported tools

Google Search

URL Context

Code Execution

MCP servers

Basic usage

File Search

Steerability and formatting

Multimodal inputs

Document understanding

Handling long-running tasks

Streaming

Stream event types

Follow-up questions and interactions

When to use Gemini Deep Research Agent

Agent configuration

Availability and pricing

Estimated costs

Safety considerations

Best practices

Limitations

What's next

Related Articles