Add `builtin_tools` to `Agent` #2102

mattbrandman · 2025-06-30T23:53:02Z

Fixes test and merge conflicts for #1722

Closes #840

Co-authored-by: Marcelo Trylesinski <[email protected]>

…dantic#1752) Co-authored-by: Marcelo Trylesinski <[email protected]>

- Added builtin_tools field to ModelRequestParameters - Merged new output_mode and output_object fields from main - Updated test snapshots to include all fields - Resolved import conflicts to include both builtin tools and profiles

…test_google.py

dmontagu · 2025-07-16T15:27:58Z

pydantic_ai_slim/pydantic_ai/builtin_tools.py

+
+    city: str
+    country: str
+    region: str


what is region?

More generally, are the required contents (of this field and the others on this class) vendor-specific in any way? Should we include examples?

dmontagu · 2025-07-16T15:30:46Z

pydantic_ai_slim/pydantic_ai/messages.py

+    part: ServerToolCallPart
+    """The server tool call to make."""
+
+    event_kind: Literal['server_tool_call'] = 'server_tool_call'


it makes me sad that some of our discriminators are snake_case (here) and some are kebab-case (part_kind). I guess you probably didn't introduce this inconsistency in this PR, but it feels bad. Maybe we should change it in v1 and just do value normalization during validation (i.e., replace any _ with - or vice versa).

pydantic_ai_slim/pydantic_ai/builtin_tools.py

pydantic_ai_slim/pydantic_ai/models/cohere.py

pydantic_ai_slim/pydantic_ai/models/gemini.py

pydantic_ai_slim/pydantic_ai/models/google.py

dmontagu · 2025-07-16T15:36:03Z

pydantic_ai_slim/pydantic_ai/models/google.py

+        if part.executable_code is not None:
+            items.append(ServerToolCallPart(args=part.executable_code.model_dump(), tool_name='code_execution'))
+        elif part.code_execution_result is not None:
+            # TODO(Marcelo): Is the idea to generate the tool_call_id on the `executable_code`, and then pass it here?


I feel like we can/should answer this question before merging?

pydantic_ai_slim/pydantic_ai/models/groq.py

pydantic_ai_slim/pydantic_ai/models/mistral.py

pydantic_ai_slim/pydantic_ai/models/openai.py

mattbrandman · 2025-07-16T16:48:20Z

@Kludex should we include code interpreter from openai as its accessible on the responses API or save that for a follow up given its a fairly complex set of types

Co-authored-by: David Montague <[email protected]>

mattbrandman · 2025-07-21T17:40:33Z

@Kludex unless we want to break these out into individual classes this feels like its in a pretty good spot for a first pass and to prevent it already feels like a giant PR

pydantic_ai_slim/pydantic_ai/agent.py

DouweM · 2025-07-23T14:49:44Z

pydantic_ai_slim/pydantic_ai/agent.py

+            if tool == 'web-search':
+                self._builtin_tools.append(WebSearchTool())
+            else:
+                self._builtin_tools.append(tool)


I think we should either support only passing an AbstractBuiltinTool, or include all built-in (to Pydantic AI) built-in tools as strings, meaning also code-execution. If we support strings, I think we should have a dict of name to class in builtin_tools.py, and use that here.

DouweM · 2025-07-23T14:50:20Z

pydantic_ai_slim/pydantic_ai/builtin_tools.py

+    """
+
+
+class UserLocation(TypedDict, total=False):


Can we call this WebSearchUserLocation or something so it's clear it belongs to that class?

DouweM · 2025-07-23T14:51:56Z

pydantic_ai_slim/pydantic_ai/messages.py

+
+@dataclass(repr=False)
+class ServerToolReturnPart(BaseToolReturnPart):
+    """A tool return message from a server tool."""


It's a bit confusing we're calling them "built-in tools" where the user passes them, but "server tools" here. Is there a specific reason for that, or can we standardize on one name? I prefer BuiltinToolReturnPart.

DouweM · 2025-07-23T14:53:36Z

pydantic_ai_slim/pydantic_ai/models/anthropic.py

+                        tool_call_id=item.id,
+                    )
+                )
+            elif isinstance(item, BetaCodeExecutionToolResultBlock):


Can we merge this with the elif isinstance(item, BetaWebSearchToolResultBlock): above?

DouweM · 2025-07-23T14:55:10Z

pydantic_ai_slim/pydantic_ai/agent.py

@@ -307,6 +311,8 @@ def __init__(
            output_retries: The maximum number of retries to allow for output validation, defaults to `retries`.
            tools: Tools to register with the agent, you can also register tools via the decorators
                [`@agent.tool`][pydantic_ai.Agent.tool] and [`@agent.tool_plain`][pydantic_ai.Agent.tool_plain].
+            builtin_tools: The builtin tools that the agent will use. This depends on the model, as some models may not
+                support certain tools. On models that don't support certain tools, the tool will be ignored.


I feel like silently ignoring built-in tools is unexpected, because the model is not going to behave as intended if it can't search the web or execute code. I'd prefer to raise an error from the model class if it sees an unsupported built-in tool, similarly to how we do for unsupported file/binary content types.

I'm not sure it's the same case. I expect builtin tools to fail far more than the different content inputs.

@Kludex I agree, but wouldn't you want the user to realize that they asked for something the model can't actually do?

pydantic_ai_slim/pydantic_ai/models/groq.py

DouweM · 2025-07-23T15:03:17Z

pydantic_ai_slim/pydantic_ai/models/openai.py

@@ -734,6 +754,7 @@ async def _responses_create(
    ) -> responses.Response | AsyncStream[responses.ResponseStreamEvent]:
        tools = self._get_tools(model_request_parameters)
        tools = list(model_settings.get('openai_builtin_tools', [])) + tools


Should we deprecate this setting and push people to use the new builtin_tools?

This setting currently supports FileSearchToolParam and ComputerToolParam as well, should we implement those as AbstractBuiltinTools?

Should we have a way to build an AbstractBuiltinTool with arbitrary built-in tool JSON supported by a given model, so users can start using them without having to wait for us to add support to the model class?

Should we deprecate this setting and push people to use the new builtin_tools?

Yes.

This setting currently supports FileSearchToolParam and ComputerToolParam as well, should we implement those as AbstractBuiltinTools?

I don't think they work properly right now, but the idea is to support them yes.

Should we have a way to build an AbstractBuiltinTool with arbitrary built-in tool JSON supported by a given model, so users can start using them without having to wait for us to add support to the model class?

Yes, but that needs to be model specific, so I left it out of this PR. I think we should have that possibility tho.

DouweM · 2025-07-23T15:03:57Z

pyproject.toml

@@ -286,4 +286,4 @@ skip = '.git*,*.svg,*.lock,*.css,*.yaml'
 check-hidden = true
 # Ignore "formatting" like **L**anguage
 ignore-regex = '\*\*[A-Z]\*\*[a-z]+\b'
-ignore-words-list = 'asend,aci'
+ignore-words-list = 'asend,aci,Hemishpere,synchonizing'


Why do we need those? Both of those clearly look like typos

The LLM generated, but I think I removed them already - We can remove them from here.

Co-authored-by: Douwe Maan <[email protected]>

Kludex and others added 30 commits May 14, 2025 11:00

Add builtin_tools to Agent

57e568b

make AbstractBuiltinTool serializable

97ab44b

Add more work on it

e3dda9d

Merge remote-tracking branch 'origin/main' into add-builtin-tools

3ad6d38

Add builtin tools

0b43f65

merge

fa7fd11

add more built-in-tools

32324fa

Fix test

f33e568

Add support on Groq

13d7433

Add support for Google

ac85205

Add support for MCP's Streamable HTTP transport (pydantic#1716)

c93633f

Timeout for initializing MCP client (pydantic#1833)

3a8b640

Require mcp 1.9.0+ (pydantic#1840)

360de87

Don't send empty messages to Anthropic (pydantic#1027)

cb4e539

Co-authored-by: Marcelo Trylesinski <[email protected]>

Add vendor_id and finish_reason to Gemini/Google model responses (p…

4e3769a

…ydantic#1800)

Fix units of sse_read_timeout timedelta (pydantic#1843)

ebb536f

Support functions as output_type, as well as lists of functions and o…

c8bb611

…ther types (pydantic#1785)

Enhance Gemini usage tracking to collect comprehensive token data (py…

6bcc1a8

…dantic#1752) Co-authored-by: Marcelo Trylesinski <[email protected]>

more

97ff651

merge

1d47e1e

merge

5f89444

merge

9512987

Pass tests

800a71a

Fix remaining merge conflict markers in openai.py, anthropic.py, and …

bc298d6

…test_google.py

add extra google

46c06c2

fix formatting

3496567

fix codespell

c193059

fixing types

427dec2

fixing types in gemini

866ad21