Disable reflection after tool use #1024

Closed
WorldInnovationsDepartment opened this issue Mar 1, 2025 · 8 comments

Comments

@WorldInnovationsDepartment
WorldInnovationsDepartment commented Mar 1, 2025

Description

Hello, Pydantic AI Team 👋

I’d like to propose a feature that allows for the creation of highly specialized agents that only execute tools, without streaming textual output from the LLM (i.e., disabling reflection after tool use).

Use Case:

Consider a Search Agent that solely runs tool calls, passing their outputs to an Analyst Agent for processing. The Analyst Agent then collaborates with a Critique Agent to construct the final response. This setup enables modular and efficient agent interactions where streaming text from intermediate agents is unnecessary.

Current Limitation:

From my understanding, the framework does not currently support an agent that exclusively executes tools without attempting to stream a final response. I attempted to implement a workaround using .iter() and breaking the stream when an answer starts forming, but this approach was unsuccessful.

Questions & Contribution:

  1. Does this feature align with your roadmap or design philosophy?
  2. Is there a way to implement this behavior using an existing but undocumented approach?
  3. If this makes sense to you, I’d be happy to contribute a PR! Could you provide guidance on how best to implement it?

Looking forward to your thoughts! 🚀

References

I like how Autogen handles this: their AssistantAgent (similar to Agent in pydantic-ai) has a reflect_on_tool_use param, and when it is False, the FinalResult is just the tool outputs.
See their docs for more details, particularly the last block in the diagram: https://microsoft.github.io/autogen/dev/reference/python/autogen_agentchat.agents.html#autogen_agentchat.agents.AssistantAgent
[Image: AssistantAgent flow diagram]

@WorldInnovationsDepartment

@dmontagu @Kludex, can we schedule a call, or could you give me some tips for implementing this so I can create a PR? It’s a pretty useful feature that keeps coming up in different requests.

@WorldInnovationsDepartment

@Kludex @dmontagu @samuelcolvin
I was able to implement reflect_on_tool_use=False with PydAI.
I specified a structured output with the following format:

```python
from pydantic import BaseModel, Field

class ResearchResponse(BaseModel):
    is_final: bool = Field(...)
```

This forces PydAI to call the final_result tool to return a structured output.
I created a script that asks 20 questions, gathers latency statistics, and did the same for the Researcher (reflect_on_tool_use=False) + Analyst agent (no tools) in Autogen. Now, I have some latency stats.

The first screenshot shows PydAI, and the second one is Autogen. The difference is significant. As I understand, there is no way in PydAI to avoid calling final_result when generating text.

PydAI: [screenshot of latency stats]
Autogen: [screenshot of latency stats]

@Finndersen

Is this issue similar/related? #142

@WorldInnovationsDepartment

WorldInnovationsDepartment commented Mar 4, 2025

@Finndersen
I have created PR #1040 for my issue. It is similar, but as I understand it, it will not be fixed by #142.

My approach is interrupting the agent run once all tools have completed execution.

Example:
A Search Agent runs five tool calls to gather information for a task. If I am forced to call an unnecessary tool just to avoid triggering text generation, I will waste time on that extra call. My approach is to terminate the agent’s run when it no longer needs to invoke any tool calls.

This enables the creation of agents that work exclusively with tools, eliminating unnecessary tool calls and reducing latency—something currently missing.

cc @Kludex @dmontagu
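To make the proposed control flow concrete, here is a minimal sketch in plain Python. This is not pydantic-ai's actual API; the agent loop, ModelResponse class, and reflect_on_tool_use flag are all assumptions modeled on the Autogen behavior described above. With reflect_on_tool_use=False, the run ends as soon as the pending tool calls finish, and their outputs become the final result, skipping the extra model turn.

```python
from dataclasses import dataclass, field

@dataclass
class ModelResponse:
    """Stand-in for an LLM response: free text plus requested tool calls."""
    text: str = ""
    tool_calls: list = field(default_factory=list)  # [(tool_name, kwargs), ...]

def run_agent(model, tools, prompt, reflect_on_tool_use=True):
    """Minimal agent loop (hypothetical, for illustration only)."""
    history = [("user", prompt)]
    while True:
        response = model(history)
        if not response.tool_calls:
            return response.text  # plain text answer, run ends normally
        results = [(name, tools[name](**kwargs))
                   for name, kwargs in response.tool_calls]
        if not reflect_on_tool_use:
            # Proposed behavior: the tool outputs ARE the final answer;
            # no extra LLM turn to "reflect" on them.
            return results
        history.extend(("tool", r) for r in results)

# Toy model: first turn requests a search, second turn answers in text.
def toy_model(history):
    if len(history) == 1:
        return ModelResponse(tool_calls=[("search", {"query": "pydantic-ai"})])
    return ModelResponse(text="summary of search results")

tools = {"search": lambda query: f"results for {query!r}"}

print(run_agent(toy_model, tools, "find docs", reflect_on_tool_use=False))
print(run_agent(toy_model, tools, "find docs", reflect_on_tool_use=True))
```

With reflection disabled, the toy model is called once; with it enabled, the loop makes a second model call just to turn the tool output into text, which is exactly the latency overhead measured above.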

@Finndersen

Does that change mean that all tool calls will always return their results directly instead of to the LLM? We would probably want to make it a bit more flexible than that.

@WorldInnovationsDepartment

WorldInnovationsDepartment commented Mar 4, 2025

@Finndersen You are right. I think we can use the Autogen approach: just create a message with all the tool_call outputs so they can be reused later. In Autogen, there is a specific message class to distinguish tool-call summaries from LLM-generated text. It’s called ToolCallSummaryMessage, and it stores the tool call results in string format. A very useful feature for speed optimization.
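The idea of collapsing tool outputs into a single reusable message can be sketched in a few lines. This helper is hypothetical, only loosely modeled on Autogen's summary-message concept, not its actual implementation:

```python
def summarize_tool_calls(results, separator="\n"):
    """Collapse (tool_name, output) pairs into one string message, so later
    agents or turns can reuse the results without re-running the tools."""
    return separator.join(f"{name}: {output}" for name, output in results)

summary = summarize_tool_calls([
    ("search", "3 papers found"),
    ("fetch_page", "page text..."),
])
print(summary)
# search: 3 papers found
# fetch_page: page text...
```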

@aristideubertas

You could probably also just use a decorator on a tool to signal that the tool's output should always be returned directly, without any interpretation by the model.
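A decorator like that could be a one-liner. Everything here is hypothetical (direct_result is not part of pydantic-ai); it just shows how a per-tool marker plus an agent-side check could work:

```python
def direct_result(fn):
    """Hypothetical decorator: mark a tool so the agent loop returns its
    output directly, skipping the reflection turn."""
    fn.__direct_result__ = True
    return fn

@direct_result
def get_weather(city: str) -> str:
    return f"sunny in {city}"

def dispatch_tool(fn, *args, **kwargs):
    """Agent-side check: if the tool is marked, its output is the final
    answer; otherwise it goes back to the model for interpretation."""
    output = fn(*args, **kwargs)
    is_final = getattr(fn, "__direct_result__", False)
    return output, is_final

print(dispatch_tool(get_weather, "Paris"))  # ('sunny in Paris', True)
```

This would give per-tool granularity, addressing the flexibility concern raised earlier: unmarked tools still feed their results back to the LLM.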

This is pretty core functionality and it is very odd that it hasn't been implemented yet. It is arguably more important than Graphs or other advanced features.

@WorldInnovationsDepartment WorldInnovationsDepartment changed the title Agent Capable of Running Tools Only (No Streaming Output) Disable reflection after tool use Mar 4, 2025
@DouweM

DouweM commented Apr 30, 2025

This should be addressed by #1463, which @dmontagu is working on. I'll close this until we determine it's not sufficient.

@DouweM closed this as not planned Apr 30, 2025