Hey, guys:
I am using LangChain v1 to develop a multi-agent system, following the new tool-calling architecture in the v1 docs. In this architecture, the supervisor agent makes the final response. Is there a way for a sub-agent to respond to the user's question directly? I am asking because, for longer responses, having the supervisor re-generate what a sub-agent already produced is going to be pretty slow.
Any suggestions are appreciated!
Hi @DataNoob0723
Yes. You don’t have to make the supervisor re-generate what a worker produced. In LangGraph v1 you can either stream the sub-agent’s tokens directly to the UI, let a worker be a terminal node (pass-through final), or use an interrupt to surface the worker’s message to the user. All three avoid the supervisor re-writing long outputs.
- Stream the worker's tokens directly (recommended for latency):
  - Use the graph's event stream and forward `on_chat_model_stream` events from the worker node to the client while it's running. The supervisor can still own control flow, but you don't re-generate text; you simply surface the worker's stream to the user.
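To make the filtering step concrete, here is a minimal sketch. The event dicts below are hand-written stand-ins shaped like what `graph.astream_events(..., version="v2")` yields; the node name `"worker"` and the simulated chunks are assumptions for illustration.

```python
# Sketch: forward only the worker node's model tokens to the UI.
# The event shape mirrors astream_events v2 output; we simulate a few
# events with plain dicts so the filtering logic is clear.

def worker_tokens(events, node_name="worker"):
    """Yield only token chunks produced by the given node's chat model."""
    for event in events:
        if (event["event"] == "on_chat_model_stream"
                and event.get("metadata", {}).get("langgraph_node") == node_name):
            yield event["data"]["chunk"]

# Simulated event stream: supervisor tokens are skipped, worker tokens surface.
fake_events = [
    {"event": "on_chat_model_stream",
     "metadata": {"langgraph_node": "supervisor"},
     "data": {"chunk": "routing..."}},
    {"event": "on_chat_model_stream",
     "metadata": {"langgraph_node": "worker"},
     "data": {"chunk": "Hello"}},
    {"event": "on_chat_model_stream",
     "metadata": {"langgraph_node": "worker"},
     "data": {"chunk": " world"}},
]

print("".join(worker_tokens(fake_events)))  # -> "Hello world"
```

In a real app you would apply the same filter to the async iterator returned by the compiled graph and write each chunk to your response stream.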
- Pass through a "final answer" from a worker (no re-generation):
  - Define a convention/tool like `final_answer(text)` that a worker can call when it has the user-facing answer. Your graph logic detects this and ends the graph (or returns immediately) with that text. The supervisor then acts as a router/arbiter only and doesn't paraphrase.
  - Practically, add an edge from that worker to `END`, or have a small function node that copies the worker's content to the final return value without invoking another LLM.
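Here is a minimal sketch of that detection logic, using plain dicts for messages. The tool name `final_answer` and the message shape are illustrative assumptions; in a real graph you would wire a function like this in via `add_conditional_edges("worker", route_after_worker)`.

```python
# Sketch: route to END when the worker signalled a final answer, otherwise
# hand control back to the supervisor. Messages are plain dicts for clarity.

END = "__end__"  # stand-in for LangGraph's END sentinel

def route_after_worker(state):
    """Short-circuit to END if the last message carries a final_answer call."""
    last = state["messages"][-1]
    for call in last.get("tool_calls", []):
        if call["name"] == "final_answer":
            return END
    return "supervisor"

# Worker produced the user-facing answer: short-circuit to END.
done = {"messages": [{"role": "ai",
                      "tool_calls": [{"name": "final_answer",
                                      "args": {"text": "Long answer..."}}]}]}
print(route_after_worker(done))      # -> "__end__"

# Worker only produced an intermediate step: go back to the supervisor.
partial = {"messages": [{"role": "ai", "tool_calls": []}]}
print(route_after_worker(partial))   # -> "supervisor"
```

The key point is that no model is invoked on the END path: the worker's text is surfaced as-is.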
- Use an interrupt to surface a worker response to the UI (human-in-the-loop style):
  - A worker can call `interrupt(...)` to yield control and surface content to the application. The app shows that message to the user, optionally collects input, and then resumes the graph. This is useful if you want an explicit "sub-agent speaks now" step.
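The interrupt pattern boils down to: pause mid-run, hand a payload to the app, then resume with whatever the user supplied. This plain-Python generator models that pause/resume handshake; in LangGraph the same roles are played by `interrupt(...)` inside the node and `Command(resume=...)` when re-invoking the graph. The draft text and feedback strings are made up for illustration.

```python
# Sketch: a generator modelling the interrupt/resume handshake.

def worker_node():
    # Pause and surface the draft to the user; execution stops here until
    # the app sends a reply back in.
    feedback = yield {"draft": "Here is my long answer..."}
    yield {"final": f"Noted the feedback: {feedback}"}

run = worker_node()
shown = next(run)                # graph "interrupts": app displays shown["draft"]
result = run.send("looks good")  # app "resumes" with the user's input
print(result["final"])           # -> "Noted the feedback: looks good"
```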
- Topology tweak: make the worker a terminal node:
  - In a supervisor pattern, you don't always have to return to the supervisor. You can route `worker -> END` so the worker is the final responder when appropriate. The supervisor remains a router, not a re-writer.
Notes:
- You can combine streaming with any of the above so the user sees tokens as they are generated by the worker.
- If you still want a single "final message" object at the end, keep the supervisor node but only pass through the worker's content (no model call), or use the `END` edge from the worker.
Thanks a lot for the insights!
I wonder: what if the supervisor needs more than one sub-agent to achieve the task? Would a conditional node that decides whether to go back to the supervisor be better, or is there a better way for agents to collaborate in a supervisor topology? Should it be a supervisor or a swarm in such a case?