Make ToolCallLimitMiddleware proactive via before_model hook

29swastik · March 14, 2026, 4:45pm

The `ToolCallLimitMiddleware` is currently reactive: the limit logic runs only after the limit has been reached. This has a few downsides:

1. The LLM may use tools less effectively because it is not aware of the tool limit from the beginning. When the limit is reached, we panic the LLM by suddenly sending a "tool call limit reached" message that it had no warning of.
2. Detecting the limit reactively leads to an additional tool call that doesn't get used.

Use Case

The LLM can plan ahead if it is aware of the tool call limit, and MAYBE converge better. Compare this with how a human would act in real life: in a game, if we know how many retries or tries are left, we plan accordingly, right?

Proposed Solution

This can be implemented using the `@before_model` hook, which passes an additional message telling the LLM how many tool calls are left.

[
    HumanMessage(content='Get all current affairs across the globe'),
    HumanMessage(content=f'You are left with {tool_calls_left} tool calls, plan accordingly'),  # inserted before model call
    AIMessage(content=...),
    ToolMessage(content=...),
    HumanMessage(content=f'You are left with {tool_calls_left} tool calls, plan accordingly'),  # inserted before next model call
    AIMessage(content=...),
    ...
]
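To make the idea concrete, here is a minimal, framework-free sketch of what the hook body could do. The `Message` dataclass, `count_tool_calls`, and the plain `before_model` function are hypothetical stand-ins written for illustration, not LangChain's classes or hook signature, and counting tool-result messages is just one assumed way to track calls made so far.

```python
from dataclasses import dataclass


@dataclass
class Message:
    """Hypothetical stand-in for a chat message (not a LangChain class)."""

    role: str  # "human", "ai", or "tool"
    content: str


def count_tool_calls(messages: list[Message]) -> int:
    """Assumption: each tool-result message corresponds to one tool call made."""
    return sum(1 for m in messages if m.role == "tool")


def before_model(messages: list[Message], max_tool_calls: int) -> list[Message]:
    """Return the messages to send to the model, with a budget notice appended."""
    tool_calls_left = max(max_tool_calls - count_tool_calls(messages), 0)
    notice = Message(
        role="human",
        content=f"You are left with {tool_calls_left} tool calls, plan accordingly",
    )
    # Build a new list so the stored history is not mutated.
    return [*messages, notice]


history = [
    Message("human", "Get all current affairs across the globe"),
    Message("ai", "Calling the search tool"),
    Message("tool", "search results"),
]
outgoing = before_model(history, max_tool_calls=3)
print(outgoing[-1].content)  # You are left with 2 tool calls, plan accordingly
```

In a real middleware the same counting would presumably read from the agent state rather than re-scan the history on every turn.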

Alternatives Considered

The proposed solution requires no additional configuration through constructor parameters, and the remaining-tool-calls message is sent after each iteration. We can make a few enhancements to it:

1. Introduce new attributes on the `ToolCallLimitMiddleware` class that let developers configure the behaviour of the remaining-tool-calls message sent to the LLM.

# Illustrative sketch of the proposed options, not the current class definition.
class ToolCallLimitMiddleware(AgentMiddleware):  # middleware base class, not a callback handler
    def __init__(
        self,
        max_tool_calls: int = 50,
        proactive: bool = True,  # New flag
        warning_threshold: int = 5,  # Warn when N calls left
    ):
        self.max_tool_calls = max_tool_calls
        self.proactive = proactive
        self.warning_threshold = warning_threshold

2. The remaining-tool-calls message is sent as a `HumanMessage`; if this is not the right message type, we can consider `ToolMessage`.
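As a rough sketch of how the `proactive` and `warning_threshold` options from the first alternative might gate the message: the helper name `budget_notice` is hypothetical, and reading "warn when N calls left" as "warn once N or fewer calls remain" is my assumption rather than anything specified above.

```python
from typing import Optional


def budget_notice(
    calls_made: int,
    max_tool_calls: int = 50,
    proactive: bool = True,
    warning_threshold: int = 5,
) -> Optional[str]:
    """Return the notice text to inject, or None if nothing should be sent."""
    if not proactive:
        # Proactive mode disabled: keep today's purely reactive behaviour.
        return None
    calls_left = max(max_tool_calls - calls_made, 0)
    if calls_left > warning_threshold:
        # Assumed semantics: stay silent until the budget is nearly used up.
        return None
    return f"You are left with {calls_left} tool calls, plan accordingly"


print(budget_notice(calls_made=10))  # None
print(budget_notice(calls_made=46))  # You are left with 4 tool calls, plan accordingly
```

Setting `warning_threshold` equal to `max_tool_calls` would reproduce the "send after each iteration" behaviour of the original proposal.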

I’ve also created a GitHub issue for this. If this approach makes sense, I'll be happy to raise the PR!

github.com/langchain-ai/langchain

Make ToolCallLimitMiddleware proactive via before_model hook

opened 04:38PM - 11 Mar 26 UTC

29swastik

langchain feature request external


pawel-twardziak · March 14, 2026, 5:40pm

this is a nice and useful feature
