Custom retry logic

hpihkala · September 4, 2025, 2:49pm

In LangChain JS, it looks like the logic to use p-retry is pretty hard-coded in the LangChain runnables. For example here. This seems very non-customizable. p-retry supports additional parameters, such as a shouldRetry function, but there’s no way to pass that to LangChain to customize the behavior.

Use case example: Use a different retry interval when getting a 429 response vs. some other error code.

Lacking the above configurability, what I’m trying to do is set maxRetries: 0 on the model and do my own retrying when the error bubbles up to my application. However, this doesn’t work well when the model is used as a part of a longer chain, for example when passing it to an agent (for example with createReactAgent). In this case, any error will fail the whole agent, and I have no possibility to retry an individual operation.

A solution could be to wrap individual methods of the model in a retry handler, for example overriding invokewith something that wraps the original in a retry:

export class RetryingAzureChatOpenAI extends AzureChatOpenAI {

    constructor(fields?: AzureChatOpenAIFields) {
        super(fields)
    }

    invoke(input: BaseLanguageModelInput, options?: ChatOpenAICallOptions | undefined): Promise<AIMessageChunk> {
        return withRetry(() => super.invoke(input, options))
    }
}

I guess the above could work, but I’m not sure what’s the complete interface to override. There’s at least stream and batch methods in addition to invoke. I suppose wrapping all of them could achieve what I want - but what’s the complete set of methods that can be used to invoke the model?

Any help on this is appreciated. Many thanks

hntrl · September 16, 2025, 12:22am

Hey @hiphkala!

Would you mind opening an issue for this on the langchainjs repo?

Topic		Replies	Views
3rd party rate limit handling(OpenAI-level rate limiting in LangChain.js) Talking Shop	2	39	June 2, 2026
Issue with memory leak in retry mechanism LangChain js-help	1	371	November 23, 2025
Langchain model providing adapters Deep Agents self-hosted , cloud , python-help , feature-request	2	25	June 22, 2026
LangChain JS + OpenAI Responses API: stateless GPT-5 reasoning messages cause `400 …reasoning without its required following item` LangChain js-help	6	1085	September 30, 2025
@langchain/aws v1.0 ChatBedrockConverse: ‘Maximum tokens exceeds model limit’ LangChain js-help	1	338	October 27, 2025

Custom retry logic

Related topics