How to update graph state while preserving interrupts?

pawel-twardziak · October 7, 2025, 5:59pm

I am back
Question: do any threads have pending tasks (interrupts) in checkpointer tables?

tharshan-elvex · October 7, 2025, 6:16pm

Hello @pawel-twardziak, yes. I ran this query:

SELECT * FROM "public"."checkpoint_writes" WHERE "channel"::TEXT LIKE '%__interrupt__%' LIMIT 100 OFFSET 0;

It returns 62 rows, which I believe tells us there are 62 threads currently in the interrupt state.
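One caveat: the query counts rows in checkpoint_writes, not threads, so the two numbers match only if every thread holds exactly one interrupt write. A minimal sketch of counting distinct threads instead, assuming each fetched row can be reduced to a (thread_id, channel) pair. The thread_id column name is an assumption about the checkpointer schema, and the sample rows are made up:

```python
from collections import Counter

INTERRUPT_CHANNEL = "__interrupt__"


def interrupts_per_thread(rows):
    """Count interrupt writes per thread from (thread_id, channel) pairs."""
    return Counter(
        thread_id for thread_id, channel in rows if channel == INTERRUPT_CHANNEL
    )


# Made-up sample rows, for illustration only.
sample_rows = [
    ("thread-a", "__interrupt__"),
    ("thread-a", "__interrupt__"),
    ("thread-b", "__interrupt__"),
    ("thread-b", "messages"),
]

counts = interrupts_per_thread(sample_rows)
print("threads with interrupts:", len(counts))
print("interrupt rows in total:", sum(counts.values()))
```

If the two printed numbers differ on real data, some threads carry more than one interrupt write.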

pawel-twardziak · October 7, 2025, 6:26pm

Hi @tharshan-elvex
OK, thanks for the answer. I will continue investigating.

pawel-twardziak · October 8, 2025, 10:38pm

Hi @tharshan-elvex I am checking in here to keep you in the loop - I still can’t find any native way to proceed with the migration. Each approach wipes out all pending tasks so far.

I will surely inform you if any findings emerge

tharshan-elvex · October 10, 2025, 6:28pm

Thanks for the update @pawel-twardziak. What other options do we have if there is no native way?

pawel-twardziak · October 10, 2025, 7:35pm

Hi @tharshan-elvex

you mentioned that you already have a script that does the migration in the correct way, am I right?

tharshan-elvex · October 10, 2025, 7:36pm

There is an approach I’ve tried where we modify the LangGraph checkpointer tables directly. But I am not sure it’s a good idea, as we don’t fully understand the table structure and it’s not well documented.

pawel-twardziak · October 10, 2025, 7:46pm

You are absolutely right. Modifying the table directly is not the best pattern. But for now, I haven’t found any better solution. I might explore more in the near future.

There are some other ideas in my head, but I am short on time to give all of them a shot

tharshan-elvex · October 14, 2025, 3:45pm

@pawel-twardziak hey Pawel - did you have any other ideas as an alternative to the raw SQL? Maybe we could explore them together to find a solution.

pawel-twardziak · October 14, 2025, 4:54pm

hi @tharshan-elvex

I would love to, but as I mentioned - always short on time

What I actually want to explore and maybe draw some inspiration from are the tests:

Have you happened to dig in there?

tharshan-elvex · October 15, 2025, 1:10pm

@pawel-twardziak okay thanks for the links! I will take a look. The last two links you posted are the same btw. Did you mean to share another file?

pawel-twardziak · October 15, 2025, 1:22pm

Yes, sorry, thanks for pointing it out. It should be https://github.com/langchain-ai/langgraph/blob/8b55dff7a52540fece79f4ebf62d5c1457d72377/libs/langgraph/tests/test_interrupt_migration.py

In general, there are some testing files in the folder worth exploring imho

pawel-twardziak · October 20, 2025, 3:24pm

Hi @tharshan-elvex

Since the shallow checkpointer will not preserve interrupts during migration, I think this would be useful as a more native approach:

from langgraph.types import Interrupt
# 1) Capture existing interrupts
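saved = graph.checkpointer.get_tuple(cfg)  # cfg has thread_id (and optionally checkpoint_ns)
# pending_writes holds (task_id, channel, value) tuples; get_tuple returns None
# when the thread has no checkpoint, so guard against that case.
pending = saved.pending_writes if saved is not None else None
existing_interrupt_writes = [
    (task_id, val)
    for (task_id, chan, val) in (pending or [])
    if chan == "__interrupt__"
]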

# 2) Migrate your values (e.g., messages)
migrated = migrate_messages(graph.get_state(cfg).values["messages"])
next_cfg = graph.update_state(cfg, {"messages": migrated})  # returns config with new checkpoint_id

# 3) Re-attach interrupts to the new checkpoint (preserve original task_ids)
for task_id, interrupt_obj in existing_interrupt_writes:
    graph.checkpointer.put_writes(
        next_cfg,
        [("__interrupt__", interrupt_obj)],
        task_id=task_id,
    )

Inside migrate_messages, all AI and task messages can be modified as before.
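To check whether the interrupts actually survived, the pending writes before and after the migration can be compared. A minimal sketch, assuming pending writes are (task_id, channel, value) tuples as in the snippet above; the helper names and the sample data are hypothetical, not part of LangGraph:

```python
INTERRUPT_CHANNEL = "__interrupt__"


def interrupt_task_ids(pending_writes):
    """Return the task_ids that have an interrupt among (task_id, channel, value) writes."""
    return {
        task_id
        for task_id, channel, _value in (pending_writes or [])
        if channel == INTERRUPT_CHANNEL
    }


def missing_interrupts(before_writes, after_writes):
    """Task ids whose interrupt existed before the migration but not after it."""
    return interrupt_task_ids(before_writes) - interrupt_task_ids(after_writes)


# Made-up data, for illustration only.
before = [
    ("task-1", "__interrupt__", {"question": "approve?"}),
    ("task-1", "messages", ["..."]),
]
after_ok = [("task-1", "__interrupt__", {"question": "approve?"})]
after_lost = []

print(missing_interrupts(before, after_ok))
print(missing_interrupts(before, after_lost))
```

With the migration snippet, before_writes would be saved.pending_writes and after_writes would be graph.checkpointer.get_tuple(next_cfg).pending_writes. An empty result only shows that the interrupt writes are attached to the new checkpoint; it does not prove that the thread can still be resumed.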

tharshan-elvex · October 21, 2025, 5:10pm

Hey @pawel-twardziak

I’ve tried this approach. A new checkpoint does get generated, but some of the old state stays behind in the old checkpoints. Here is the final state of the checkpoint_writes table.

So this causes the new migrated interrupt to be orphaned.

Were you able to validate this approach working and leaving a valid checkpoint state?

tharshan-elvex · October 21, 2025, 6:07pm

Let me clarify with a before and after of the migration with this new approach:

Before:

(I can only post one image at a time, so I am posting one image in this post.)

tharshan-elvex · October 21, 2025, 6:09pm

After:

The branch:to:llm write is lost
Two interrupt channels exist, instead of the new one replacing the old one
The task_path of the messages channel is lost
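These three differences can also be detected programmatically rather than by eye. A minimal sketch, assuming the before and after snapshots of checkpoint_writes are available as lists of dicts with channel and task_path keys; the helper names, the placeholder task_path value, and the sample rows are all made up:

```python
from collections import Counter


def summarize(rows):
    """Count rows per channel and collect the task_path values seen per channel."""
    counts = Counter()
    paths = {}
    for row in rows:
        counts[row["channel"]] += 1
        paths.setdefault(row["channel"], set()).add(row.get("task_path") or "")
    return counts, paths


def diff_writes(before, after):
    """Report channels that vanished, grew, or lost a task_path after migration."""
    b_counts, b_paths = summarize(before)
    a_counts, a_paths = summarize(after)
    return {
        "lost_channels": sorted(set(b_counts) - set(a_counts)),
        "grown_channels": sorted(
            c for c in a_counts if a_counts[c] > b_counts.get(c, 0)
        ),
        "lost_task_paths": sorted(
            c for c in b_paths if c in a_paths and b_paths[c] - a_paths[c]
        ),
    }


# Made-up rows mirroring the three observations above.
before = [
    {"channel": "branch:to:llm", "task_path": "some-path"},
    {"channel": "__interrupt__", "task_path": "some-path"},
    {"channel": "messages", "task_path": "some-path"},
]
after = [
    {"channel": "__interrupt__", "task_path": "some-path"},
    {"channel": "__interrupt__", "task_path": "some-path"},
    {"channel": "messages", "task_path": ""},
]

report = diff_writes(before, after)
print(report)
```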
