WebSocket + Stream Connection Keeps Disconnecting on LangGraph Platform

Issue Summary

Experiencing persistent connection stability issues on LangGraph Platform production deployment for 6+ days.

Technical Details

Stream Connections (/runs/{id}/stream)

  • net::ERR_CONNECTION_CLOSED during active streaming
  • Retry pattern: ECONNREFUSED → nginx 404 → success
  • Multiple drops per session

WebSocket Connections

  • ECONNREFUSED and ECONNRESET errors during active connections
  • Tool calls timeout, require page refresh to recover

Key Observations

  • Works perfectly: Local development (langgraph dev) and self-hosted
  • Fails consistently: LangGraph Platform only
  • WebSocket disconnects every 5-10 minutes on platform
  • nginx 404s suggest ingress-level issues
  • Both connection types affected simultaneously

Troubleshooting Attempted

  • Client-side pinging every 30s with ws.ping() - still disconnecting
  • Confirmed no deployments during disconnection times
  • No custom timeout configurations

Additional Issue

After package upgrades: disable_streaming=True setting ignored on platform but respected locally, causing duplicate message chunks.

Questions

  1. Known production infrastructure stability issues?
  2. Expected connection stability SLAs?
  3. Recommended WebSocket timeout configurations?

Has anyone experienced similar platform-specific connectivity issues? Any workarounds or configuration recommendations?

Hey, thank you for flagging these. We’ve seen a dramatic increase load over the last couple weeks, and as a result a few more intermittent errors such as this one. We’re actively working on this and will follow up later with any update(s).