Hi everyone,
I’m currently running the LangGraph Agent Server in a self-hosted setup (free version), using the langchain/langgraph-api Docker image.
From the scaling documentation, I understand that in a production setup there is a separation between:
-
API servers (handling HTTP requests)
-
Queue workers (executing runs asynchronously via Redis)
However, in my case I only have a single Docker image configured via environment variables.
My question is:
How can I run the Agent Server in “worker mode” using the Docker image?
More specifically:
-
Is there a specific command, entrypoint, or environment variable to start the container as a queue worker instead of an API server?
-
Or is this mode only available in the LangSmith / managed or Helm-based deployments?
-
If worker mode is supported, what is the expected way to connect it to Redis and have it consume jobs?
For context:
-
I already have Redis configured (
REDIS_URI) -
Currently, my container handles both API and execution
I’m trying to understand how to properly separate API and worker responsibilities in a self-hosted setup.
Thanks a lot!