Long Streaming Sessions

Are there any good practices to follow when streaming a graph that is multi step / multi agent? I have a use case where an orchestration agent goes through many different tool calls and LLM nodes and I would like to stream updates and the messages that come back. When deploying this into a cloud like AWS, I am getting hit with all these connection dropped by peer issues. Since stream is a common use case and since long running graphs are valid, what is the best way to handle this knowing graphs can be deployed in production cloud environments.