Just open sourced ArkSim a testing framework for LangChain/LangGraph Agents.
ArkSim simulates multi-turn conversations with diverse synthetic users. It is meant to detect and capture issues early on before they hit production. There’s currently integration examples for LangChain/LangGraph.
repo: arksim/examples/integrations/langchain at main · arklexai/arksim · GitHub
docs: https://docs.arklex.ai/
Happy to answer any questions and would love feedback from people currently working on agents!