How do I integrate vLLM and SGLang with LangChain?

I saw in the official LangChain documentation that they use ChatOpenAI to call a model launched locally via vLLM (through its OpenAI-compatible server). However, I'm not sure whether this method works for all models served via vLLM.

Additionally, I'm uncertain whether this approach is the most up-to-date and stable one.

I have the same question regarding SGLang.

Another important question: if I call a 72B model through a hosted API, versus deploying the same 72B model myself via vLLM and calling it with the method above, will there be any difference in tool-calling and structured-output performance?