No cost displayed in LangSmith when using LiteLLM + LangGraph

VettHor · April 16, 2026, 6:39pm

Hey, I’m using ChatLiteLLMRouter with LangGraph. Token count is tracked correctly in LangSmith, but cost is never shown - for any model (OpenAI, Anthropic, Gemini).

I tried adding the LangSmith callback as described here:

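import litellm
from langchain_litellm import ChatLiteLLMRouter

# Enable LiteLLM's built-in LangSmith logging callback
litellm.success_callback = ["langsmith"]

# `litellm_router` is a litellm.Router instance configured elsewhere
llm = ChatLiteLLMRouter(  # then use `llm` in my graph
    router=litellm_router,
    model="heavy",
    temperature=0,
    streaming=True,
)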

This does show tokens + cost, but creates separate traces for each LLM call - outside my graph trace. And it’s not even consistent - as you can see in the screenshot, the first separate trace (Gemini) shows no cost at all, while the rest (Anthropic) do:

I don’t want that. What I want is the standard LangSmith behaviour:

Root graph run shows total tokens + total cost
Drilling into the run shows per-step token/cost breakdown

Do I need to write custom callbacks to achieve this? Any tips on how to do it? Is there a built-in way? Should I do something like what is described here?

Thanks in advance!

pawel-twardziak · April 16, 2026, 7:46pm

hi @VettHor

The docs page you linked - “a. Set a usage_metadata field on the run’s metadata” - is the right mental model, but it’s only step 1 of the cost pipeline: run.set(usage_metadata=...) only delivers tokens (or, optionally, raw input_cost/output_cost); it doesn’t price anything by itself. LangSmith prices a run only when three things are present together:

a usage_metadata block
the pair ls_provider + ls_model_name on the run
a matching regex row in Settings → Workspace → Models (Cost tracking, Metadata reference)

ChatLiteLLMRouter never emits the ls_* pair (no _get_ls_params() override), and its response_metadata.model_name is the router group (e.g. "heavy"), not the model LiteLLM actually dispatched to. So LangSmith sees tokens with nothing to price them against, which is why cost is blank and why any occasional Anthropic figure is a misleading partial match, not real tracking.

The fix is to stop using litellm.success_callback = ["langsmith"] (it posts sibling traces outside your LangGraph tree) and instead apply that docs pattern from inside LangChain’s callback lifecycle. The options are:

(a) subclass the router and implement _get_ls_params() so that it reads the real model from LiteLLM’s response["model"] in _create_chat_result, then add matching pricing rows in the workspace
(a′) attach a BaseCallbackHandler whose on_llm_end patches the run’s metadata with ls_provider/ls_model_name - this is the direct LangGraph equivalent of the snippet on the page you linked
(b) for non-linear pricing, compute the dollar amount with litellm.completion_cost() and write it onto the current run via get_current_run_tree().set(usage_metadata={"input_cost": ..., "output_cost": ...})

All three keep everything inside one nested LangGraph trace with correct per-node cost aggregation.
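To make option (b) a bit more concrete, here is a minimal sketch of the shape of the data being written onto the run. The helper names (price_usage, attach_to_current_run) and the per-million-token prices are assumptions made up for illustration; in a real setup the dollar amounts would come from litellm.completion_cost() as described above, not from hard-coded rates. The langsmith import is deferred so the pricing helper can be tried without the package installed.

```python
def price_usage(
    input_tokens: int,
    output_tokens: int,
    usd_per_mtok_in: float,   # hypothetical price per 1M input tokens
    usd_per_mtok_out: float,  # hypothetical price per 1M output tokens
) -> dict:
    """Build a usage_metadata dict that carries tokens plus explicit costs."""
    return {
        "input_tokens": input_tokens,
        "output_tokens": output_tokens,
        "total_tokens": input_tokens + output_tokens,
        "input_cost": input_tokens * usd_per_mtok_in / 1_000_000,
        "output_cost": output_tokens * usd_per_mtok_out / 1_000_000,
    }


def attach_to_current_run(usage_metadata: dict) -> bool:
    """Write usage onto the active LangSmith run, if there is one.

    Returns True when a run was found and updated, False otherwise.
    Assumes this is called from inside a traced function or graph node.
    """
    from langsmith import get_current_run_tree  # needs `langsmith` installed

    run = get_current_run_tree()
    if run is None:
        return False
    run.set(usage_metadata=usage_metadata)
    return True


if __name__ == "__main__":
    # Illustrative numbers only; the prices are not real rates.
    print(price_usage(1000, 500, usd_per_mtok_in=3.0, usd_per_mtok_out=15.0))
```

Because the costs are supplied explicitly, this route should not depend on a matching pricing row in the workspace, which is the point of using it for non-linear pricing.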

dualdust · July 7, 2026, 3:10pm

I would separate this into two layers:

provider-level accounting: what the LiteLLM router knows about the concrete model, input tokens, output tokens, and cost
graph-level accounting: how those numbers are attached back to the LangGraph run/span you actually care about

If tokens are appearing but cost is missing, I would first check whether the model alias resolves to a pricing-aware model name before it reaches LangSmith. A lot of cost dashboards break at the alias layer: “heavy” is meaningful to the router, but not necessarily meaningful to the tracing layer.
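A quick way to run that check is a small guard that refuses to pass a router alias through as a model name. The sketch below assumes the concrete model string looks like "provider/model" (a common LiteLLM convention, though not every response carries the prefix), and ROUTER_ALIASES and ls_params_from_model are hypothetical names, not part of LiteLLM or LangSmith.

```python
# Hypothetical set of router group names; fill in from your own router config.
ROUTER_ALIASES = {"heavy", "light"}


def ls_params_from_model(model: str) -> dict:
    """Turn a concrete model string into the ls_provider / ls_model_name pair.

    Raises ValueError when handed a router alias, so the alias never
    silently reaches the tracing layer.
    """
    if model in ROUTER_ALIASES:
        raise ValueError(f"{model!r} is a router alias, not a concrete model")

    provider, sep, name = model.partition("/")
    if not sep:
        # No provider prefix present: mark the provider as unknown
        # rather than guessing it from the model name.
        return {"ls_provider": "unknown", "ls_model_name": model}
    return {"ls_provider": provider, "ls_model_name": name}


if __name__ == "__main__":
    print(ls_params_from_model("anthropic/claude-3-5-sonnet-20241022"))
    print(ls_params_from_model("gpt-4o"))
```

Failing loudly on the alias is a deliberate choice here: a blank cost column is easy to miss, while an exception in a test run is not.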

The pattern I would try is to attach normalized usage metadata at the node boundary, not only through the global LiteLLM callback. That keeps the cost attribution inside the graph trace and avoids having separate LLM traces floating outside the run.
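As one possible reading of "normalized", the sketch below maps OpenAI-style usage keys (prompt_tokens / completion_tokens, which is the shape LiteLLM responses generally use) onto input_tokens / output_tokens. The function name normalize_usage is hypothetical, and where exactly it gets called inside a node depends on how your graph is wired.

```python
def normalize_usage(raw: dict) -> dict:
    """Map provider-style usage keys onto input/output token keys.

    Accepts either OpenAI-style keys (prompt_tokens, completion_tokens)
    or already-normalized keys (input_tokens, output_tokens). Missing
    counts default to 0.
    """
    input_tokens = raw.get("input_tokens", raw.get("prompt_tokens", 0))
    output_tokens = raw.get("output_tokens", raw.get("completion_tokens", 0))
    return {
        "input_tokens": input_tokens,
        "output_tokens": output_tokens,
        # Recompute the total so it always agrees with the two parts.
        "total_tokens": input_tokens + output_tokens,
    }


if __name__ == "__main__":
    print(normalize_usage({"prompt_tokens": 120, "completion_tokens": 30}))
```

The result could then be attached to the current run from inside the node, so the numbers stay in the graph trace instead of a separate one.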

For production, I would also log cost by workflow/user/customer at the same time as the LangSmith trace id. The trace answers “where did tokens go?” but the product question is usually “which workflow or tenant made this expensive?”
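A minimal sketch of what that side log could look like, using only the standard library. The CostRecord fields and the total_by helper are illustrative assumptions, and the trace ids and dollar amounts are made-up sample data; a real system would write these rows to a database or metrics store.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class CostRecord:
    trace_id: str   # LangSmith trace id, kept for drill-down
    tenant: str     # customer or user the call was made for
    workflow: str   # which graph or workflow produced the call
    cost_usd: float


def total_by(records: list[CostRecord], field: str) -> dict[str, float]:
    """Sum cost_usd grouped by one CostRecord field (e.g. 'tenant')."""
    totals: dict[str, float] = defaultdict(float)
    for record in records:
        totals[getattr(record, field)] += record.cost_usd
    return dict(totals)


if __name__ == "__main__":
    # Made-up sample rows for illustration.
    log = [
        CostRecord("trace-1", "acme", "summarize", 0.5),
        CostRecord("trace-2", "acme", "research", 0.25),
        CostRecord("trace-3", "globex", "summarize", 0.25),
    ]
    print(total_by(log, "tenant"))
    print(total_by(log, "workflow"))
```

Keeping the trace id on every row means an expensive tenant or workflow can be traced back to the specific runs that drove the cost.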
