Thoughts on preserving timestamps & speaker metadata in LangChain audio transcripts?

Hey all :waving_hand:

While playing with LangChain’s local Whisper audio parser, I noticed that timestamps (and any speaker info) don’t really survive once audio becomes Documents — everything ends up flattened into plain text.

That makes things like time-based search, jump-to-audio playback, or per-speaker summaries a bit awkward.

I’m thinking about exploring a small, backward-compatible way to preserve start/end timestamps (and optional speaker metadata) directly in Document.metadata. I’d genuinely love to work on this if it’s useful to others.
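To make the idea concrete, here’s a rough sketch of the metadata shape I have in mind. Plain dicts stand in for LangChain `Document` objects here, and the field names (`start_seconds`, `end_seconds`, `speaker`, and the Whisper-style segment keys) are my own assumptions, not an existing API:

```python
def segments_to_documents(segments):
    """Turn Whisper-style segments into per-segment 'documents',
    keeping start/end timestamps and optional speaker in metadata.
    (Dicts stand in for langchain Document; names are hypothetical.)"""
    docs = []
    for seg in segments:
        metadata = {
            "start_seconds": seg["start"],
            "end_seconds": seg["end"],
        }
        # Speaker info is optional -- only attach it when present.
        if seg.get("speaker") is not None:
            metadata["speaker"] = seg["speaker"]
        docs.append({"page_content": seg["text"], "metadata": metadata})
    return docs


segments = [
    {"start": 0.0, "end": 4.2, "text": "Hello and welcome.", "speaker": "SPEAKER_00"},
    {"start": 4.2, "end": 9.7, "text": "Today we look at parsers."},
]
docs = segments_to_documents(segments)
```

Because everything lives in `metadata`, existing code that only reads `page_content` would keep working unchanged, while time-based filtering or per-speaker grouping becomes a simple metadata query.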

Curious if anyone else has hit this, or has thoughts on whether this belongs in LangChain vs downstream.

@Cosmos-Atom Are you referring to the following parser: OpenAIWhisperParserLocal — 🦜🔗 LangChain documentation?