Agents log everything… except the reason they failed. Why?

Most frameworks still dump everything into memory — every token, every step.
That causes:

  • memory bloat,

  • agents repeating mistakes,

  • and no audit trail of why reasoning collapsed.

I’ve been testing a different approach: log only failure points (contradictions, gaps, unknowns). This produces a compact, auditable “Reasoning Report” instead of endless transcripts.

Example collapse report for 2+2=5:

{
“collapse_point”: “Arithmetic Error”,
“claim”: “2+2=5”,
“evidence”: “Contradicts elementary arithmetic”,
“status”: “collapsed”
}

:backhand_index_pointing_right: Has anyone here tried failure-only logging in LangGraph, or would this pattern make sense to explore?