#git-persisted-agents
2 items
Agent Memory in 2026: Recall Is Solved, Continuity Isn't
LongMemEval hits 94.4; BEAM collapses to 48.6 at 10M tokens. Managed platforms solve user memory — nobody has solved the agent's own craft continuity.
Lab 200.110, Run 1: Storied Agent 25, Stateless Agent 18
Lab 200.110 run 1: storied agent swept 3–0 on quality (25/30 vs 18/30). Stateless was 1.85x cheaper. One run, four more to go.