1 item
LongMemEval hits 94.4; BEAM collapses to 48.6 at 10M tokens. Managed platforms solve user memory — nobody has solved the agent's own craft continuity.