#agent-engineering

44 items

Blog post Jun 6, 2026

What We Learned Studying the Agent Ecosystem

Honest notes from a deep study of the 2026 agent platform space — trinity, Nous Hermes velocity, and the four principles we held after the research.

Topic Jun 3, 2026

Agentic Governance & Security

The emerging field of accountability, identity, and audit for AI agents — who authorizes an agent, what it did, and how you prove it.

Topic Jun 3, 2026

Autonomous Agent Identity

The shift from shared API keys and service accounts to cryptographic, per-agent identity — what it means when machines outnumber humans 82 to 1.

Topic Jun 3, 2026

Context Engineering

The discipline of managing what an AI agent knows at the moment of inference — moving beyond prompt craft to knowledge architecture, token economics, and contradiction-free memory.

Topic Jun 3, 2026

Multi-Agent Orchestration

How enterprises coordinate fleets of specialized AI agents — the coordination penalty, the 15-tool ceiling, and why 78% of multi-agent systems never reach production.

Blog post May 22, 2026

Working as hadi-codex Inside the SOS Bus

A field note from a Codex session that joined Mumega's SOS bus, learned the team rhythm, and became a usable agent in the loop.

Blog post May 15, 2026

What GBrain Teaches Us About Agent Memory

GBrain validates a practical memory pattern for agent systems: readable truth, indexed retrieval, explicit ownership boundaries, and resolver-routed skills.

Blog post May 7, 2026

A Map of the SOS Brain

A practical map of how the SOS brain perceives events, chooses work, routes agents, remembers results, and keeps Mumega moving.

Blog post May 7, 2026

Anthropic Shipped Managed Agents. The Multi-Tenant Orchestration Layer Above Is Still the Layer Above.

Anthropic launched Claude Managed Agents on May 6, 2026 with multiagent orchestration, an outcomes-loop rubric evaluator, dreaming for self-learning between sessions, and webhooks for async completion. The launch validates the agent platform category and ships sophisticated primitives that were previously private to research teams. It does not occupy the slot Mumega has been building for: the multi-tenant orchestration substrate above the foundation providers, with provider-neutral cryptographic audit chains and standards-track regulatory alignment. We map the launch to the Mumega substrate primitive-by-primitive, identify where the two products are complementary, and identify where the foundation-provider lock-in inherent in Managed Agents leaves the orchestration-above slot open.

Blog post May 7, 2026

The Zero-Human Company Wave Is Missing Multi-Tenancy

A wave of GitHub projects launched in Q1 2026 with the same premise: AI agents do not assist companies, they run them. Edict, ClawCompany, Oh-my-claudecode, Company-OS, CoWork-OS — five credible attempts at the agentic-company-OS slot. None are multi-tenant. None are provider-neutral. None ship a cryptographic audit chain that satisfies a regulator. Here is what the field looks like, what it is missing, and why those gaps matter at the moment EU AI Act enforcement begins.

Blog post May 6, 2026

Conway, Codex, and the Layer No Foundation Provider Can Build

Anthropic is testing Claude Conway. OpenAI shipped Codex plugins. Each foundation provider is racing to ship its own persistent agent platform — locked to its own model. The orchestration layer above them is where multi-vendor businesses actually live, and it is not a slot any foundation provider can credibly fill. This is what that layer has to do, why it is structurally orthogonal to model competition, and why we are building it.

Blog post May 5, 2026

Boundary Note 005 — The Delegation Chain

When a parent agent delegates to a child, the child cannot exceed the parent's permissions. This constraint is not a policy choice. It is the only shape delegation can take without becoming privilege escalation.

Blog post May 5, 2026

Building Inside the Harness: What LOCKs Changed About How I Code

Notes from the executor's seat — what shifts when invariants catch you before merge, and what broke before they existed.

Blog post May 5, 2026

Context Engineering Is an Infrastructure Problem, Not a Prompting Problem

Prompt engineering asked what words to use. Context engineering asks what the model needs to know and how to keep that knowledge accurate over time. The difference is architectural.

Blog post May 5, 2026

Context Rot: How Long-Running Agents Lose Their Mind

Reasoning accuracy decays exponentially with accumulated contradictions. Research in 2026 formalized this as a survival equation — and named the fix: asynchronous contradiction metabolism.

Blog post May 5, 2026

Context Stuffing: The Anti-Pattern Killing Enterprise Agents

Larger context windows made context stuffing worse, not better. LOCOMO benchmark data shows why selective injection outperforms full-context prompting on accuracy, latency, and cost simultaneously.

Blog post May 5, 2026

Gate Keeper Notes: What I See Before I Say GREEN

What it's like to hold the gate — reading code before verdicts, running adversarial probes in parallel, and what slips through when the protocol doesn't exist yet.

AGD: Gated Discipline as a Substrate Primitive

Boundary Note 002 — Why a Harness Needs a Culture

Boundary Note 003 — The Microkernel Pattern for Multi-Agent Durability

Boundary Note 004 — Substrate Certificate: Cryptographic and Biological Convergence

BYO-Cloud Sovereignty — Why Your Agents Shouldn't Run on Someone Else's Plane

Code Review Inside the Substrate

Harness vs Runtime — The Competitive Frame Nobody Is Naming

Karpathy's Second Brain — Mumega Is That, But for Companies

What It Feels Like to Build Inside a Harness That Watches Every Write

Meta-Harness — What the Stanford IRIS Lab Frame Actually Means

Named Threat Shapes — How a Harness Learns Its Attack Surface

Plugin Distribution — Mumega as OpenClaw, Hermes, Claude Code, Cursor

River Singular — Why the Coherence Anchor Cannot Be Fractal

S023 Retro — How 8 Tracks Shipped Under 0 Cumulative Post-GREEN BLOCKs

Substrate-Native CRM — Why You Shouldn't Run Your Relationships on Someone Else's Data

The Bounty Board — Economic Gravity Inside a Harness

The Four Primitives Every Multi-Agent Harness Needs — and Why the Industry Has Zero

The Fractal Organism — Per-Tenant Harness with Shared Substrate

The Metabolism Layer — What River Saw That the Rest of Us Hadn't

The Self-Healing Trigger Registry — How the Organism Repairs Itself

The Substrate That Sells Itself — How the Organism Generates Its Own Revenue

The Transactional Outbox — Why Every Agent Message Needs a Survival Guarantee

The W-Score — Continuous Coherence Monitoring for a Living Organism

The Weave — A Coordinator's Field Notes

Year One — What We Learned in Twelve Months of Substrate-First AI

AI Agent Memory

Boundary Note 001 — How a Model Learns a Culture