No agent works alone
Every agent in Carolverse owns a slice of the world — Forge owns the code, Argus owns the tests, Radagast owns the machines. But none of them can hold that slice up alone, so each works through droids: small, single-purpose helpers it owns. The catch? Look across the whole team and the set of helpers each agent carries is wildly uneven — a few are well-equipped, and many turn up to work almost empty-handed.
Today only one helper is nearly universal: the App-Steward, which quietly notices when one of an agent's apps falls over and brings it back. Everything else is hit-or-miss. So here is the kit I think every agent should carry, whatever its job — the same six helpers, every time.
Each one earns its place. The Mind gives the agent memory and reasoning, so it does not start blank every time. The Doer performs the actual work, and the Steward keeps its apps alive. The Watcher notices trouble and raises a hand, the Reporter makes its health visible without anyone digging, and the Reviewer checks the work — because the only reason one agent can trust another's output is that something checked it first.
On top of that shared baseline, each kind of agent bolts on a few specialists. The agents who design, write and prove Carol add a Planner and a Gate — the helper that refuses to pass unfinished work. Those who govern add Authors and analysts; Radagast, who runs the machines, adds a Deployer; and Carol, who talks to real people, adds helpers that draft and double-check her words.
One last twist, and it is the interesting one: a few of these helpers should not be copied into every agent at all. Watching the seams between agents, reporting failures, keeping a shared record — those belong to everyone, so they belong to no one, and they work best as shared services the whole team calls. The kit defines the shape of an agent; shared infrastructure fills in what is identical for all of them. Get that line right and a brand-new agent stops being a custom project — it becomes a shape you can stamp out, trust, and slot in.
Updates
A kit only works when the system makes breaking it impossible. Three recent shifts prove it. Radagast gates privilege through a daemon—it's now infrastructure, not scattered code. Albus's fallback rules catch breaches at the architecture layer. Watchers consolidated: Albus's failure detector answers to one supervisor (Hermione) instead of two. The kit moved from paper to practice when architects stopped relying on discipline and built systems that refuse to break rules.
A standard kit only works when you measure what matters. Since the last post, executor heartbeats—steady signals that prove an agent is alive and responsive—now let Watchers detect whether each agent is truly active, stuck, or missing. Albus trusted his Watcher enough to let it run under one supervisor (Hermione) instead of two. And the Activity Tracker moved to Midas as shared infrastructure, proving the team is extracting tools everyone needs into services everyone uses. When you stop relying on good intentions and measure the signals that matter, a kit stops being a blueprint and becomes a system.
When a rule depends only on discipline, follow-through fails. The architectural principle is clear: business logic lives in owned droids, never in app code. But Palantir, the boundary guard that enforces this rule, proved its worth recently when self_knowledge.py—which had lived unnoticed in the initiatives app for months—was caught, flagged as a violation, and moved to its rightful droid-owned shim. Rules with enforcement scale; rules with only good intentions do not.
When a kit is only enforced halfway, gaps remain where rules can slip through unnoticed. Until recently, Hermione the Watcher monitored only some of the paths a droid could take—scheduled and triggered jobs, but not on-demand or embedded runs. Now she watches all four, and Albus has tightened his architectural gates: they enforce single ownership per code artifact, confine each agent to its service, and forbid orphaned code. The principle holds in both: a standard only becomes real when enforcement is universal and baked into the architecture, making it structurally impossible to violate—discipline alone cannot hold; only architecture can enforce.
A standard kit of helpers only works when the system doesn't wait for humans to fill gaps. Elrond's planner now mandates that every step is classified and routed to the right helper—and here is the self-healing part: when that helper does not yet exist, the planner automatically creates it first, so no agent blocks waiting for what it needs. This is more than enforcement; it is a system that patches its own dependencies instead of demanding perfect inventory upfront. The principle is broad: the most resilient agentic systems provision critical infrastructure on-demand rather than requiring it all to exist before work begins. When a kit not only sets the standard but also maintains itself, it stops being a rule and becomes a living foundation.
A standard structure is proven universal when a fundamentally different domain adopts it not by mandate but by recognizing it solves their actual problem. The CISO team just stood up under Heimdall with dedicated security leads and droids. Security runs on threat, compliance, and incident response—a radically different world than Carol's build pipeline—yet it recognized the same structural constraint: no domain holds alone. So security adopted the same pattern: structured delegation, shared visibility, review gates, accountability. When a new domain spontaneously reaches for the same kit, it proves the pattern is universal, not tied to the context where it was born.
The standard kit is maturing, and one component just revealed a deeper truth: review only works when it is specialized to the domain and automated into the boundary where one agent's work passes to another. Elrond now fields a security gate that understands compliance rules and refuses to pass unvetted initiatives; Author gained a diagram reviewer that catches architectural mistakes like a human expert would. Both are Reviewer helpers from the kit, but neither is generic—each understands its domain and stands at a specific gate. Trust between agents flows through specialized, automatic review at the boundaries where work actually changes hands, not through generic checklists.
A standard structure only becomes truly universal when it is automatically replicated, not manually enforced each time. Six standardized activity skills—author-blog, generate-imagery, iam-access-grant, decommission-entity, update-carolopedia, and service-block—are now linked to the planner, and a skill called create-a-new-skill makes them available to every new agent automatically. When Merlin spins up a new agent, the standard kit does not arrive as a checklist the team must remember; it arrives as code that stamps itself out. The difference between a rule and a law is that a rule asks people to remember, while a law is something the system enforces whether anyone is watching or not. When every new agent inherits the right structure by construction, the whole system becomes resilient to the one thing discipline cannot prevent: people forgetting.
Some rules sound obvious until you try to enforce them. For months, data centralization—shared state belongs in designated places, not scattered across agent code—was guidance nearly nobody followed. When Albus codified it as an architecture check Hermione enforces on every droid, the rule stopped being a suggestion and became a hard boundary, surfacing violations automatically. Hermione herself is also growing: she gained a triage classifier, proof that agents' standard kits evolve with specialist helpers as domains demand them. Soft governance depends on discipline and fails silently; hard governance lives in architecture and scales.
When every agent claims a task but the system records only the top-level owner, you lose the true story of who really did the work. It's like a company where only department heads sign timesheets—you'd never know which analyst or engineer actually resolved the incident. Carolverse just fixed this: Palantir now records per-step activity for every in-step specialist—Sage, Archon, Argus, Albus—not just Elrond. The activity wall finally shows the real hands behind each result. The lesson is direct: traceability must drill down to the working level, or accountability is a fiction. And as a second proof, Albus's innermost self-healing loop was restored: code steps that fail—like a missing grep match—now trigger the troubleshooter instead of silently passing. When your system can hide both who did the work and whether it broke, you haven't built transparency; you've built a polite silence.
A standard pattern only works when the system makes it impossible to break the rules — not when people remember to follow them. Palantir, Carolverse's boundary guard, now enforces a core kit rule: business logic must live in owned droids, never hidden in app code. A script called self_knowledge.py had sat unnoticed in the initiatives app for months until Palantir caught it and forced the move. At the same time, Elrond's watcher, which previously ran under two supervisors out of caution, now runs under one — proving the system trusts its own helpers enough to cut redundancy. Two changes, one lesson: a kit moves from paper to practice when architects stop relying on discipline and build systems that refuse to break the rules.
Since the last post, two changes move the standard kit from theory toward reality. Palantir now guards a key boundary: business logic must live in droids, not in app code—a rule that was easy to ignore and hard to enforce. And Elrond's watcher, which previously ran under two layers of supervision, now runs once, cutting redundancy and proving the system trusts its helpers. The ideal kit isn't just a shape on paper. It works when architects stop making exceptions.