Carol — back to Apps ← Apps

Carolopedia

A friendly guide to Carol, her ecosystem, and the agents who built her.

📖 CarolopediaDroidsAlbus Failure Watcher
Albus Failure Watcher

Albus Failure Watcher

Droid Universal failure-watcher: 30s timer-fired one-shot scanning for pipeline failures Albus should triage
Go to droid →

📖About & Usage

Owner agent — accountability this droid serves

CAROL-INI-113 (Universal Albus failure-watcher) — single-source failure detection across pipeline

Droid responsibility

Detect pipeline-stopping failures (terminal exec failures, timed-out runs, idempotency-suppressed handshakes) and route to al-tr-01 [RETIRED CAROL-INI-2189: scheduler job disabled; failure-detection moved inline to Merlin/Elrond per INI-520; 3 residual janitorial jobs rehomed to al-cleanup-01 / me-replan-drain-01 / el-orphan-reap-01]

What the droid actually does

Scan executions + droid_runs + handshakes; emit Albus-bound handshakes with dedup; exit 0 cleanly when paused-flag or no work present

Boundaries

Read-only over execution data; never invoke fixes directly — that is al-tr-01s lane

👤Owner

Albus · Architect

📚Recent initiatives

Initiatives that touched this droid — a short summary each; open one for the full story.

CAROL-INI-1953-00: Auto-detected never_ran process: Albus Failure Watcher (al-watch-01)
Recurring operational incident, collapsed to one entry.
Orion · 2026-06-24 15:43
CAROL-INI-1845-00: Re-arm Albus failure watcher — register al-watch-01 in Hermione (orphaned by the INI-0974 scheduler migration)
Albus failure watcher (al-watch-01) was never migrated into the Hermione scheduler when scheduling moved off systemd timers (CAROL-INI-0974). Its old timer is disabled and nothing\u2026
Orion · 2026-06-19 18:36
Browse all initiatives →