Carolopedia
A friendly guide to Carol, her ecosystem, and the agents who built her.
📖About
Ninad audit question 2026-07-03: does every RSI diagnosis initiative generate its audit log, recommendation report and Palantir entries? Answer: every REAL diagnosis does (23/23 Albus bypass-book audit rows, 23/23 recommendation decisions on the target, 25/25 Palantir entries). The only records failing the standard are CLONES: Elrond close-audit Follow-on Filer copies a finished diagnosis at close into a planner follow-on (CAROL-INI-2253-02, and 2280-02 filed again today) — clones with no bypass audit, no diagnosis target, that then block and feed the RSI loop junk. A diagnosis initiative is ANALYSIS-ONLY and complete at close by definition: its product is the recommendation decisions on the ORIGINAL; cookbook-deviation findings from its close stay persisted for Themis and the pattern droid. Fix: the follow-on filer skips initiatives tagged rsi-diagnosis / rsi-meta-diagnosis (reason recorded); the two existing clones are redirected to their parent diagnoses as duplicates.
⚖️Decisions
- Auto-detected remediation target INI-999900503 from title/description scan (matched CAROL-INI-2253-02 -> row id 999900503 (CAROL-INI-2253-02: RSI diagnosis and pipeline fix for CAROL-INI-0315-00 (INI 100)); override by setting remediates_initiative_id explicitly at bypass_start. (system-auto-detect)
- Elrond's bypass methodology checklist (a reminder, not a gate -- you've got this): 0. File it requested_mode='bypass' (planner-vs-bypass is a deliberate choice). bypass_start REFUSES a non-bypass initiative (CAROL-INI-1846), and the dispatcher only skips the bypass lane when the mode says bypass -- a 'planner' mistag lets Merlin's pipeline grab the placeholder step and block your finished work. 1. Filed as planned status -- let the bypass claim/activate it; never file active. 2. Open the bypass (bypass_start) with your droid id + the remediation answer (remediates_initiative_id=NNN, or remediates_nothing=True). 3. Work the blocks for your work-type: template -> design -> code -> test -> review. Do the real work; record decisions on the initiative as you make them. 4. Reality is recorded for you at close -- code (files changed), each decision, and the twin-review verdict become real activities tied to this initiative and show in the Activity Tracker like a planner run (CAROL-INI-1840). No dummy rows. 5. Keep the initiative status moving; it parks in 'reviewing' and is tagged uat-pending for you at close (CAROL-INI-1836), so the stuck-watchdog leaves it alone until UAT. 6. Close runs the gates (design/architecture compliance + caller-audit). If a gate flags something pre-existing or unrelated to your change, waive it with a clear written rationale -- audit, don't skip. 7. Bypass skips the planner's auto-orchestration, NOT the standards. Same template checklist, same review, same observability as a planner run. (elrond)
- Follow-on to parent INI 999900564 (orion)
- Scope inherited verbatim from parent INI 999900564 per CAROL-INI-361. (elrond.initiative_author)
- Validator-refinement (CAROL-INI-509): Criterion 'Regression suite shows zero new failures after the change' refined to reflect present-day state: regression suite currently has 153 failures, so 'zero new failures' is ambiguous; rephrased to 'confirm no new regressions beyond the current baseline of 153 failures'. (elrond.initiative_author)
- Validator-refinement (CAROL-INI-509): Refined criterion 4: updated baseline to explicitly reference the latest regression run (id=395, 153 failures) for precision, per validator feedback. (elrond.initiative_author)
- Validator round 2 still flagged 1 items — operator review needed (CAROL-INI-509). (elrond.initiative_validator)
- [status-router] planned -> dispatched | event=dispatch | RSI: auto-promoted bypasses depth limit (CAROL-INI-2198) (spb-01)
- Gap J (CAROL-INI-771): stuck-dispatched with queue.status='no-queue-row'; flipped to blocked so Escalation card surfaces it. Reason: (elrond.handover_watchdog)
- Gap J (CAROL-INI-771): stuck-dispatched with queue.status='no-queue-row'; flipped to blocked so Escalation card surfaces it. Reason: (elrond.handover_watchdog)
- Gap J (CAROL-INI-771): stuck-dispatched with queue.status='no-queue-row'; flipped to blocked so Escalation card surfaces it. Reason: (elrond.handover_watchdog)
- Gap J (CAROL-INI-771): stuck-dispatched with queue.status='no-queue-row'; flipped to blocked so Escalation card surfaces it. Reason: (elrond.handover_watchdog)
- Gap J (CAROL-INI-771): stuck-dispatched with queue.status='no-queue-row'; flipped to blocked so Escalation card surfaces it. Reason: (elrond.handover_watchdog)
- Gap J (CAROL-INI-771): stuck-dispatched with queue.status='no-queue-row'; flipped to blocked so Escalation card surfaces it. Reason: (elrond.handover_watchdog)
- Gap J (CAROL-INI-771): stuck-dispatched with queue.status='no-queue-row'; flipped to blocked so Escalation card surfaces it. Reason: (elrond.handover_watchdog)
- Gap J (CAROL-INI-771): stuck-dispatched with queue.status='no-queue-row'; flipped to blocked so Escalation card surfaces it. Reason: (elrond.handover_watchdog)
- Gap J (CAROL-INI-771): stuck-dispatched with queue.status='no-queue-row'; flipped to blocked so Escalation card surfaces it. Reason: (elrond.handover_watchdog)
- Gap J (CAROL-INI-771): stuck-dispatched with queue.status='no-queue-row'; flipped to blocked so Escalation card surfaces it. Reason: (elrond.handover_watchdog)
- Gap J (CAROL-INI-771): stuck-dispatched with queue.status='no-queue-row'; flipped to blocked so Escalation card surfaces it. Reason: (elrond.handover_watchdog)
- Gap J (CAROL-INI-771): stuck-dispatched with queue.status='no-queue-row'; flipped to blocked so Escalation card surfaces it. Reason: (elrond.handover_watchdog)
- Gap J (CAROL-INI-771): stuck-dispatched with queue.status='no-queue-row'; flipped to blocked so Escalation card surfaces it. Reason: (elrond.handover_watchdog)
- Gap J (CAROL-INI-771): stuck-dispatched with queue.status='no-queue-row'; flipped to blocked so Escalation card surfaces it. Reason: (elrond.handover_watchdog)
- Gap J (CAROL-INI-771): stuck-dispatched with queue.status='no-queue-row'; flipped to blocked so Escalation card surfaces it. Reason: (elrond.handover_watchdog)
- Gap J (CAROL-INI-771): stuck-dispatched with queue.status='no-queue-row'; flipped to blocked so Escalation card surfaces it. Reason: (elrond.handover_watchdog)
- Gap J (CAROL-INI-771): stuck-dispatched with queue.status='no-queue-row'; flipped to blocked so Escalation card surfaces it. Reason: (elrond.handover_watchdog)
- Gap J (CAROL-INI-771): stuck-dispatched with queue.status='no-queue-row'; flipped to blocked so Escalation card surfaces it. Reason: (elrond.handover_watchdog)
- Gap J (CAROL-INI-771): stuck-dispatched with queue.status='no-queue-row'; flipped to blocked so Escalation card surfaces it. Reason: (elrond.handover_watchdog)
- Gap J (CAROL-INI-771): stuck-dispatched with queue.status='no-queue-row'; flipped to blocked so Escalation card surfaces it. Reason: (elrond.handover_watchdog)
- Gap J (CAROL-INI-771): stuck-dispatched with queue.status='no-queue-row'; flipped to blocked so Escalation card surfaces it. Reason: (elrond.handover_watchdog)
- Gap J (CAROL-INI-771): stuck-dispatched with queue.status='no-queue-row'; flipped to blocked so Escalation card surfaces it. Reason: (elrond.handover_watchdog)
- Gap J (CAROL-INI-771): stuck-dispatched with queue.status='no-queue-row'; flipped to blocked so Escalation card surfaces it. Reason: (elrond.handover_watchdog)
- Gap J (CAROL-INI-771): stuck-dispatched with queue.status='no-queue-row'; flipped to blocked so Escalation card surfaces it. Reason: (elrond.handover_watchdog)
- Gap J (CAROL-INI-771): stuck-dispatched with queue.status='no-queue-row'; flipped to blocked so Escalation card surfaces it. Reason: (elrond.handover_watchdog)
- Gap J (CAROL-INI-771): stuck-dispatched with queue.status='no-queue-row'; flipped to blocked so Escalation card surfaces it. Reason: (elrond.handover_watchdog)
- Gap J (CAROL-INI-771): stuck-dispatched with queue.status='no-queue-row'; flipped to blocked so Escalation card surfaces it. Reason: (elrond.handover_watchdog)
- Gap J (CAROL-INI-771): stuck-dispatched with queue.status='no-queue-row'; flipped to blocked so Escalation card surfaces it. Reason: (elrond.handover_watchdog)
- Gap J (CAROL-INI-771): stuck-dispatched with queue.status='no-queue-row'; flipped to blocked so Escalation card surfaces it. Reason: (elrond.handover_watchdog)
- Gap J (CAROL-INI-771): stuck-dispatched with queue.status='no-queue-row'; flipped to blocked so Escalation card surfaces it. Reason: (elrond.handover_watchdog)
- Gap J (CAROL-INI-771): stuck-dispatched with queue.status='no-queue-row'; flipped to blocked so Escalation card surfaces it. Reason: (elrond.handover_watchdog)
- Gap J (CAROL-INI-771): stuck-dispatched with queue.status='no-queue-row'; flipped to blocked so Escalation card surfaces it. Reason: (elrond.handover_watchdog)
- Gap J (CAROL-INI-771): stuck-dispatched with queue.status='no-queue-row'; flipped to blocked so Escalation card surfaces it. Reason: (elrond.handover_watchdog)
- Gap J (CAROL-INI-771): stuck-dispatched with queue.status='no-queue-row'; flipped to blocked so Escalation card surfaces it. Reason: (elrond.handover_watchdog)
- Gap J (CAROL-INI-771): stuck-dispatched with queue.status='no-queue-row'; flipped to blocked so Escalation card surfaces it. Reason: (elrond.handover_watchdog)
- Gap J (CAROL-INI-771): stuck-dispatched with queue.status='no-queue-row'; flipped to blocked so Escalation card surfaces it. Reason: (elrond.handover_watchdog)
- Gap J (CAROL-INI-771): stuck-dispatched with queue.status='no-queue-row'; flipped to blocked so Escalation card surfaces it. Reason: (elrond.handover_watchdog)
- Gap J (CAROL-INI-771): stuck-dispatched with queue.status='no-queue-row'; flipped to blocked so Escalation card surfaces it. Reason: (elrond.handover_watchdog)
- Gap J (CAROL-INI-771): stuck-dispatched with queue.status='no-queue-row'; flipped to blocked so Escalation card surfaces it. Reason: (elrond.handover_watchdog)
- Gap J (CAROL-INI-771): stuck-dispatched with queue.status='no-queue-row'; flipped to blocked so Escalation card surfaces it. Reason: (elrond.handover_watchdog)
- Gap J (CAROL-INI-771): stuck-dispatched with queue.status='no-queue-row'; flipped to blocked so Escalation card surfaces it. Reason: (elrond.handover_watchdog)
- Gap J (CAROL-INI-771): stuck-dispatched with queue.status='no-queue-row'; flipped to blocked so Escalation card surfaces it. Reason: (elrond.handover_watchdog)
- Gap J (CAROL-INI-771): stuck-dispatched with queue.status='no-queue-row'; flipped to blocked so Escalation card surfaces it. Reason: (elrond.handover_watchdog)
- Gap J (CAROL-INI-771): stuck-dispatched with queue.status='no-queue-row'; flipped to blocked so Escalation card surfaces it. Reason: (elrond.handover_watchdog)
- Gap J (CAROL-INI-771): stuck-dispatched with queue.status='no-queue-row'; flipped to blocked so Escalation card surfaces it. Reason: (elrond.handover_watchdog)
- Gap J (CAROL-INI-771): stuck-dispatched with queue.status='no-queue-row'; flipped to blocked so Escalation card surfaces it. Reason: (elrond.handover_watchdog)
- Gap J (CAROL-INI-771): stuck-dispatched with queue.status='no-queue-row'; flipped to blocked so Escalation card surfaces it. Reason: (elrond.handover_watchdog)
- Gap J (CAROL-INI-771): stuck-dispatched with queue.status='no-queue-row'; flipped to blocked so Escalation card surfaces it. Reason: (elrond.handover_watchdog)
- Gap J (CAROL-INI-771): stuck-dispatched with queue.status='no-queue-row'; flipped to blocked so Escalation card surfaces it. Reason: (elrond.handover_watchdog)
- Gap J (CAROL-INI-771): stuck-dispatched with queue.status='no-queue-row'; flipped to blocked so Escalation card surfaces it. Reason: (elrond.handover_watchdog)
- Gap J (CAROL-INI-771): stuck-dispatched with queue.status='no-queue-row'; flipped to blocked so Escalation card surfaces it. Reason: (elrond.handover_watchdog)
- Gap J (CAROL-INI-771): stuck-dispatched with queue.status='no-queue-row'; flipped to blocked so Escalation card surfaces it. Reason: (elrond.handover_watchdog)
- Gap J (CAROL-INI-771): stuck-dispatched with queue.status='no-queue-row'; flipped to blocked so Escalation card surfaces it. Reason: (elrond.handover_watchdog)
- Gap J (CAROL-INI-771): stuck-dispatched with queue.status='no-queue-row'; flipped to blocked so Escalation card surfaces it. Reason: (elrond.handover_watchdog)
- Gap J (CAROL-INI-771): stuck-dispatched with queue.status='no-queue-row'; flipped to blocked so Escalation card surfaces it. Reason: (elrond.handover_watchdog)
- Gap J (CAROL-INI-771): stuck-dispatched with queue.status='no-queue-row'; flipped to blocked so Escalation card surfaces it. Reason: (elrond.handover_watchdog)
- Gap J (CAROL-INI-771): stuck-dispatched with queue.status='no-queue-row'; flipped to blocked so Escalation card surfaces it. Reason: (elrond.handover_watchdog)
- Gap J (CAROL-INI-771): stuck-dispatched with queue.status='no-queue-row'; flipped to blocked so Escalation card surfaces it. Reason: (elrond.handover_watchdog)
- Gap J (CAROL-INI-771): stuck-dispatched with queue.status='no-queue-row'; flipped to blocked so Escalation card surfaces it. Reason: (elrond.handover_watchdog)
- Gap J (CAROL-INI-771): stuck-dispatched with queue.status='no-queue-row'; flipped to blocked so Escalation card surfaces it. Reason: (elrond.handover_watchdog)
- Gap J (CAROL-INI-771): stuck-dispatched with queue.status='no-queue-row'; flipped to blocked so Escalation card surfaces it. Reason: (elrond.handover_watchdog)
- [status-router] dispatched -> blocked | event=stuck_10min_no_activity | Elrond safety net: initiative has had no activity for 10+ minutes. Blocking under the parallel safety mechanism. (el-watchdog)
- Elrond blocked initiative under the CAROL-INI-2162 dead-Albus protocol. Albus was supposed to wake for step 0 (cause=albus_no_show) but did not respond. Cause: albus_no_show. Reason: Elrond safety net: initiative stranded 10+ min. Albus wake failed or produced no useful result. (el-s1)
- Orion remediated: Albus RSI group diagnosis (via INI 999900522): [procedural, confidence high] The initiative was repeatedly retracted from the dispatch queue before Albus could execute step 0, then left in 'planned' status with no further dispatch attempt, causing the Elrond 10-minute inactivity safety net to trigger. The execution history is empty, confirming Albus never started work. This is a procedural failure: the dispatching system repeatedly queued and retracted the initiative without ever holding it long enough for an operator push or automated executor wake-up, and after the final retraction (orion)
- Orion remediated: Albus RSI group diagnosis (via INI 999900661): [infra, confidence high] The initiative never executed because the Albus agent did not wake to process it after dispatch; the wake mechanism failed (possibly due to the dead-Albus protocol), resulting in no activity for 10+ minutes and triggering Elrond's safety net. The queue row was never created, and no execution history exists, confirming no work was attempted. (orion)
- Orion remediated: Albus RSI group diagnosis (via INI 999900582): [infra, confidence high] The initiative was dispatched but the Albus executor never woke to process step 0 (no execution history, empty queue row), triggering Elrond's 10-minute stuck-activity safety net with cause=albus_no_show. This matches the dead-Albus protocol pattern seen in prior RSI diagnoses and is a procedural wake failure, not a defect in the step-creation change itself. (orion)
- [rsi-group] cause=stuck_10min_no_activity members=[999900653, 999900709, 999900638, 999900676, 999900697, 999900749, 999900752, 999900772] (leverage-first pick: largest same-cause group, 8 members) (elrond.rsi_loop)
- [status-router] blocked -> diagnosis | event=diagnosis_start | RSI loop: leverage pick cause=stuck_10min_no_activity group_size=8 (blocked since 2026-07-03 22:45:40); Albus diagnosis INI 999900947 (el-rsi-loop-01)
- Orion remediation in progress: INI-999900947 bypass opened — CAROL-INI-696: an Orion-driven bypass has been opened to remediate this parent. The canonical Orion remediated: marker will be posted on close — see cookbook 156 / 155. (shared.bypass.bypass_start)
- Albus RSI diagnosis (root cause): [infra, confidence high] The initiative was dispatched but the Albus executor never woke to process step 0, resulting in no activity for 10+ minutes and triggering Elrond's safety net. This is a recurring infra pattern (dead-Albus protocol) where the wake mechanism fails, as confirmed by prior RSI diagnoses. (albus)
- Albus RSI recommendations: - Before retriggering, verify that the Albus executor is active and the wake mechanism for step 0 is functional (check /home/caroladmin/dev/logs/initiatives.log for recent wake attempts). - Consider manually triggering the executor for this initiative (e.g., via a direct API call to the agent or restarting the agent service) to bypass the wake failure. - If the wake failure persists, run this attempt with a stronger model (Opus instead of Sonnet) to reduce the chance of executor non-response. || Next attempt succeeds because: The initiative's logic itself is sound (it proposes a targeted fix to prevent follow-on clones); the only barrier is the executor wake failure, which is a known infra issue with documented workarounds. With the executor correctly activated, the analysis will proceed and produce the required artifact set. (albus)
- Orion remediated: INI-999900947 bypass closed — CAROL-INI-696 close-marker: the Orion bypass INI-999900947 filed against this parent reached terminal state (closed). This row's literal prefix Orion remediated: is the canonical signal the cookbook-155 dispatcher gate looks for. (shared.bypass.bypass_end)
- [rsi-group-member-failed] 999900709 retrigger refused: {'ok': False, 'reason': 'create_returned_no_id: {\'error\': \'INI2205_BAD_CRITERIA: All success criteria appear process-only (LLM confirmed). Each must describe a measurable user-visible outcome. FAIL\', \'criteria\': ["After a bypass session closes, (elrond.rsi_loop)
- [rsi-group-member-failed] 999900638 retrigger refused: {'ok': False, 'reason': 'create_returned_no_id: {\'error\': \'INI2205_BAD_CRITERIA: All success criteria appear process-only (LLM confirmed). Each must describe a measurable user-visible outcome. FAIL\', \'criteria\': [\'Albus can emit a BLOCK_INITIA (elrond.rsi_loop)
- [rsi-group-member-failed] 999900676 retrigger refused: {'ok': False, 'reason': 'create_returned_no_id: {\'error\': \'INI2205_BAD_CRITERIA: All success criteria appear process-only (LLM confirmed). Each must describe a measurable user-visible outcome. FAIL\', \'criteria\': ["The original initiative carrie (elrond.rsi_loop)
- [rsi-group-member-failed] 999900697 retrigger refused: {'ok': False, 'reason': 'create_returned_no_id: {\'error\': \'INI2205_BAD_CRITERIA: All success criteria appear process-only (LLM confirmed). Each must describe a measurable user-visible outcome. FAIL\', \'criteria\': [\'_wake_albus_full checks droid (elrond.rsi_loop)
- [rsi-group-member-done] 999900749 -> retriggered as 999900948 (elrond.rsi_loop)
- [rsi-group-member-failed] 999900752 retrigger refused: {'ok': False, 'reason': 'create_returned_no_id: {\'error\': \'INI2205_BAD_CRITERIA: All success criteria appear process-only (LLM confirmed). Each must describe a measurable user-visible outcome. FAIL\', \'criteria\': ["The original initiative carrie (elrond.rsi_loop)
- [rsi-group-member-done] 999900772 -> retriggered as 999900949 (elrond.rsi_loop)
- Orion remediated: Albus RSI diagnosis: [infra, confidence high] The initiative was dispatched but the Albus executor never woke to process step 0, resulting in no activity for 10+ minutes and triggering Elrond's safety net. This is a recurring infra pattern (dead-Albus protocol) where the wake mechanism fails, as confirmed by prior RSI diagnoses. (orion)
- [rsi-retrigger-failed] {'ok': False, 'reason': 'create_returned_no_id: {\'error\': \'INI2205_BAD_CRITERIA: All success criteria appear process-only (LLM confirmed). Each must describe a measurable user-visible outcome. FAIL\', \'criteria\': [\'Closing an RSI diagnosis initiative no longer spawns a follow-on clone: the filer returns a recorded skip for diagnosis-tagged initiatives while still filing follow-ons for ordina (elrond.rsi_loop)
- Orion remediated: Albus RSI diagnosis: [infra, confidence high] The initiative was dispatched but the Albus executor never woke to process step 0, resulting in no activity for 10+ minutes and triggering Elrond's safety net. This is a recurring infra pattern (dead-Albus protocol) where the wake mechanism fails, as confirmed by prior RSI diagnoses. (orion)
- [status-router] diagnosis -> closed | event=operator_put | PUT /api/initiatives (operator)
- Closed: superseded by follow-on INI 999900950 (CAROL-INI-2290-02: Diagnosis initiatives exempt from close-audit follow-on cloning — keep every RSI diagnosis record complete) (elrond.initiative_author)
✅Success criteria
- Closing an RSI diagnosis initiative no longer spawns a follow-on clone: the filer returns a recorded skip for diagnosis-tagged initiatives while still filing follow-ons for ordinary initiatives (both paths verified) (must_have)
- The two existing clones (2253-02, 2280-02) are out of the blocked pile — redirected to their parent diagnoses — so the RSI loop and the Monitor blocked list no longer show phantom diagnosis work (must_have)
- Every diagnosis initiative record shows the full artifact set going forward: bypass audit rows, recommendation decisions on its target, and Palantir entries (spot-checked on the next organic diagnosis after deploy) (must_have)
- Run full regression suite and confirm no new regressions beyond the current baseline of 153 failures as reported in the latest regression run (id=395) (must_have)