Skill

autoresearch-x

Iterate anything. Prove everything. Autonomous iteration engine with evidence-chain tracking. Use when: "iterate on X", "optimize X until Y", "autoresearch", "auto-research", "run experiment loop", "debug X systematically", "investigate X", "analyze X with checklist", "loop until fixed", "overnight experiment", "keep trying until it works", or any task requiring autonomous iteration with structured tracking. Supports three modes: optimize (metric-driven code iteration), debug (phased diagnosis with hypothesis tracking), investigate (evidence-based research/analysis). Inspired by karpathy/autoresearch — generalized to any domain.

Invocation

How this skill is triggered — by the user, by Claude, or both

Slash command

/autoresearch-x:autoresearch-x

User invocable

Model invocable

Inline context

Default effort

Context Preview

The summary Claude sees in its skill listing — used to decide when to auto-load this skill

*Iterate anything. Prove everything.*

Supporting Files

ref-branching.md
ref-debug-mode.md
ref-investigate-mode.md
ref-optimize-mode.md
ref-tracking.md
templates/debug.md
templates/investigate.md
templates/optimize.md
templates/report-debug.md
templates/report-investigate.md
templates/report-optimize.md

SKILL.md

564 lines · ~6.2k tokens (exceeds 5k compaction limit)

Stats

Language: Python
Stars: 2
Maintenance: Excellent
Last Commit: Apr 3, 2026

autoresearch-x

Iterate anything. Prove everything.

An autonomous iteration engine. Set up a tracked run and iterate autonomously — optimizing code, debugging failures, or investigating questions — until the target is met or the user interrupts.

When to Use

Optimizing code against a benchmark metric
Debugging a failure systematically (not random fix attempts)
Investigating logs, data, or systems against a checklist
Running overnight experiments while you sleep

Do NOT use for one-shot tasks, tasks without measurable criteria, or simple edits.

Invocation

Skill Mode (Single Session)

/autoresearch-x — interactive setup (guided questions)
/autoresearch-x --template optimize|debug|investigate — start from a template
/autoresearch-x --program path/to/program.md — use an existing program.md
/autoresearch-x resume <tag> — resume a previous run
/autoresearch-x status — show current run status
Natural language — if the user describes a task (e.g., "optimize the API latency", "debug why auth fails"), infer mode/target/scope from context, draft a program.md, and present it for review. No flags needed.

Agent Teams Mode (`--teams` flag via Coordinator CLI)

uv run autoresearch-x --teams --program path/to/program.md — start teams mode
uv run autoresearch-x --teams --chat — interactive program.md design then teams mode
uv run autoresearch-x resume <tag> — resume interrupted teams run
uv run autoresearch-x status <tag> — check teams run status
uv run autoresearch-x cleanup — remove old runs (default: 7 days)

When to use teams mode: Complex tasks (30+ iterations) where context window pressure becomes a problem. Each iteration uses fresh teammates with clean context.

When to use skill mode: Simple tasks (5-10 iterations) where a single session suffices.

Three Modes (summary)

Mode	Goal	Phases	Key rule
optimize	Improve a metric	baseline → iterate	One change per iteration, keep/discard by metric
debug	Fix a failure	observe → diagnose → fix	Progressive scope, hypothesis matrix
investigate	Answer questions	gather → analyze → conclude	Evidence chains, Toulmin structure

BEFORE PROCEEDING: Read the reference files for your selected mode.

optimize: MUST READ: ref-optimize-mode.md
debug: MUST READ: ref-debug-mode.md
investigate: MUST READ: ref-investigate-mode.md

All modes: MUST READ: ref-tracking.md
Branching (v2): MUST READ: ref-branching.md

Setup Phase

Before the loop starts, complete this checklist:

Read reference files — already loaded via <reference> tags above.
Agree on a run tag — date-based default (e.g., mar23-api-latency). If tag exists, append suffix.
Create branch — git checkout -b autoresearch-x/<tag> from current HEAD.
Read/write program.md — user provides one, selects a template, or use the Interactive Setup flow below.
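
The run-tag rule above (date-based default, suffix on collision) could be sketched as follows. This is only an illustration: the function name `default_run_tag`, the two-digit day, and the `-2`, `-3` suffix style are assumptions, since the source only gives the example `mar23-api-latency` and says to "append suffix".

```python
import re
from datetime import date

# Fixed month abbreviations so the result does not depend on the locale.
MONTHS = ["jan", "feb", "mar", "apr", "may", "jun",
          "jul", "aug", "sep", "oct", "nov", "dec"]

def default_run_tag(description: str, today: date, existing: set[str]) -> str:
    """Build a tag like 'mar23-api-latency'; add a numeric suffix if it is taken."""
    # Lowercase the description and collapse anything non-alphanumeric to '-'.
    slug = re.sub(r"[^a-z0-9]+", "-", description.lower()).strip("-")
    base = f"{MONTHS[today.month - 1]}{today.day:02d}-{slug}"
    tag, n = base, 2
    while tag in existing:  # hypothetical suffix scheme: -2, -3, ...
        tag = f"{base}-{n}"
        n += 1
    return tag
```

In practice `existing` would be the set of directory names already present under `.autoresearch-x/`.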

Interactive Setup (when no program.md provided)

If the user's task description gives enough context, infer necessary info from context — draft a complete program.md and skip straight to the Program Review step. Only ask questions (Steps A-F) for sections you can't confidently infer. The goal is to get to iteration fast, not to interrogate the user.

Track inferred sections: As you build the draft, maintain a list of which sections were inferred from context vs explicitly provided by the user. For example, if the user said "optimize my API latency" and you inferred the eval command from project files, record inferred_sections: [Evaluation, Scope, Constraints, Checklist]. This list is passed to the reviewer for extra scrutiny on inferred content.

If the user provides no context at all (bare /autoresearch-x), guide them through this flow:

Step A: Mode (AskUserQuestion — 3 choices)

"What kind of iteration are you doing?"

optimize — improve a measurable metric (latency, accuracy, throughput)
debug — systematically diagnose and fix a failure
investigate — answer questions with evidence chains

Step B: Target (free-form)

"Describe success in one sentence — what does 'done' look like?" Example: "Reduce p99 API latency below 200ms" / "Auth test passes reliably" / "Root cause of 403 errors identified with evidence"

Step C: Scope (AskUserQuestion + free-form)

Auto-detect candidate files from recent git changes, error stacktraces, or project structure. Present the detected files and ask the user to confirm, add, or remove. Classify each as modify or readonly.

Step D: Evaluation (free-form, with mode-specific examples)

"What command measures progress? What metric should I track?" Show examples based on selected mode:

optimize: uv run bench.py --json → metric: p99_latency_ms → target: < 200
debug: pytest tests/test_auth.py -x → metric: pass/fail → target: pass
investigate: checklist completion → metric: items_resolved → target: all
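
To make the "command, metric, target" triple concrete, here is a minimal sketch of how a numeric target such as `< 200` might be checked against an eval result. It assumes the eval command prints a single JSON object on stdout; the helper names `extract_metric` and `target_met` are hypothetical and not part of the skill.

```python
import json
import operator

# Supported comparison operators for target expressions like "< 200".
_OPS = {"<=": operator.le, ">=": operator.ge, "<": operator.lt, ">": operator.gt}

def extract_metric(eval_stdout: str, metric: str) -> float:
    """Assumption: the eval command emits one JSON object containing the metric."""
    return float(json.loads(eval_stdout)[metric])

def target_met(value: float, target: str) -> bool:
    """Evaluate a target expression such as '< 200' or '>= 0.9' against value."""
    expr = target.strip()
    for symbol in ("<=", ">=", "<", ">"):  # two-character operators first
        if expr.startswith(symbol):
            return _OPS[symbol](value, float(expr[len(symbol):]))
    raise ValueError(f"unsupported target expression: {target!r}")
```

Pass/fail targets (debug mode) and checklist targets (investigate mode) are not numeric and would need separate handling.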

Step E: Constraints (AskUserQuestion — presets + free-form override)

"How much budget for this run?"

Light — 10 iterations, 30min timeout
Medium — 30 iterations, 1h timeout
Heavy — 50 iterations, 2h timeout
Infinite -- no max iterations and no timeout; run until the target is met or the user interrupts
User can override any value after selecting preset.

Step F: Context (free-form, optional — can skip)

"Any background I should know? What's been tried before?" If user says "skip" or "none", leave ## Context empty.

Step F.1: Program Review (automated — no user interaction)

Dispatch the reviewer subagent to validate the draft program.md:

Agent(
  subagent_type="autoresearch-x:reviewer",
  description="review draft program",
  prompt="
    Review this draft program.md for an autoresearch-x run.

    ## Draft Program
    <full program.md content assembled from Steps A-F>

    ## Mode
    <selected mode>

    ## Project Root
    <current working directory>

    ## Inferred Sections
    <list of section names that were inferred vs user-provided>

    Follow your review protocol. Return structured review.
  "
)

Process the review results:

Status: Approved — proceed directly to Step G.
Status: Issues Found — process each finding:
- BLOCK items: Ask the user a targeted question for each (one at a time via AskUserQuestion). Incorporate their answer into the draft. If the answer changes the mode, re-dispatch the reviewer.
- AUTO-FIX items: Apply the suggested fix to the draft silently. Note auto-fixes to show in Step G (e.g., "Auto-filled: pass/fail criteria set to exit code 0").
- Recommendations: Collect to show as notes in Step G.

After processing, proceed to Step G with the polished draft.

Skip conditions — do NOT dispatch the reviewer when:

--program path/to/program.md is used and the program contains ## Reviewed: PASS (previously reviewed)
resume <tag> is used (resuming an existing run)

When --program is used without ## Reviewed: PASS, the reviewer DOES run to validate the user-supplied program. Set inferred_sections: [] (none inferred, all user-provided).

Step G: Review & Approve (AskUserQuestion — approve / edit section) Show the polished program.md to the user. If there were auto-fixes or recommendations from the reviewer, show them:

"Here's your program.md. Ready to go?"

If auto-fixes were applied: "Note: I auto-filled [section] with [value] based on project context." If recommendations exist: "Reviewer notes: [recommendation text]"

Approve — write file (append ## Reviewed: PASS marker), continue setup
Edit a section — user picks which section to revise, re-ask that step. If the edit changes Scope, Evaluation, or Target, re-dispatch Step F.1 before re-showing Step G. For cosmetic edits (Context, Constraints presets), skip re-review.

Auto-generate the ## Checklist based on mode and answers:

optimize: baseline → profile → 3 approach slots (named from context if available)
debug: observe (add logging) → diagnose (test hypotheses) → fix (apply solution)
investigate: one checklist item per question/topic from the target

Parse checklist — extract items from program.md, initialize tracking.
Create .autoresearch-x/<tag>/ — directory structure for branching:
- branches/main/results.tsv (header only)
- branches/main/iterations/ directory
- branches.tsv (header + initial main row: main\t-\tactive\t1.0\t0\t-\t0\t<timestamp>)
- all-results.tsv (header only — same as results.tsv but with branch_id column prepended)
- report.md (skeleton)
- Archive program.md as program.v1.md
Add to .gitignore — append .autoresearch-x/ to project root .gitignore if not present. Create .autoresearch-x/.gitignore containing !*.
Activate guardrail hooks — run bash ${CLAUDE_PLUGIN_ROOT}/hooks/run-control.sh activate <tag> to enable iteration discipline hooks. This writes <tag>:main to .autoresearch-x/.active which activates scope-guard, iteration-gate, eval-bypass-detector, and completion-check hooks.
Verify prerequisites — eval command runs, scope files exist, dependencies available.
Establish baseline — first run with no changes. Record baseline metric.
Confirm with user — show setup summary, then go autonomous.
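
The directory layout created during setup could be initialized roughly as below. This is a sketch under stated assumptions: the column names of `results.tsv` and `branches.tsv` are defined in ref-tracking.md and ref-branching.md, not here, so the caller passes them in; the report skeleton text and the function name `init_run_dir` are placeholders.

```python
from pathlib import Path

def init_run_dir(root: Path, tag: str, results_header: list[str],
                 branches_header: list[str], timestamp: str,
                 program_text: str) -> Path:
    """Create .autoresearch-x/<tag>/ with header-only tracking files."""
    run = root / ".autoresearch-x" / tag
    main = run / "branches" / "main"
    (main / "iterations").mkdir(parents=True, exist_ok=True)

    # Per-branch results: header only.
    (main / "results.tsv").write_text("\t".join(results_header) + "\n")
    # Cross-branch results: same columns with branch_id prepended.
    (run / "all-results.tsv").write_text(
        "\t".join(["branch_id", *results_header]) + "\n")
    # Branch registry: header plus the initial main row from the checklist.
    main_row = ["main", "-", "active", "1.0", "0", "-", "0", timestamp]
    (run / "branches.tsv").write_text(
        "\t".join(branches_header) + "\n" + "\t".join(main_row) + "\n")

    (run / "report.md").write_text(f"# Report: {tag}\n")  # placeholder skeleton
    (run / "program.v1.md").write_text(program_text)      # archived program.md
    return run
```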

program.md Structure

Required sections marked with *.

# autoresearch-x: <run name>

## Target *
<One specific, measurable sentence.>

## Mode *
optimize | debug | investigate

## Checklist *
- [ ] Item 1
- [ ] Item 2

## Scope
- modify: path/to/file.py
- readonly: path/to/other.py

## Evaluation
- command: `uv run bench.py --json`
- metric: val_bpb
- target: < 0.95

## Constraints
- max_iterations: 50
- timeout: 2h

## Context
<Background, what's been tried, domain knowledge>

## Reviewed: PASS
<Appended automatically by setup flow on approval — do not add manually>
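
As an illustration of how this structure can be consumed, the sketch below splits a program.md into its `##` sections and checks the three required ones. It is not the coordinator's actual Program Parser; `parse_program` and `is_reviewed` are hypothetical names.

```python
import re

REQUIRED = ("Target", "Mode", "Checklist")

def parse_program(text: str) -> dict[str, str]:
    """Map each '## Section' heading to its body text; validate required sections."""
    sections: dict[str, list[str]] = {}
    current = None
    for line in text.splitlines():
        m = re.match(r"^##\s+(.+?)\s*$", line)
        if m:
            # The template marks required sections with a trailing '*'; drop it.
            current = m.group(1).rstrip("*").strip()
            sections[current] = []
        elif current is not None:
            sections[current].append(line)
    parsed = {name: "\n".join(body).strip() for name, body in sections.items()}
    missing = [name for name in REQUIRED if not parsed.get(name)]
    if missing:
        raise ValueError(f"missing required sections: {missing}")
    return parsed

def is_reviewed(parsed: dict[str, str]) -> bool:
    """True if the setup flow appended the '## Reviewed: PASS' marker."""
    return "Reviewed: PASS" in parsed
```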

★ THE OUTER LOOP (Branch Manager) ★

The Branch Manager wraps the inner iteration loop, handling branch selection, switching, forking, and mind explosion. See ref-branching.md for full details.

OUTER LOOP:
  1. READ branches.tsv → compute priorities for all non-stalled, non-pruned branches
  2. SELECT highest-priority branch
  3. SWITCH to that branch:
     a. Clean working tree check (git stash if dirty)
     b. git checkout autoresearch-x/<tag>/<branch_id>
     c. bash ${CLAUDE_PLUGIN_ROOT}/hooks/run-control.sh switch-branch <branch_id>
  4. RUN inner loop for K iterations (default K=3)
     - Worker receives cross-branch summary from all-results.tsv
     - EARLY EXIT: if branch hits 5 consecutive discards, stop inner loop, mark stalled
     - FORK CANDIDATES: record in pending_forks list (deferred — no immediate fork)
  5. UPDATE branches.tsv (priority, metrics, stall_count)
     Rebuild all-results.tsv (merge all branches/*/results.tsv)
  6. CREATE PENDING FORKS (deferred from inner Step 1):
     - For each: verify limits (max 8 branches, max 3 fork depth)
     - git tag checkpoint/cp-NNN, create branch, init tracking dir
     - Add row to branches.tsv with status=suspended
  7. CHECK GLOBAL STALL:
     - If ALL branches stalled: trigger Strategist (mind explosion)
     - See ref-branching.md for full Strategist dispatch and program revision gate
  8. CHECK COMPLETION:
     - If any branch completed: generate final report, stop
     - If total iterations >= max_iterations: stop
     - If mind explosions >= 3 with all still stalled: stop
  9. GOTO 1

Cross-branch context for Workers: Before dispatching a Worker, build a summary from all-results.tsv of what other branches tried and their outcomes. Include as ## Cross-Branch Context (read-only) in the Worker prompt. This prevents redundant exploration.

Single-branch runs: If no forks are created, the outer loop just selects main every time — functionally identical to v1.
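
Steps 2 and 8 of the outer loop can be sketched as two small functions. The priority formula itself lives in ref-branching.md, so here `priority` is simply a precomputed field; the `Branch` class, the status string `"completed"`, and the choice to ignore pruned branches when testing for a global stall are assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import Optional

@dataclass
class Branch:
    branch_id: str
    status: str      # assumed values: active, suspended, stalled, pruned, completed
    priority: float  # computed elsewhere (see ref-branching.md)

def select_branch(branches: list[Branch]) -> Optional[Branch]:
    """Step 2: pick the highest-priority branch that is neither stalled nor pruned."""
    candidates = [b for b in branches if b.status not in ("stalled", "pruned")]
    return max(candidates, key=lambda b: b.priority) if candidates else None

def stop_reason(branches: list[Branch], total_iterations: int,
                max_iterations: int, mind_explosions: int) -> Optional[str]:
    """Step 8: return why the run should stop, or None to continue."""
    if any(b.status == "completed" for b in branches):
        return "target met"
    if total_iterations >= max_iterations:
        return "budget exhausted"
    live = [b for b in branches if b.status != "pruned"]
    if mind_explosions >= 3 and live and all(b.status == "stalled" for b in live):
        return "all branches stalled"
    return None
```

When `select_branch` returns `None`, every remaining branch is stalled or pruned, which corresponds to the global-stall check in step 7.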

★ THE INNER ITERATION LOOP ★

This is the core of autoresearch-x. Everything above is setup. This section governs every iteration within a branch.

EACH ITERATION = EXACTLY ONE EXPERIMENTAL IDEA

ONE optimization (e.g., add TCP_NODELAY), OR
ONE algorithm change (e.g., replace bytes concat with bytearray), OR
ONE parameter tuning (e.g., increase buffer size to 256KB)
If your change description needs "and" or "also", split into separate iterations
If you think "while I'm here, I'll also..." — STOP. That is the NEXT iteration.

In debug mode OBSERVE phase: you MAY batch logging for multiple hypotheses in one commit (logging does not conflict). But you still cannot batch logic changes.

WHY: Multiple simultaneous changes make it impossible to attribute improvement or regression to any single change. Science requires controlling variables.

MAIN AGENT MUST DISPATCH — NEVER DO THE WORK DIRECTLY

To modify code → dispatch Worker subagent (subagent_type: "autoresearch-x:worker")
To run eval → dispatch Evaluator subagent (subagent_type: "autoresearch-x:evaluator")

Main agent ONLY: orchestrate, review diffs, decide keep/discard, git ops, tracking.

If you catch yourself opening a file to edit: STOP. Write a Worker dispatch instead. If you catch yourself running the eval command: STOP. Write an Evaluator dispatch instead.

WHY: The agent that writes the fix must not judge if it worked. Separating them prevents confirmation bias and rationalization.

Pre-Iteration Self-Check

Before EVERY iteration, verify:

Previous iteration fully recorded? (results.tsv row + iterations/ file)
About to try EXACTLY ONE change? (not two, not three)
Will dispatch to Worker? (not write code myself)
Will dispatch to Evaluator? (not run eval myself)
Budget remaining? (show: [iter N/max | N remaining | best: X | target: Y | branch: <id>]) Count total iterations across ALL branches. If no budget remains, abort.
Total time remaining? (read program.md for the start time and total timeout; show: [start: X | remaining: Y]) If no time remains, abort.
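
The two budget checks could look like the sketch below. It assumes `completed` is the number of iterations already recorded across all branches and that "N remaining" counts the iteration about to start; both function names are hypothetical.

```python
from datetime import datetime, timedelta

def budget_status(completed: int, max_iterations: int, best: object,
                  target: str, branch: str) -> str:
    """Format the status line, or raise if the iteration budget is spent."""
    remaining = max_iterations - completed
    if remaining <= 0:
        raise RuntimeError("iteration budget exhausted")
    return (f"[iter {completed + 1}/{max_iterations} | {remaining} remaining"
            f" | best: {best} | target: {target} | branch: {branch}]")

def time_remaining(start: datetime, timeout: timedelta, now: datetime) -> timedelta:
    """Return the time left in the run, or raise if the timeout has passed."""
    left = start + timeout - now
    if left <= timedelta(0):
        raise RuntimeError("time budget exhausted")
    return left
```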

The 9-Step Protocol

REPEAT until target met or budget exhausted or timeout:

Step 1: REVIEW & PLAN

First, review previous results before planning:

Read results.tsv — scan all rows for patterns (what worked, what didn't, streaks)
Read the last 2-3 iterations/<commit>.md files for detailed observations
Check: any "keep" results to build on? Any "discard" patterns to avoid?
Check: stuck? (5 consecutive discards → trigger Branch Stall Detection, described below)

Then state ONE specific change to try, informed by what you just reviewed.

"Iterations 2-4 all tried CPU-side optimizations with no gain. The eval output shows I/O wait is 80%. Try adding TCP_NODELAY to reduce small-frame latency."

For debug mode, also state the phase (observe/diagnose/fix), target hypothesis, and which evidence supports this direction.

Fork candidate detection: If you identify 2+ genuinely different root strategies (not parameter variations), record them in pending_forks but do NOT fork immediately. Pick one strategy for this iteration. Forks are created by the outer loop between branch visits. See ref-branching.md for fork rules.

Step 2: DISPATCH WORKER

Agent(subagent_type="autoresearch-x:worker", prompt="...", description="iter N: ...")

Include: program.md content, ONE instruction, scope, phase, previous iteration descriptions (NO metrics), and cross-branch summary (one-line summaries of what other branches tried, from all-results.tsv).

Worker reports: files changed, lines modified, observations.

Step 3: REVIEW DIFF

Verify:

Only scoped files modified
Only ONE idea implemented (not two smuggled in)
Phase rules respected (debug: no logic in OBSERVE/DIAGNOSE)

Violations → discard with description "out-of-scope change rejected."
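
The first check (only scoped files modified) is mechanical and could be sketched like this. It assumes scope entries are exact repository-relative file paths, and it uses `git diff --name-only` to list uncommitted changes; `changed_files` and `out_of_scope` are illustrative names, not part of the skill's hooks.

```python
import subprocess

def changed_files() -> list[str]:
    """List files with uncommitted changes in the working tree."""
    out = subprocess.run(["git", "diff", "--name-only"],
                         capture_output=True, text=True, check=True).stdout
    return [line for line in out.splitlines() if line]

def out_of_scope(changed: list[str], modify_scope: list[str]) -> list[str]:
    """Return changed paths that are not listed as 'modify' in program.md."""
    allowed = set(modify_scope)
    return [path for path in changed if path not in allowed]
```

A non-empty result from `out_of_scope(changed_files(), scope)` would mean the iteration is discarded as an out-of-scope change. The "only ONE idea" check still needs a human-style read of the diff.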

Step 4: COMMIT

git add <scoped files>
git commit -m "iter N: <one-sentence description>"

Record the short commit hash (7 chars).

Step 5: DISPATCH EVALUATOR

Agent(subagent_type="autoresearch-x:evaluator", prompt="...", description="eval iter N")

Include: eval command, metric name, target comparison, extraction command, timeout.

Evaluator reports: metric_value, target_met, extraction_method, eval_duration, peak_output.

Step 6: DECIDE

keep: metric improved, or metric equal with simpler code
discard: metric same or worse without simplification benefit
Debug mode: keep = test passes or useful evidence gathered
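
For optimize mode, the keep/discard rule reduces to a small comparison. A minimal sketch, assuming a numeric metric and a caller-supplied judgment of whether the code got simpler (the `decide` name and its parameters are hypothetical):

```python
def decide(new: float, best: float, lower_is_better: bool = True,
           simpler: bool = False) -> str:
    """Return 'keep' or 'discard' for an optimize-mode iteration."""
    improved = new < best if lower_is_better else new > best
    if improved:
        return "keep"
    if new == best and simpler:
        return "keep"      # equal metric, but the code is simpler
    return "discard"       # same or worse with no simplification benefit
```

Debug-mode decisions (test passes, or useful evidence gathered) depend on judgment about evidence and are not captured by this function.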

Step 7: ACT ON DECISION

keep: update "current best" metric. Branch advances.
discard: git checkout <prev_commit> -- <scoped files> then commit the revert.

Step 8: RECORD

Retrieve the current time via a time tool or bash date. Do not guess it.

Append one row to .autoresearch-x/<tag>/results.tsv. All columns required. Use - for N/A.

Step 9: WRITE DETAIL

Create .autoresearch-x/<tag>/iterations/<commit>.md. Update report.md.

DO NOT START THE NEXT ITERATION until ALL of these are done:

Change committed or reverted (step 4 or 7)
Row appended to results.tsv (step 8)
iterations/<commit>.md written (step 9), including the retrieved current time

These three artifacts are PROOF that an iteration happened. Without them, the iteration did not happen. Proceed to step 1 ONLY after step 9.
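
Recording and verifying those artifacts could be sketched as below. The column list is defined in ref-tracking.md, so it is passed in rather than hard-coded; `append_row` and `iteration_recorded` are hypothetical helpers, and the substring check for the commit hash is a simplification.

```python
from pathlib import Path

def append_row(results_path: Path, columns: list[str], row: dict[str, object]) -> None:
    """Append one TSV row; every column is written, with '-' for missing values."""
    unknown = set(row) - set(columns)
    if unknown:
        raise KeyError(f"unknown columns: {sorted(unknown)}")
    cells = ["-" if row.get(c) in (None, "") else str(row[c]) for c in columns]
    with results_path.open("a", encoding="utf-8") as f:
        f.write("\t".join(cells) + "\n")

def iteration_recorded(results_path: Path, iterations_dir: Path, commit: str) -> bool:
    """True if results.tsv mentions the commit and iterations/<commit>.md exists."""
    in_results = results_path.exists() and commit in results_path.read_text(encoding="utf-8")
    return in_results and (iterations_dir / f"{commit}.md").exists()
```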

Autonomy Protocol

Never-Stop Rule

Once the loop begins, do NOT pause to ask "should I continue?" Work autonomously until:

Optimize: metric target met
Debug: target test passes
Investigate: all checklist items resolved with evidence
Budget exhausted or human interrupts

Branch Stall Detection (replaces v1 Stuck Protocol)

If 5 consecutive iterations on the current branch produce no keep:

Stop the inner loop immediately (do not complete remaining K iterations)
Mark the branch as stalled in branches.tsv (stall_count = 5)
Log "branch stalled" in report.md
Return control to the outer loop (Branch Manager)
- The outer loop will switch to a higher-priority branch
- If ALL branches are stalled, the outer loop triggers a mind explosion (Strategist agent)
- See ref-branching.md for the full Strategist protocol

Do NOT attempt within-branch recovery (no ad-hoc pivots). Recovery happens at the outer loop level.
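
The stall condition is a count of trailing non-keep results. A minimal sketch, assuming the branch's decisions are available as a list of strings in chronological order (the function names are illustrative):

```python
def consecutive_non_keeps(decisions: list[str]) -> int:
    """Count how many of the most recent iterations did not end in 'keep'."""
    count = 0
    for decision in reversed(decisions):
        if decision == "keep":
            break
        count += 1
    return count

def is_stalled(decisions: list[str], threshold: int = 5) -> bool:
    """True once the branch has gone `threshold` iterations without a keep."""
    return consecutive_non_keeps(decisions) >= threshold
```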

Crash Handling

Quick fix (typo, missing import): fix and re-run
Fundamental issue: log crash, revert, move on
3 consecutive crashes: try radically different approach
Continue with the new plan until the budget is exhausted. Do not stop.

Run Completion

When the run ends (target met, budget exhausted, or user interrupts):

YOU CANNOT STOP WITHOUT A FINAL REPORT.

The completion-check hook will BLOCK you from stopping if report.md is missing a ## Conclusion section. Complete ALL steps below before stopping.

Finalize report.md — add a ## Conclusion section at the end with:
- Outcome: whether the target was met (YES/NO + evidence)
- Statistics: total iterations (across all branches), keeps, discards, crashes
- Branch summary: per-branch stats (iterations, best metric, final status)
- Best result: the best metric achieved, which branch and commit produced it
- Key findings: (optimize) what worked and why; (debug) root cause and fix; (investigate) answers to each checklist question
- Gitgraph: mermaid gitGraph visualization of the full branch tree (see ref-branching.md)
- Cross-branch analysis: what strategies worked, what failed, and shared failure patterns
- Mind explosions: if any occurred, summarize strategist proposals and their outcomes
- Recommendations: next steps if target not met, or how to maintain gains if it was
- Timeline: start time, end time, total duration

Deactivate guardrail hooks:

bash ${CLAUDE_PLUGIN_ROOT}/hooks/run-control.sh deactivate

Show final status to user — display the conclusion summary inline so the user sees it without opening the report file
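
The condition the completion-check hook enforces (a `## Conclusion` section in report.md) can be approximated with a one-line check before stopping. This is a sketch of the idea only, not the hook's actual implementation; `has_conclusion` is a hypothetical name.

```python
import re

def has_conclusion(report_text: str) -> bool:
    """True if the report contains a level-2 'Conclusion' heading on its own line."""
    return re.search(r"^##\s+Conclusion\s*$", report_text, flags=re.MULTILINE) is not None
```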

Examples

Optimize: API Latency

results.tsv:
2026-03-23T10:01  a1b2c3d  baseline  keep     -  -  312  baseline: 312ms
2026-03-23T10:07  b2c3d4e  iterate   keep     -  -  245  added connection pooling
2026-03-23T10:14  c3d4e5f  iterate   discard  -  -  251  tried async handlers — worse
2026-03-23T10:20  d4e5f6g  iterate   keep     -  -  189  query result caching — target met!

Debug: Auth Failure

results.tsv:
10:01  a1b  observe   keep  -          -          -  added logging for H1,H2,H3 (batch)
10:05  b2c  observe   keep  -          -          -  added timing probes for all 3
10:09  c3d  diagnose  keep  a1b,b2c    H1:++,H2:--,H3:?  -  matrix: H2 eliminated (rate=0)
10:16  e5f  diagnose  keep  a1b..d4e   H1:++,H3:--        -  H3 eliminated, H1 confirmed
10:22  f6g  fix       discard  e5f     H1:confirmed       -  tried cert pre-stage — still fails
10:33  h8i  diagnose  keep  g7h        H1:refined         -  cert gap is in renewal, not rotation
10:38  i9j  fix       keep  h8i        -          -  added cert renewal overlap — PASSES

Investigate: Error Patterns

results.tsv:
10:01  a1b  gather   keep  -      -  -  collected 7 days of error logs
10:05  b2c  gather   keep  -      -  -  fetched deploy timestamps from CI
10:09  c3d  analyze  keep  a1b,b2c  H1:++  -  403 correlates with deploy (7/7)
10:17  e5f  analyze  keep  c3d,d4e  H1:++  -  cert rotation is the mechanism
10:24  g7h  conclude keep  c3d,e5f  -  -  root cause: deploy triggers cert rotation

Agent Teams Architecture (v3)

Overview

Agent Teams mode uses Claude Code's teammate feature instead of subagents. Each iteration spawns fresh teammates with clean context, eliminating context window pressure for long runs (30+ iterations).

Key Differences from Skill Mode

	Skill Mode	Agent Teams Mode
Context	Single session accumulates history	Fresh teammates each iteration
Communication	Subagent report back	Inbox/outbox JSON files + mailbox
Skills access	Loaded by main agent	Auto-loaded by each teammate
Hooks	Main session only	Each teammate loads hooks independently
Use case	5-10 iterations	30+ iterations

Architecture

Coordinator (Python CLI)
├── State Manager     — state.json, results.tsv, branches.tsv
├── Teammate Manager  — spawn/poll/shutdown via claude --teammate
├── Branch Manager    — priority scoring, fork, stall detection
└── Program Parser    — parse program.md

Each iteration:
  Coordinator → inbox/planner.json → Planner teammate → outbox/planner.json
  Coordinator → inbox/worker.json  → Worker teammate  → outbox/worker.json
  Coordinator → inbox/evaluator.json → Evaluator teammate → outbox/evaluator.json
  Coordinator decides keep/discard, commits/reverts, records results
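
The inbox/outbox exchange above amounts to writing a task file and polling for a result file. The sketch below shows that pattern only; the JSON payload fields, the polling interval, and the function names are assumptions, and the real coordinator may handle spawning, partial writes, and cleanup differently.

```python
import json
import time
from pathlib import Path

def send_task(run_dir: Path, role: str, task: dict) -> Path:
    """Write inbox/<role>.json for a teammate (planner, worker, evaluator)."""
    inbox = run_dir / "inbox"
    inbox.mkdir(parents=True, exist_ok=True)
    path = inbox / f"{role}.json"
    path.write_text(json.dumps(task, indent=2), encoding="utf-8")
    return path

def wait_for_result(run_dir: Path, role: str,
                    timeout_s: float = 60.0, poll_s: float = 0.5) -> dict:
    """Poll outbox/<role>.json until it appears, then return its parsed content."""
    path = run_dir / "outbox" / f"{role}.json"
    deadline = time.monotonic() + timeout_s
    while time.monotonic() < deadline:
        if path.exists():
            return json.loads(path.read_text(encoding="utf-8"))
        time.sleep(poll_s)
    raise TimeoutError(f"no result from {role} within {timeout_s}s")
```

A production version would likely have the teammate write to a temporary file and rename it, so the coordinator never reads a half-written result.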

Teammate Definitions

Located in .claude/agents/:

planner.md — reads history, proposes ONE change
worker.md — executes code modifications (no eval access)
evaluator.md — runs eval commands (no code change access)
strategist.md — mind explosion when all branches stall

State Files

.autoresearch-x/<tag>/
├── state.json           — RunState (current progress, budget, metrics)
├── results.tsv          — iteration results
├── all-results.tsv      — cross-branch consolidated results
├── branches.tsv         — branch registry
├── inbox/               — task files for teammates
├── outbox/              — result files from teammates
├── iterations/          — detailed iteration notes
├── branches/
│   ├── main/
│   │   ├── results.tsv
│   │   └── iterations/
│   └── <fork-name>/
│       ├── results.tsv
│       └── iterations/
└── report.md            — final report
