From autoimprove
15 agent definitions for the autoimprove improvement loop. Each agent is a Markdown file with YAML frontmatter (`name`, `description`, optional `model`).
Agent reference
Owns all agent prompt files for the autoimprove loop. Defines the dispatch contract, tool access, and model selection for each agent. The AR triad sequencing constraint (E→A→J foreground sequential) lives here.
Agents are the execution layer of the autonomous improvement loop. Each agent specializes in a phase (grind / propose / research) or a quality check (AR triad, reviewer, transformer). Skills orchestrate; agents execute. Without this folder, there is no autonomous improvement.
Markdown + YAML frontmatter only. No runtime code in agent files. Model selection via the frontmatter `model:` field. Tool access via frontmatter `allowed-tools:` (or the inherited default from Claude Code).
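The frontmatter contract described above can be checked mechanically. The following is a minimal sketch, assuming the frontmatter is a flat block of `key: value` lines between two `---` fences; the function names and the example agent are hypothetical and not part of autoimprove, and nested YAML would need a real YAML parser.

```python
from typing import Dict

# Keys named in this document: name and description are required, model is optional.
REQUIRED_KEYS = ("name", "description")


def parse_frontmatter(text: str) -> Dict[str, str]:
    """Return the flat key/value pairs found between the leading '---' fences."""
    lines = text.splitlines()
    if not lines or lines[0].strip() != "---":
        raise ValueError("agent file must start with a '---' frontmatter fence")
    fields: Dict[str, str] = {}
    for line in lines[1:]:
        if line.strip() == "---":
            return fields  # closing fence reached
        if not line.strip() or line.lstrip().startswith("#"):
            continue  # skip blank lines and YAML comments
        key, sep, value = line.partition(":")
        if not sep:
            raise ValueError(f"not a 'key: value' line: {line!r}")
        fields[key.strip()] = value.strip()
    raise ValueError("frontmatter fence is never closed")


def validate_agent(text: str) -> Dict[str, str]:
    """Parse an agent file and require non-empty name and description."""
    fields = parse_frontmatter(text)
    missing = [key for key in REQUIRED_KEYS if not fields.get(key)]
    if missing:
        raise ValueError(f"missing required frontmatter keys: {missing}")
    return fields


# Hypothetical agent file, used only to illustrate the format.
EXAMPLE = """---
name: example-agent
description: Hypothetical agent used only to illustrate the format
model: some-model
---
Prompt body goes here.
"""

if __name__ == "__main__":
    print(validate_agent(EXAMPLE))
```

A check like this only verifies the shape of the frontmatter; agent behavior is still covered by the challenge suites.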
- `.md` files under `agents/`
- `skills/`
- `benchmark/`
- `experiments.tsv`, `context.json`, `epoch-baseline.json`: owned by root / skills
- `plugin.json`: owned by root
- `skills/run/`
- `benchmark/`
- `skills/_shared/`
- `skills/rubrics/`

Belongs here:
- `agents/new-agent.md`

Route elsewhere:
- `skills/run/`
- `benchmark/`
- `skills/<name>/SKILL.md`

`python3 ~/.claude/skills/applying-agentsmd-rulebook/scripts/check-agentsmd.py --root .`
Agent behavior is tested via skills/challenge/ and skills/test/. No dedicated unit tests for prompt files — challenges are the verification gate.
Update this file when:
- An agent is added to `agents/` or an existing one is removed

The adversarial review chain runs foreground: Enthusiast and Adversary dispatch in parallel (PARALLEL RULE, SKILL.md:141), and Judge dispatches sequentially after both complete. Never background.
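The dispatch order described here (Enthusiast and Adversary in parallel, Judge only after both finish, everything in the foreground) can be sketched as follows. This is an illustration under assumptions: `dispatch` is a hypothetical stand-in for however an agent is actually invoked, and the payload keys are invented for the example.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, Dict

# Hypothetical dispatcher signature: (agent id, payload) -> agent output.
Dispatch = Callable[[str, Dict[str, str]], str]


def run_ar_triad(dispatch: Dispatch, target: str) -> str:
    """Run Enthusiast and Adversary in parallel, then Judge after both complete."""
    with ThreadPoolExecutor(max_workers=2) as pool:
        # The Adversary receives no findings input; it works independently.
        e_future = pool.submit(dispatch, "autoimprove:enthusiast", {"target": target})
        a_future = pool.submit(dispatch, "autoimprove:adversary", {"target": target})
        # Blocking on both results keeps the chain in the foreground.
        findings = e_future.result()
        context = a_future.result()
    # The Judge arbitrates the Enthusiast's findings using the Adversary's context.
    return dispatch(
        "autoimprove:judge",
        {"target": target, "findings": findings, "adversarial_context": context},
    )


def fake_dispatch(agent: str, payload: Dict[str, str]) -> str:
    """Stand-in dispatcher that reports which payload keys each agent received."""
    return f"{agent}({', '.join(sorted(payload))})"


if __name__ == "__main__":
    print(run_ar_triad(fake_dispatch, "agents/judge.md"))
```

Waiting on both futures before calling the Judge is what enforces the sequencing; nothing is left running after `run_ar_triad` returns.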
| Agent | File | Role |
|---|---|---|
| `autoimprove:enthusiast` | `enthusiast.md` | Finds genuine improvements; aggressive optimism |
| `autoimprove:adversary` | `adversary.md` | Maps heuristics + safe zones independently; no findings input |
| `autoimprove:judge` | `judge.md` | Arbitrates E findings using A's adversarial context |
| `autoimprove:enthusiast-spec` | `enthusiast-spec.md` | Spec variant: gaps/ambiguities in design docs |
| `autoimprove:adversary-spec` | `adversary-spec.md` | Spec variant: maps prose heuristics + safe zones (planned-work markers, defaults) |
| `autoimprove:judge-spec` | `judge-spec.md` | Spec variant: arbitrates spec findings using A-spec's adversarial context |
| Agent | File | Role |
|---|---|---|
| `autoimprove:experimenter` | `experimenter.md` | Spawned per-experiment into worktree; blind to scoring |
| `autoimprove:proposer` | `proposer.md` | Phase 2: drafts structured proposals when grind stagnates |
| `autoimprove:researcher` | `researcher.md` | Phase 3: investigation memos, no code changes |
| Agent | File | Role |
|---|---|---|
| `autoimprove:reviewer` | `reviewer.md` | Scores skills against rubrics, identifies gaps |
| `autoimprove:transformer` | `transformer.md` | Rewrites skills applying rubric transforms |
| `autoimprove:convergence-analyst` | `convergence-analyst.md` | Analyzes experiment history for patterns |
| `autoimprove:challenge-runner` | `challenge-runner.md` | Runs challenge suites against agent outputs |
| `autoimprove:idea-explorer` | `idea-explorer.md` | Explores improvement ideas before committing |
| `autoimprove:docs-regenerate` | `docs-regenerate.md` | Regenerates docs from git diff (diff-only, never full source) |
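In every row of the tables above, the agent identifier is the file name without `.md`, prefixed by `autoimprove:`. A small sketch of that mapping, assuming the pattern holds for all agents and that the files live in an `agents/` directory; the helper name `agent_file` is hypothetical.

```python
from pathlib import PurePosixPath

PLUGIN_PREFIX = "autoimprove:"


def agent_file(agent_id: str, agents_dir: str = "agents") -> PurePosixPath:
    """Map an identifier such as 'autoimprove:judge' to its prompt file path."""
    if not agent_id.startswith(PLUGIN_PREFIX):
        raise ValueError(f"expected an id starting with {PLUGIN_PREFIX!r}: {agent_id!r}")
    name = agent_id[len(PLUGIN_PREFIX):]
    if not name:
        raise ValueError("agent id has no name after the prefix")
    return PurePosixPath(agents_dir) / f"{name}.md"


if __name__ == "__main__":
    print(agent_file("autoimprove:judge-spec"))
```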
- `autoimprove:adversarial-review` command or directly via Agent tool
- `experiments/research-<timestamp>.md`

`npx claudepluginhub tokyo-megacorp/autoimprove`