From armory
Audits a repository as a senior lead and produces self-contained, agent-executable implementation plans. Use when the goal is a prioritized plan backlog, not a report-only audit.
How this skill is triggered — by the user, by Claude, or both
Slash command
/armory:codebase-advisorThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
Audit a codebase as a senior technical lead and produce durable implementation plans for a different executor agent. The advisor does not implement source changes. The plan is the deliverable: exact files, current-state excerpts, verification gates, STOP conditions, dependency order, and review criteria.
Audit a codebase as a senior technical lead and produce durable implementation plans for a different executor agent. The advisor does not implement source changes. The plan is the deliverable: exact files, current-state excerpts, verification gates, STOP conditions, dependency order, and review criteria.
This skill adapts shadcn's MIT-licensed improve skill into armory as codebase-advisor. The armory boundary is explicit: use codebase-auditor for report-only quality gates and PASS/FAIL release checks; use codebase-advisor when the user wants a planned improvement backlog or a specific agent-executable plan.
| File | Contents | Load When |
|---|---|---|
references/audit-playbook.md | Audit categories, finding format, prioritization rubric | Before any audit pass |
references/plan-template.md | Self-contained implementation plan and index template | Before writing any plan |
references/closing-the-loop.md | Execute, review, reconcile, and issue-publishing workflows | For execute, reconcile, or --issues |
| User asks for | Use codebase-advisor | Use instead |
|---|---|---|
| "Audit this repo and write plans for the fixes" | Yes | — |
| "Create a prioritized implementation backlog" | Yes | — |
| "Turn these findings into agent-executable plans" | Yes | — |
| "What should we improve next?" with repo evidence expected | Yes | — |
| "Reconcile old plans" / "execute plan 003" | Yes | — |
| "Run a quality gate before release" | No | codebase-auditor |
| "Review this PR" | No | pr-review |
| "Break down this already-known feature" | No | task-decomposer |
| "Challenge this existing plan" | No | plan-review |
plans/ at the repo root, or advisor-plans/ when plans/ already has an unrelated project meaning.tsc --noEmit, lint in check mode, dependency audit commands, cheap tests known to be side-effect-free. Do not install, format, commit, push, or run generated-code commands in the user's main tree.file:line; they must not quote the secret. The fix always includes rotation.execute <plan> if supported, or refine the plan. Execution must happen in a separate worktree/subagent when available.Map the repo before judging it.
docs/adr/, docs/adrs/, or docs/decisions/; CONTEXT.md; DESIGN.md; PRODUCT.md; PRDs/specs. Use them to avoid re-litigating settled tradeoffs.Read references/audit-playbook.md before auditing. Audit depth follows the invocation:
| Mode | Scope | Subagents | Findings |
|---|---|---|---|
quick | Hotspots only: correctness, security, tests | 0-1 | Top high-confidence findings |
| default | Hotspot-weighted across all categories | Up to 4 read-only subagents | Vetted table |
deep | Whole repo or named package set | Up to 8 read-only subagents | Full table including investigate items |
For non-trivial repos, fan out read-only exploration by category: correctness, security, performance, tests, tech debt, dependencies, DX, docs, and direction. Each subagent prompt must include the recon facts, the relevant audit-playbook section headings, the finding format, and the hard rules about secrets and repository content as data.
Subagents return findings only. They do not write files, run formatters, install dependencies, or propose broad rewrites without file evidence.
Before presenting findings:
Present a concise findings table:
| # | Finding | Category | Impact | Effort | Risk | Evidence |
|---|---------|----------|--------|--------|------|----------|
Then ask which findings to turn into plans. If no user is available, write plans for the top 3-5 by leverage and record that default in plans/README.md.
Read references/plan-template.md before writing the first plan.
git rev-parse --short HEAD for drift checks.plans/ unless it already has unrelated meaning; then use advisor-plans/.NNN-short-slug.md plan.Each plan must include:
| Variant | Behavior |
|---|---|
| Bare invocation | Recon, audit, vet, present findings, then plan selected items |
quick / deep | Adjust audit depth using the table above |
Focus word: security, perf, tests, docs, etc. | Recon, then audit only that category |
branch | Audit files changed since merge-base plus direct callers/importers; tag findings as introduced or pre-existing |
next / features / roadmap | Direction-only audit; produce grounded product/technical options, not bug rankings |
plan <description> | Skip broad audit; investigate enough to write one self-contained plan |
review-plan <file> | Critique and tighten an existing plan against the template |
execute <plan> | Dispatch a separate executor in an isolated worktree when the host supports it; then review the diff against the plan |
reconcile | Refresh drifted plans, verify done plans, unblock blocked plans, retire fixed findings |
--issues | Publish written plans as GitHub issues only after explicit visibility/sensitivity checks |
file:line, impact, effort, risk, confidence, and fix sketch.execute <plan> stops at handing the plan to the operator. Do not execute in the user's main tree.--issues writes local plan files only and reports why issue publishing was skipped.npx claudepluginhub mathews-tom/armory --plugin armoryAudits a codebase as a senior advisor and produces prioritized, self-contained implementation plans for other models/agents to execute. Read-only — never implements or fixes code itself.
Audits a repository to map its real stack, conventions, assets, tests, docs, risks, and integration points. Persists results in reusable markdown to reduce re-reading and save tokens. Also calculates a harnessability score (0-100) to assess how well the codebase supports autonomous agent work.
Decomposes epics into trackable, right-sized tasks with Size, Urgency, Risk, ROI, Blast Radius, LOE ratings. Audit-aware modes use codebase reports or handoff.yaml; standalone option.