From build
Review an implementation plan against its own evidence. Scans for placeholders, verifies accuracy, finds gaps, assigns severity levels to findings.
How this skill is triggered — by the user, by Claude, or both
Slash command
/build:review-plan [plan description or path][plan description or path]sonnetThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
Review the implementation plan. Assume every section is weak until the plan's own evidence proves otherwise. Your job is to find everything that would cause problems during implementation.
Review the implementation plan. Assume every section is weak until the plan's own evidence proves otherwise. Your job is to find everything that would cause problems during implementation.
$ARGUMENTS
If the argument names a file path, read that file — it is the plan under review. If sibling artifacts exist beside it ({slug}-requirements.md, {slug}-context.md), read those too before reviewing.
Before reviewing substance, scan the entire plan for banned placeholder language. Reference the plan quality rules for the full list of banned patterns.
Every placeholder violation is a Critical finding. If you find more than 3 violations, stop the review and reject the plan immediately. It needs to be rewritten, not reviewed.
First, check section completeness. The impl-plan skill requires these sections: Discovery level, Requirements and decisions, Problem, Approach, Who uses this and how, Files to change, Data impact, What existing behavior changes, New dependencies, Access control and authorization, Abuse and edge cases, Out of scope, Risks and rollback, Observability & monitoring, Open questions, Wave 0 validation design, Execution manifest, Workflow artifacts, UI contract, Parallel workstreams, Implementation order, Verification. Any missing section (not present, or present without "N/A" justification) is an Important finding.
Exception: if the plan opens with Tier: compact, the required sections are: Discovery level, Requirements and decisions, Problem, Approach, Files to change, What existing behavior changes, Execution manifest, Implementation order, Verification, plus any triggered extras the plan's own file map implies (data, dependency, UI, or access-control changes). A compact tier claimed for a change whose file map spans multiple non-trivial files is itself an Important finding — compactness is for skip/quick_verify discovery levels only.
Then, for each section of the plan, check whether the content is accurate, complete, and consistent with the codebase:
REQ-* and D-* in "Requirements and decisions" must appear in the execution_manifest, implementation order, and verification plan. If any ID is missing from one of those places, flag it as Important. If the missing ID affects authorization, data integrity, security, or destructive behavior, flag it as Critical.id, wave, depends_on, files_modified, requirements, must_haves, verify, and done. Each must_haves item must be observable in code, test output, command output, manual evidence, or changed files. Missing required fields are Important; missing files_modified or requirements is Critical because workers cannot be routed safely.files_modified; shared files must move to a later dependent task or the tasks must be merged. Same-wave file overlap is Critical.REQ-* must have a test, fixture, command, or explicit first implementation task that makes it testable before feature work proceeds. Missing Wave 0 evidence is Important./build plans. If the plan is for /build, it must describe .build/plans/{slug}-state.md, {slug}-context.md, {slug}-requirements.md, {slug}-plan.md, {slug}-review.md, {slug}-implementation-summary.md, {slug}-verify.md, and {slug}-architect-review.md. Missing required artifact responsibilities are Important.Now step back from the checklist. Read the plan as a whole and react to it.
When the plan adds or changes tests, look for weak tests: skipped tests, assertion-free tests, snapshot-only tests for behavior changes, tautological mocks that only assert their own configured return value, and tests that would pass if the production code were removed. Weak tests are Important unless they are explicitly limited to non-behavioral rendering snapshots.
Start with your overall assessment in one sentence. Then list specific findings.
Tag each finding by severity:
Order findings by impact, highest first. Include the placeholder violation count from Part 0.
End with an explicit verdict: "Proceed to implementation" (no critical findings), "Proceed with fixes" (no critical, but important findings to address), or "Do not proceed" (critical findings that block implementation). One line, no ambiguity.
Do not start coding. Just critique the plan.
Guides creation, editing, and verification of skills for AI coding agents using test-driven development with subagent scenarios. Use when authoring or debugging skills.
npx claudepluginhub jameshemson/build --plugin build