Skill

review

Phase 3 of the SDD cycle. Use when the user invokes /spec-tests-first:review <feature> to run a code-quality + security review on the feature's changed files. Dispatches code-quality-reviewer and security-reviewer in parallel against the file list, aggregates via code-reporter, and writes ./reports/code-review_<feature>_<ts>.md with stable finding-IDs (SEC-NNN / QUA-NNN) and a per-finding `Status: pending` line. Read-only — never edits source files. Updates docs/specs/<feature>/spec-status.md only to set the `Latest review:` pointer to the new report path. Does NOT gate progress (no BLOCK/CAUTION/GO); the Critical gate lives in /spec-tests-first:fix.

Invocation

How this skill is triggered — by the user, by Claude, or both

Slash command

/spec-tests-first:review <feature-name>

User invocable

Model invocable

Inline context

Default effort

Argument hint<feature-name>

Tool Access

This skill is limited to the following tools:

ReadWriteEditGlobGrepBashAgent

Context Preview

The summary Claude sees in its skill listing — used to decide when to auto-load this skill

**Announce at start:** Say to the user: "I'm using /spec-tests-first:review to dispatch `code-quality-reviewer` and `security-reviewer` in parallel against `$1`'s changed files, then aggregating via `code-reporter` into a timestamped report. Read-only — no source edits." Then proceed.

SKILL.md

207 lines · ~3.1k tokens

Stats

Parent stars0

MaintenanceGood

Last CommitMay 29, 2026

Actions

View Source View Plugin View on GitHub View README

Stats

Actions

/spec-tests-first:review — Phase 3: Code-Quality + Security Review

Announce at start: Say to the user: "I'm using /spec-tests-first:review to dispatch code-quality-reviewer and security-reviewer in parallel against $1's changed files, then aggregating via code-reporter into a timestamped report. Read-only — no source edits." Then proceed.

You are running Phase 3 of the SDD cycle for feature $1. Inputs: the feature's source code on the feature/$1 branch + docs/specs/$1/spec.md + docs/specs/$1/spec-status.md. Output: ./reports/code-review_$1_<YYYY-MM-DD_HHMMSS>.md.

Iron Law

Reviewers run in parallel (single message, two Agent calls). Reviewing serially is a contract violation — the agents don't share state, and parallel halves the wall time.

The reviewer agents are read-only by tool grant; the only mutating operations this skill performs are (a) the reporter writes the report file, and (b) this skill's targeted Edit of the Latest review: line and Phase 3 row in spec-status.md. No source files are touched here.

Pre-checks

Spec status complete? Read docs/specs/$1/spec-status.md. Every AC-ID should have status pass. If anything is fail / blocked / stale / in-progress, stop and tell the user:

Build is not complete for $1 — spec-status.md shows non-pass ACs. Run /spec-tests-first:build $1 first.
Tests still green? Resolve the test command + path (Phase 2 wrote ## Test commands and ## Test layout to CLAUDE.md; read them — see /spec-tests-first:build's Step 2 / Step 3 for parsing rules, including multi-service awareness). Dispatch the test-runner subagent once against the feature's full suite to confirm. If failed or errored is non-empty, stop with:

Feature tests are not green. Re-run /spec-tests-first:build $1 (or investigate manually). Review aborted.

Do NOT proceed to the reviewers if tests aren't green — they'd be reviewing broken code.

Step 1 — Identify scope

Build the file list to review. Sources:

Changed files on this branch vs main:
```
git diff --name-only main...HEAD
```
Cross-reference with docs/codebase-map.md — pick out rows touched/added in this build (build's end-of-build artifact step updates this map; entries added during the most recent /spec-tests-first:build are the ones to include).
Feature test files. Per the resolved profile in CLAUDE.md ## Test layout:
- feature_anchor: directory → all files under tests_root (e.g. tests/$1/**, src/test/java/.../$1/**).
- feature_anchor: file_prefix → files in tests_root matching the per-feature prefix.
- feature_anchor: co-located → grep across the source tree for test files whose names contain any of this feature's AC-IDs (AC-1.1, AC-1.2, ...). The AC-ID embedding rule from /spec-tests-first:build makes this deterministic.
Exclude:
- Generated files: package-lock.json, yarn.lock, pnpm-lock.yaml, Cargo.lock, go.sum, *.min.js, *.min.css, snapshot files, generated proto/SDK code.
- docs/specs/$1/* — these are spec artifacts, not source.
- .gitignore itself.
- Any file the user explicitly excludes (none by default; --exclude <glob> could be added later, not for v2).

If the resulting list is empty, stop with:

No files in scope to review for $1. (Did /spec-tests-first:build run on this branch?)

Print the resolved list (or the count and a head of 10 paths if it's long) so the user can sanity-check the scope before reviewers fire.

Step 2 — Dispatch reviewers in parallel

In a single message, dispatch both reviewer agents via the Agent tool. This is mandatory parallelism — the reviewers don't read each other's output, and sequential dispatch would double the wait time.

Each dispatch passes the scope file list and a short note about the feature. Example dispatch shape (your actual call uses the Agent tool, not pseudocode):

Agent[code-quality-reviewer]
  subagent_type: code-quality-reviewer
  prompt: |
    Review the following files for code quality (maintainability, complexity,
    naming, dead code, error handling, test coverage gaps). The scope is the
    feature "$1" — files were changed by /spec-tests-first:build to implement
    docs/specs/$1/spec.md.

    Files:
    <file list>

    Produce your standard `# Code Quality Review` Markdown report.

Agent[security-reviewer]
  subagent_type: security-reviewer
  prompt: |
    Review the following files for security issues (OWASP Top 10, CWE,
    injection, auth, crypto, secrets, dependency CVEs where applicable). The
    scope is the feature "$1".

    Files:
    <file list>

    Produce your standard `# Security Review` Markdown report.

Each reviewer is tools-restricted to Read, Grep, Glob, Bash and will optionally run any scanners present (eslint, ruff, gitleaks, semgrep, npm audit, etc.) in check-only mode. Missing tools are logged and skipped — never an error.

Wait for both to complete. Capture their full Markdown outputs (the # Code Quality Review block and the # Security Review block) verbatim.

Step 3 — Aggregate via the reporter

Dispatch code-reporter with both outputs as part of the prompt body:

Agent[code-reporter]
  subagent_type: code-reporter
  prompt: |
    Aggregate the two reviewer outputs below into a single timestamped report
    at ./reports/code-review_$1_<timestamp>.md. Embed each reviewer block
    verbatim per your standard rules, and prepend stable finding IDs
    (SEC-NNN, QUA-NNN) with `Status: pending` lines per the agent's
    "Finding IDs and Status column" section.

    Scope: feature "$1" — N files reviewed.

    ## code-quality-reviewer output
    <full Markdown block, verbatim>

    ## security-reviewer output
    <full Markdown block, verbatim>

The reporter writes the file and returns the absolute path + a 2–3 sentence summary. Capture the path.

If the reporter returns an error (couldn't write ./reports/, both inputs empty, etc.), surface the error to the user and stop — do not invent a substitute path.

Step 4 — Update spec-status.md (Latest review + Phase progress)

Two targeted edits to docs/specs/$1/spec-status.md, both via the Edit tool — never rewrite the file.

4a — `Latest review:` pointer

The Latest review: field lives in the top-of-file metadata block, between the title and the ## Phase progress table:

# spec-status: <feature>

Last updated: <YYYY-MM-DD>
Latest review: <absolute path to most recent report>

## Phase progress
...

If the file already has a Latest review: line → replace its value with the new report path.
If the file does NOT have one → insert it immediately after the Last updated: line.

4b — Phase 3 row in `## Phase progress`

Mark Phase 3 (review) = done, Updated = today, Notes = "<C> critical, <H> high, <M> medium, <L> low" (counts from the reporter's summary).

If the file does NOT have a ## Phase progress table (older v1 spec being touched by v2 for the first time), insert the full 6-row table between Latest review: and ## Status per Acceptance Criterion. Phase 1 = done (assume the spec exists), Phase 2 = done (assume the build pre-check passed), Phase 3 = done (this review just succeeded), Phases 4–6 = pending.

Step 5 — Brief summary + next-step pointer

Output exactly this format to the user (substitute counts from the reporter's summary line):

Review complete for `$1`.
  Report: <absolute path>
  Findings: <C> critical, <H> high, <M> medium, <L> low, <I> informational

Next:
  /spec-tests-first:fix $1      — walk findings one-by-one with revert-on-regression (recommended if any critical or high)
  /spec-tests-first:validate $1 — proceed without fixing (you can run /spec-tests-first:fix later)

If both critical and high are zero, omit the recommended-emphasis and present /spec-tests-first:validate first.

Do NOT print the report contents back. The user opens the file. Do NOT propose any further automatic action — gating is /spec-tests-first:fix's job.

Design invariants

Read-only across source. This skill never edits source files, tests, the spec, or codebase-map.md. The single mutating operation is the targeted Edit of the Latest review: line in spec-status.md — which is metadata, not code.
No verdict gating. This skill always reports the report path and the counts. It never tells the user "you can't continue". The Critical gate lives in /spec-tests-first:fix. The shipping gate (no outstanding Critical) lives in /spec-tests-first:ship.
Fresh report every invocation. Re-running /spec-tests-first:review $1 produces a new timestamped file under ./reports/. The old reports stay on disk for history. The Latest review: pointer is updated to the newest.
Stable finding IDs per report. SEC-001, SEC-002, QUA-001, ... are sequential within a single report. They do NOT survive across reports — a fresh review = fresh IDs. /spec-tests-first:fix is invoked against a specific report and reads IDs from there.
Parallel reviewer dispatch is mandatory. Sequential dispatch is a contract violation; the two agents don't share state, and parallel halves the wall time.
Test green is a hard pre-check. Reviewing broken code wastes review tokens and produces noise. If tests aren't green, abort early.

Edge cases

Case	Behavior
`./reports/` not writable	Reporter handles this — surfaces the error and asks. `/spec-tests-first:review` echoes the reporter's reply and stops.
Zero findings from both reviewers	Reporter writes "No high-priority items — both reviewers reported a clean bill." `/spec-tests-first:review` summary shows all zeros and recommends `/spec-tests-first:validate`.
One reviewer's output empty or missing	Reporter embeds a `> Note: No <quality/security>-reviewer output was supplied.` and continues. `/spec-tests-first:review` echoes the partial report.
Multi-service feature	Scope still comes from `git diff --name-only main...HEAD` — cross-service. Reviewers handle multi-language naturally.
Files extremely large (>10k lines)	Reviewers handle internally; no special case here.

Red Flags — STOP and reset

Thought	Reality
"I'll dispatch them serially — they don't share state anyway"	Parallel dispatch is part of the contract. Serial means 2× wall time for no gain. Use a single message with two Agent calls.
"I'll add a `Verdict: BLOCK/CAUTION/GO` to the summary"	This phase has no verdict. Gating lives in `/spec-tests-first:fix` (Critical) and `/spec-tests-first:ship` (no outstanding Critical). Don't invent gates.
"I'll write the report file myself instead of dispatching the reporter"	The reporter has its own format, finding-id rules, and Status-column initialization. Bypassing it breaks `/spec-tests-first:fix`. Always dispatch.
"Tests are failing but I'll review anyway"	The pre-check (test-runner pre-flight) exists to prevent reviewing broken code. If it returns non-empty `failed`/`errored`, abort.
"I'll print the full report back to the user"	The report stays on disk. Echo only the path + finding counts. Users open the file in their editor.

Rules

Always dispatch the two reviewers in parallel (single message, two Agent calls).
Always dispatch the reporter after both return; never write the report yourself.
Always update Latest review: in spec-status.md after the reporter succeeds.
Never gate the user (no BLOCK/CAUTION/GO verdict).
Never print the full report back to chat — only the path + summary.
Never invent finding IDs, severities, or remediations — the reporter owns those.

review

Invocation

Tool Access

Context Preview

SKILL.md

review

Invocation

Tool Access

Context Preview

SKILL.md

/spec-tests-first:review — Phase 3: Code-Quality + Security Review

Iron Law

Pre-checks

Step 1 — Identify scope

Step 2 — Dispatch reviewers in parallel

Step 3 — Aggregate via the reporter

Step 4 — Update spec-status.md (Latest review + Phase progress)

4a — `Latest review:` pointer

4b — Phase 3 row in `## Phase progress`

Step 5 — Brief summary + next-step pointer

Design invariants

Edge cases

Red Flags — STOP and reset

Rules

Similar Skills

/spec-tests-first:review — Phase 3: Code-Quality + Security Review

Iron Law

Pre-checks

Step 1 — Identify scope

Step 2 — Dispatch reviewers in parallel

Step 3 — Aggregate via the reporter

Step 4 — Update spec-status.md (Latest review + Phase progress)

4a — `Latest review:` pointer

4b — Phase 3 row in `## Phase progress`

Step 5 — Brief summary + next-step pointer

Design invariants

Edge cases

Red Flags — STOP and reset

Rules

Similar Skills

review

Invocation

Tool Access

Context Preview

SKILL.md

review

Invocation

Tool Access

Context Preview

SKILL.md

/spec-tests-first:review — Phase 3: Code-Quality + Security Review

Iron Law

Pre-checks

Step 1 — Identify scope

Step 2 — Dispatch reviewers in parallel

Step 3 — Aggregate via the reporter

Step 4 — Update spec-status.md (Latest review + Phase progress)

4a — Latest review: pointer

4b — Phase 3 row in ## Phase progress

Step 5 — Brief summary + next-step pointer

Design invariants

Edge cases

Red Flags — STOP and reset

Rules

Similar Skills

/spec-tests-first:review — Phase 3: Code-Quality + Security Review

Iron Law

Pre-checks

Step 1 — Identify scope

Step 2 — Dispatch reviewers in parallel

Step 3 — Aggregate via the reporter

Step 4 — Update spec-status.md (Latest review + Phase progress)

4a — Latest review: pointer

4b — Phase 3 row in ## Phase progress

Step 5 — Brief summary + next-step pointer

Design invariants

Edge cases

Red Flags — STOP and reset

Rules

Similar Skills

4a — `Latest review:` pointer

4b — Phase 3 row in `## Phase progress`

4a — `Latest review:` pointer

4b — Phase 3 row in `## Phase progress`