Skill

spec-review-codex

Use when a brainstorming design spec has been written and needs adversarial review before implementation planning, using OpenAI Codex as the independent reviewer. Requires the codex CLI. Triggers on: spec review codex, codex spec review, review spec with codex, codex review, review spec, spec review.

Popularity

Stars

Invocation

How this skill is triggered — by the user, by Claude, or both

Slash command

/claude-skills:spec-review-codex

User invocable

Model invocable

Inline context

Default effort

Context Preview

The summary Claude sees in its skill listing — used to decide when to auto-load this skill

Adversarial review of design specs using Codex as an independent reviewer. Loops until the spec passes with zero CRITICAL and zero IMPORTANT findings.

Supporting Files

spec-review-prompt.md

SKILL.md

168 lines · ~2.2k tokens

Stats

LanguageJavaScript

Stars1

MaintenanceExcellent

Last CommitJun 16, 2026

Actions

View Source View Plugin View on GitHub View README

Stats

Actions

Severity	Count
CRITICAL	X
IMPORTANT	X
ADVISORY	X
MINOR	X

Spec Review via Codex

Adversarial review of design specs using Codex as an independent reviewer. Loops until the spec passes with zero CRITICAL and zero IMPORTANT findings.

Why a different agent: The spec was written by this Claude instance. Self-review has author bias — the same blind spots that produced the issue prevent detecting it. Codex is a fresh model with no shared conversation context, making it an effective adversarial reviewer. Codex has filesystem access, so it verifies file paths and code references against the actual repo.

Sibling skill: spec-review-local does the same review with a local model served by LMStudio — use it when offline or when Codex is unavailable.

The Job

Locate the spec file
Send to Codex for adversarial review
Read findings
If verdict is NEEDS REVISION: fix the spec, loop back to step 2
If verdict is PASS: report clean to user
Maximum 3 review iterations (prevent infinite loops)

Do NOT proceed to implementation planning until the spec passes review.

Prerequisite: codex must be on PATH and authenticated. Verify with command -v codex — abort with a clear error if missing.

Step 1: Locate the Spec

If the user provided a file path as argument, use it
Otherwise, scan docs/superpowers/specs/ for the most recent spec by date prefix (YYYY-MM-DD). Match *-design.md
If no spec found, ask the user for the path (this is the only blocking question — without a spec there is nothing to review)

Read the spec file, then announce and proceed immediately — do not wait for confirmation:

"Sending <spec-path> to Codex for adversarial review."

This skill runs autonomously: it is a self-validator that hardens the spec before it reaches the user. Pausing for human approval at the start or between iterations defeats its purpose. Go straight to Step 2.

Step 2: Send to Codex for Review

Build the Codex command. The review prompt lives at ${CLAUDE_PLUGIN_ROOT}/skills/spec-review-codex/spec-review-prompt.md.

The reviewer needs to read the spec and the codebase but must not modify anything, so run Codex with the read-only sandbox. Capture the findings via --output-last-message (which writes Codex's final message to the findings file) rather than asking the model to write the file itself.

PLUGIN_ROOT="${CLAUDE_PLUGIN_ROOT}"
REVIEW_PROMPT="${PLUGIN_ROOT}/skills/spec-review-codex/spec-review-prompt.md"
SPEC_FILE="<path-to-spec>"
FINDINGS_FILE="/tmp/spec-review-findings-$(date +%s).md"

codex exec --sandbox read-only --output-last-message "$FINDINGS_FILE" "$(cat "$REVIEW_PROMPT")

---

# Spec to Review

$(cat "$SPEC_FILE")

---

# Instructions

1. Follow the review procedure above against this spec.
2. Verify all file paths, function names, and line numbers referenced in the spec against the actual codebase. The repository root is the current working directory. You are sandboxed read-only — do not attempt to write or modify files.
3. Your final message must be the complete findings document.
4. Use the exact output format specified in the review prompt.
5. End with the Summary table and Verdict."

Run this via Bash. Codex's final message (the findings) lands in $FINDINGS_FILE via --output-last-message.

Timeout: 120 seconds. If Codex times out, report the timeout to the user and ask whether to retry or skip.

On failure (codex not authenticated, network error): codex will exit non-zero. Report the exact stderr to the user and stop — do not loop.

Step 3: Read and Present Findings

Read the findings file. Parse the summary table at the bottom for counts and verdict.

Present to the user:

Spec Review — Iteration N/3

Severity Count
CRITICAL X
IMPORTANT X
ADVISORY X
MINOR X

Spec altitude: design / detailed-implementation Verdict: PASS / NEEDS REVISION

Severity	Count
CRITICAL	X
IMPORTANT	X
ADVISORY	X
MINOR	X

If PASS → go to Step 5. (ADVISORY/MINOR findings may remain on a PASS — surface them as notes, do not loop on them.) If NEEDS REVISION → go to Step 4.

List each CRITICAL and IMPORTANT finding (not ADVISORY or MINOR) with its title, problem, and suggested fix so the run stays transparent, then go straight to Step 4 and fix them. Do not ask for approval before fixing — the autonomous fix/re-review loop is the core of the skill. Only CRITICAL and IMPORTANT findings drive the loop; ADVISORY and MINOR are reported, never fixed-and-re-reviewed.

Step 4: Fix and Loop

For each finding (CRITICAL first, then IMPORTANT — ignore ADVISORY and MINOR here):

Read the quoted spec text from the finding
Read the suggested fix
Decide comply vs reframe (see Fixing Guidelines): a normal finding gets the fix applied with the Edit tool; an altitude finding (one demanding the spec transcribe detail a named source of truth already pins) gets reframed into a coverage rule, not enumerated
Apply the edit and briefly note what was changed

Convergence / enumeration-creep check (before re-running): Compare this iteration's IMPORTANT findings to the previous iteration's. If they are the same category AND merely finer-grained versions of the same underlying concern (e.g. round 2 said "enumerate the error branches," round 3 says "enumerate even more error branches"), the loop is ratcheting on altitude, not substance. Stop early and report:

"Findings are converging on enumeration detail, not substance. The design appears sound; the remaining findings are altitude disagreements better treated as ADVISORY. Treating as PASS with notes."

Then go to Step 5 — this is a PASS-with-notes outcome, not a max-iteration failure.

Otherwise, after all fixes are applied:

Increment the iteration counter
If iteration < 3 → go back to Step 2
If iteration = 3 → report to user:

"Reached maximum review iterations (3). Remaining findings: [list]. Please review the spec manually before proceeding."

Step 5: Report Clean

When Codex returns PASS:

"Spec passed adversarial review (iteration N/3, zero CRITICAL/IMPORTANT findings)."

If there are MINOR findings, list them: "N MINOR suggestions (non-blocking): [titles]"

The spec is now ready for implementation planning.

Fixing Guidelines

When fixing findings:

CRITICAL (contradictions, wrong references): Verify the correct information from the codebase before fixing. Do not guess.
CRITICAL (missing file paths / functions): Grep the codebase to find the correct path or function name. Update the spec with verified information.
IMPORTANT (ambiguous requirements): Pick the most reasonable interpretation and make it explicit. Add a "Decision:" note inline so the user sees what was decided.
IMPORTANT (missing error paths): Add a brief failure handling paragraph. Keep it proportional to the spec's existing level of detail.
IMPORTANT (missing edge cases): Add to the relevant section. If there's an edge cases table, add rows. If not, add a bullet list.
Never remove content to fix a finding. Clarify, correct, or expand instead.
Never change the architectural approach to fix a finding. If a finding suggests the approach is wrong, flag it to the user instead of changing it.
Reframe, don't comply, on altitude findings. If a finding asks the spec to transcribe implementation detail (enumerate more branches, cases, or guard returns) that a named external source of truth already pins — a characterization suite, golden master, or referenced source range — do NOT enumerate. Instead reframe the requirement as a coverage rule pointing at that source, and add a one-line Decision: note recording the choice. This is the altitude analogue of "never change the architectural approach": the reviewer is pushing the spec to the wrong altitude, and the right response is to reframe, not obey. If the finding was already ADVISORY, no spec edit is needed at all — just note it.

Iteration State

Track across iterations:

iteration: Current iteration number (1-3)
spec_path: Path to the spec being reviewed
findings_files: List of findings file paths (for audit trail)
fixed_count: Total findings fixed across all iterations
important_categories: The set of categories of this iteration's IMPORTANT findings — compared against the previous iteration to detect enumeration creep (Step 4 convergence check)

All findings files are preserved in /tmp/ for the user to inspect after the review completes.

spec-review-codex

Popularity

Invocation

Context Preview

Supporting Files

SKILL.md

spec-review-codex

Popularity

Invocation

Context Preview

Supporting Files

SKILL.md

Spec Review via Codex

The Job

Step 1: Locate the Spec

Step 2: Send to Codex for Review

Step 3: Read and Present Findings

Step 4: Fix and Loop

Step 5: Report Clean

Fixing Guidelines

Iteration State

Similar Skills

Spec Review via Codex

The Job

Step 1: Locate the Spec

Step 2: Send to Codex for Review

Step 3: Read and Present Findings

Step 4: Fix and Loop

Step 5: Report Clean

Fixing Guidelines

Iteration State

Similar Skills