Skill

adversarial-review

Launch a fresh-context adversarial review of specs or implementation. Uses the adversary agent with information asymmetry to find gaps, contradictions, and missing edge cases. Minimum 3 clean passes to convergence.

Popularity

Parent stars

Invocation

How this skill is triggered — by the user, by Claude, or both

Slash command

/vsdd-factory:adversarial-review [specs|implementation]

User invocable

Model invocable

Forked subagent

Default effort

Argument hint[specs|implementation]

Configuration

Agentadversary

Context Preview

The summary Claude sees in its skill listing — used to decide when to auto-load this skill

Launch the adversary agent to review specs or implementation with fresh context.

SKILL.md

218 lines · ~4k tokens

Stats

LanguageRust

Parent stars1

MaintenanceExcellent

Last CommitJun 13, 2026

Actions

View Source View Plugin View on GitHub View README

Stats

Actions

Adversarial Review

Launch the adversary agent to review specs or implementation with fresh context.

Invocation: This skill can be invoked by the orchestrator autonomously (during Phase 1d, Phase 5 convergence, and between fix bursts) or by the user via /vsdd-factory:adversarial-review. The disable-model-invocation: false setting allows orchestrator-initiated invocation, which is required for autonomous VSDD convergence loops. The user can still invoke it directly at any time.

The Iron Law

NO APPROVAL WITHOUT FRESH-CONTEXT REVIEW FIRST

Violating the letter of the rule is violating the spirit of the rule. Fresh context means the adversary has not seen prior review passes, the author's explanations, or the orchestrator's summary. Loading any of those contaminates the asymmetry the pattern depends on.

Announce at Start

Before any other action, say verbatim:

I'm using the adversarial-review skill to launch a fresh-context adversary pass on .

Then create TodoWrite entries: one per planned pass (minimum 2).

Red Flags

Thought	Reality
"I already reviewed this, I can skip the adversary pass"	Self-review is not adversarial review. Dispatch.
"The spec is obviously correct, one pass is enough"	Minimum is 3 clean passes. The rule exists because round 1 systematically misses things.
"Let me summarize the prior pass for the adversary to save tokens"	That destroys fresh context. Dispatch with only the target artifact.
"The adversary found nothing, let's call it done"	Zero findings after a short prompt is a prompt bug, not convergence. Re-dispatch with sharper scope.
"This finding isn't really critical, I'll downgrade it"	Severity is the adversary's call, not the orchestrator's. Record as-is.
"The same finding keeps appearing, the adversary is stuck"	It keeps appearing because it isn't fixed. Fix it, then re-run.
"Novelty is LOW after one pass, we've converged"	Minimum 3 clean passes. No exceptions.
"Let me tell the adversary what the prior reviewer found"	Information asymmetry is the mechanism. Do not leak prior findings.

Templates

Read and follow the output format in:

${CLAUDE_PLUGIN_ROOT}/templates/adversarial-review-template.md — review document structure
${CLAUDE_PLUGIN_ROOT}/templates/adversarial-review-index-template.md — review index
${CLAUDE_PLUGIN_ROOT}/templates/adversarial-finding-template.md — individual finding format

Target and Scope

Parse $ARGUMENTS to determine review target and scope:

Target (first positional argument):

specs — review spec documents (default)
implementation — review source code against specs

Scope (optional, after target):

--scope=full — read all documents in the target domain. Use for convergence candidates. This is the default.
--scope=diff-from:<commit> — focus on files changed since <commit>. Use between fix bursts to verify the last burst's changes. The adversary receives ONLY the changed files plus their immediate dependencies (parent BC, parent index).
--scope=paths:<pattern> — focus on specific paths (e.g., specs/architecture/, specs/behavioral-contracts/BC-2.05.*). Use for targeted verification of a specific subsystem or domain.

Examples:

/vsdd-factory:adversarial-review specs
/vsdd-factory:adversarial-review specs --scope=full
/vsdd-factory:adversarial-review specs --scope=diff-from:abc1234
/vsdd-factory:adversarial-review specs --scope=paths:specs/architecture/
/vsdd-factory:adversarial-review implementation --scope=paths:src/security/

When --scope is not full, note the scope limitation in the review output header so readers know the review was targeted, not comprehensive.

Worktree-Identity Preflight (MANDATORY)

Before dispatching the adversary for a Perimeter-1 per-story review, the orchestrator MUST resolve and embed the worktree-identity tuple. Skipping this step causes the adversary to read the wrong git tree — either a stale worktree .factory/ snapshot (#169) or the wrong feature checkout (#176) — producing phantom "absent file" findings or a dangerous false-GREEN review.

Orchestrator steps (pre-dispatch):

Resolve and assert the full identity tuple in one step by calling the tested helper:

STORY_ID="<story-id>"
EXPECTED_HEAD_SHA="<orchestrator-recorded-implementer-tip-sha>"
IDENTITY_TUPLE="$(STORY_ID="$STORY_ID" EXPECTED_HEAD_SHA="$EXPECTED_HEAD_SHA" \
  "${CLAUDE_PLUGIN_ROOT}/bin/resolve-worktree-identity.sh")" || {
  echo "dispatch-error: worktree-identity preflight failed — $IDENTITY_TUPLE"; exit 1
}

The helper resolves worktree-abs-path (SPACE-SAFE, anchored story-ID matching), verifies the .factory mount, and asserts git rev-parse HEAD == EXPECTED_HEAD_SHA. Any non-zero exit is a preflight failure — STOP and fix before dispatching.

LOCAL vs PR-level model: The LOCAL adversary cascade runs PRE-PUSH (there is no remote tracking branch yet). The EXPECTED_HEAD_SHA is therefore the SHA the orchestrator recorded immediately after the TDD-green step — NOT resolved from @{upstream} or any remote ref. At the PR-level perimeter (post-push), the expected value IS the pushed remote-branch tip; @{upstream} is appropriate only at that perimeter. Do not conflate the two contexts.
Confirm story-id: the story ID from the STORY-INDEX entry (e.g., S-12.08).
Resolve canonical-repo-root: extracted from the canonical-repo-root: field in the helper's output (step 1 above). The adversary reads all spec, BC, ADR, and story-spec files from <canonical-repo-root>/.factory/... — the entire <worktree>/.factory/ tree is a stale snapshot and off-limits as spec ground-truth. Do NOT use cd "$(git rev-parse --git-common-dir)/.." && pwd as an alternative — that construction is CWD-relative and reproduces the #169 wrong-root bug when invoked from outside the repo. The only correct resolution is via the SCRIPT_DIR-anchored helper output.

Embed the identity tuple in the adversary task prompt verbatim:

WORKTREE-IDENTITY TUPLE (pre-verified by orchestrator):
  worktree-abs-path:   <absolute-path>
  feature-HEAD-SHA:    <orchestrator-recorded-implementer-commit-sha>
  story-id:            <story-id>
  canonical-repo-root: <main-repo-root>

Adversary behavior (enforced by adversary.md Worktree-Identity Preflight section):

The adversary MUST ASSERT the identity tuple is present before producing any findings. If the tuple is missing or the paths are inconsistent, the adversary emits a dispatch-error and halts — it does NOT proceed to content review.

Primary structural guarantee — PATH-CONFINEMENT: The true structural guarantee that prevents reading the wrong tree is PATH-CONFINEMENT: the adversary reads feature-code evidence ONLY from paths rooted at the embedded worktree-abs-path, and spec/BC/ADR ground-truth ONLY from paths rooted at the embedded canonical-repo-root/.factory/. Because these absolute paths are embedded by the orchestrator (after being resolved and asserted by resolve-worktree-identity.sh), the adversary cannot accidentally read the wrong checkout or the wrong spec snapshot regardless of CWD or context. The ASSERT-before-findings step is defense-in-depth: it catches a MISSING tuple (dispatch misconfiguration) immediately, before any reads occur. It is NOT the primary mechanism that prevents wrong-tree reads — path-confinement is.

Why the orchestrator embeds, not the adversary verifies independently: The adversary is read-only (no Bash access); it cannot run git rev-parse HEAD itself. The orchestrator has Bash access and runs the SHA capture and equality assertion pre-dispatch via resolve-worktree-identity.sh. The adversary's role is to ASSERT the embedded identity tuple is present and consistent — confirming the orchestrator completed the preflight — not to independently compute it.

Filename Collision Guard (MANDATORY pre-flight)

Before writing any review file, the orchestrator MUST check for filename collisions:

Compute the target path: .factory/cycles/<current-cycle>/adversarial-reviews/pass-<N>.md
Check if the target path exists. If yes AND the existing file's content differs from what would be written:
- REFUSE the write. Do not overwrite.
- Emit a clear error: "Collision: <target-path> already exists with different content. Use a different cycle name or pass number."
- Point the caller to the cycle bootstrap skill (/vsdd-factory:factory-cycles-bootstrap) to set up a new cycle directory.
Check for legacy flat files. If .factory/specs/adversarial-review-pass-*.md files exist (pre-cycle layout), warn:
- "Legacy review files found in .factory/specs/. Consider running /vsdd-factory:factory-cycles-bootstrap to migrate to cycle-keyed layout."

This prevents silent overwrites of historical reviews in long-lived projects where Phase-1 and Phase-3 reviews coexist. A future enhancement would automate this as a preflight helper script.

For Spec Review

Full scope — read all spec documents:

.factory/specs/product-brief.md
.factory/specs/domain-spec/L2-INDEX.md → read index, then all sections (if exists)
.factory/specs/prd.md
.factory/specs/prd-supplements/*
.factory/specs/behavioral-contracts/*
.factory/specs/verification-properties/*
.factory/specs/architecture/*

Diff scope (--scope=diff-from:<commit>) — read only files changed since the specified commit, plus their parent indexes (BC-INDEX, ARCH-INDEX, etc.). This focuses the adversary on verifying the latest fix burst without diluting attention across the full spec corpus.

Path scope (--scope=paths:<pattern>) — read only files matching the specified glob pattern. Useful for targeted subsystem review.

Attack with the adversary protocol. Write findings to .factory/cycles/<current>/adversarial-reviews/.

Policy Rubric Auto-Loading (MANDATORY)

Before dispatching the adversary, the orchestrator MUST load the project's policy registry and inject it into the adversary's task prompt:

Read .factory/policies.yaml — if the file doesn't exist, skip (policies are also baked into agent prompts as a fallback)

For each policy, format as a rubric block:

POLICY <id> (<name>): <description>
Severity: <severity>. Scope: <scope>.
Verification steps:
  1. <step 1>
  2. <step 2>
  ...

Append all rubric blocks under a ## Project Policy Rubric heading in the adversary's task prompt
The adversary must execute the verification steps for each policy and report compliance per-policy in its findings. Verification steps give the adversary the concrete procedure — not just the rule name, but HOW to check it.

This replaces manual copy-pasting of policy text into every adversary dispatch. The registry is the single source of truth for which policies a project has adopted.

Why both agent prompts AND policies.yaml? Agent prompts carry the enforcement logic (what to do when a violation is found — severity classification, fix procedure, escalation rules). The registry carries the catalog (which policies exist, what they cover, what scope they apply to). The adversary needs both: the logic from its prompt to know HOW to check, and the catalog from policies.yaml to know WHAT to check for this specific project.

For Implementation Review

Read specs first, then review source code against them. Focus on spec drift and silent failures. Scope flags apply here too — --scope=paths:src/security/ focuses on a specific module.

Post-Adversary Persistence (MANDATORY)

The adversary agent has only Read/Grep/Glob tools — it cannot write files. After the adversary returns findings as chat text, the orchestrator MUST persist them:

Capture the adversary's full output verbatim (do not summarize or filter)
Determine the target path: .factory/cycles/<current-cycle>/adversarial-reviews/pass-<N>.md
- Read .factory/current-cycle for the cycle name (operator-defined slug; e.g., <release>-greenfield or <feature>-patch)
- If no current-cycle file exists, use the active cycle from STATE.md
- <N> is the pass number (1-based, sequential within the cycle)
Dispatch state-manager to write the findings file at the target path
Dispatch state-manager to update the adversarial review index (ADV-P<N>-INDEX.md) in the same cycle directory

Decision rationale: Option (a) chosen over granting adversary scoped Write access because: (1) preserves fresh-context information asymmetry — adversary never sees its own prior files; (2) state-manager already owns all .factory/ commits; (3) no harness support for path-scoped tool allowlists.

If the orchestrator skips this step, findings are lost when the conversation context resets. This was observed in practice when direct adversary spawns returned findings as chat text that disappeared on session boundary.

Pass Management

Each review is a numbered pass (ADV-P1, ADV-P2, etc.)
After each pass, assess novelty decay
When novelty is LOW (findings are refinements, not gaps), report convergence
Minimum 3 clean passes, maximum 10 before escalating

Lessons Learned (apply to ALL projects)

Fix Root Causes, Not Symptoms

When a finding shows BC-to-story drift (wrong error codes, missing struct fields, wrong formulas), the fix MUST be: read the authoritative BC, then rewrite the contradicting story section from scratch. Never apply targeted text replacements without first reading both the BC and the story section. In practice, incremental line-level patches caused the same findings to recur across 3-5 passes (S-3.01 security limits survived 3 fix attempts; S-3.04 alias system required 6 passes before a full rewrite resolved it in one pass).

Accumulate Invariants Across Passes

After each adversarial fix cycle, update the adversary prompt with ALL confirmed invariants (struct fields, error codes, version pins, dependency rules, persistence models). The invariant list grows monotonically. Each subsequent pass checks confirmed invariants efficiently and focuses on finding NEW issues.

No Early Termination

Do NOT shortcut to "it's clean" after 2 consecutive clean passes. Fresh-context review has compounding value — the adversary makes genuinely new findings through pass 9+ in complex projects, including findings every prior pass missed (e.g., phantom crate references that only surface when the adversary reads dependency-graph.md with truly fresh eyes).

Minimum convergence requirement: 3 consecutive clean passes (not 2). Even near-convergence, keep running passes until the minimum is met.

Trajectory Monotonicity

Finding counts must decrease monotonically across passes. If any pass shows MORE findings than the previous pass, this is a regression — stop and investigate root cause before proceeding. Possible causes:

New scope was added without pre-validation against the invariant list
A fix introduced a new defect
The adversary's perimeter expanded unexpectedly

Do NOT continue convergence passes until the regression is explained and resolved.

Pre-validate New Scope Additions

When new stories are added during adversarial convergence, they must be written by an agent with access to the full invariant list from prior passes. New stories should be pre-validated against known invariants before being committed. In practice, each new story introduced 3-5 findings because they lacked the rigor of adversarially-converged originals.

adversarial-review

Popularity

Invocation

Configuration

Context Preview

SKILL.md

adversarial-review

Popularity

Invocation

Configuration

Context Preview

SKILL.md

Adversarial Review

The Iron Law

Announce at Start

Red Flags

Templates

Target and Scope

Worktree-Identity Preflight (MANDATORY)

Filename Collision Guard (MANDATORY pre-flight)

For Spec Review

Policy Rubric Auto-Loading (MANDATORY)

For Implementation Review

Post-Adversary Persistence (MANDATORY)

Pass Management

Lessons Learned (apply to ALL projects)

Fix Root Causes, Not Symptoms

Accumulate Invariants Across Passes

No Early Termination

Trajectory Monotonicity

Pre-validate New Scope Additions

Similar Skills

Adversarial Review

The Iron Law

Announce at Start

Red Flags

Templates

Target and Scope

Worktree-Identity Preflight (MANDATORY)

Filename Collision Guard (MANDATORY pre-flight)

For Spec Review

Policy Rubric Auto-Loading (MANDATORY)

For Implementation Review

Post-Adversary Persistence (MANDATORY)

Pass Management

Lessons Learned (apply to ALL projects)

Fix Root Causes, Not Symptoms

Accumulate Invariants Across Passes

No Early Termination

Trajectory Monotonicity

Pre-validate New Scope Additions

Similar Skills