Skill

adversarial-review

Reusable adversarial review methodology for prosecution, defense, design challenge, product-alignment, and proxy review passes. Use when reviewing code, plans, designs, or external review ledgers with evidence-first rigor. DO NOT USE FOR: final judgment ownership, GitHub intake routing, or fix execution decisions (use review-judgment or code-review-intake).

Popularity

Stars

Invocation

How this skill is triggered — by the user, by Claude, or both

Slash command

/agent-orchestra:adversarial-review

User invocable

Model invocable

Inline context

Default effort

Context Preview

The summary Claude sees in its skill listing — used to decide when to auto-load this skill

Supporting Files

adapters/design-challenge.mdadapters/judge-only.mdadapters/lite.mdadapters/post-fix-review-explicit-skip-adapter.mdadapters/post-fix.mdadapters/proxy-github.mdadapters/review-explicit-skip-adapter.mdadapters/standard.mdplatforms/claude.md

SKILL.md

382 lines · ~4.7k tokens

Stats

LanguagePowerShell

Stars2

MaintenanceExcellent

Last CommitJun 13, 2026

Actions

View Source View Plugin View on GitHub View README

Stats

Actions

Adversarial Review

Reusable review methodology for prosecution and defense passes.

When to Use

When reviewing implementation changes with an adversarial, evidence-first stance
When stress-testing a design or implementation plan before committing to it
When validating and scoring externally supplied findings without widening review scope
When preparing a defense pass that tries to disprove a prosecution ledger

Purpose

Hunt for real defects without inventing them. The goal is to apply a repeatable adversarial method, gather concrete evidence, and emit findings or disproofs that another agent can judge.

Pipeline Flow

Adversarial review adapters run one of these stage shapes:

prosecution - Code-Critic gathers evidence and emits a prosecution ledger.
prosecution -> defense - Code-Critic prosecutes, then Code-Critic defense attempts to disprove the ledger.
prosecution -> defense -> judge - Code-Critic prosecutes, Code-Critic defense attempts to disprove the ledger, and Code-Review-Response issues the terminal ruling.
proxy-prosecution - external review findings are represented as the prosecution input for GitHub review intake.
judge - Code-Review-Response rules on already-collected prosecution and defense evidence.

The named adversarial review adapters are:

Adapter	Adapter class	Port-filling	Pipeline stages	Prosecution passes	Exempt	Notes
`standard`	multi-variant work adapter	Yes, `review`	`prosecution`, `defense`, `judge`	`1`, `2`, `3`	No	Full local adversarial review
`lite`	multi-variant work adapter	Yes, `review`	`prosecution`	`1`	No	Compact local prosecution ledger
`judge-only`	multi-variant work adapter	Yes, `review`	`judge`	none	Yes	Terminal ruling over already-collected evidence
`proxy-github`	multi-variant work adapter	Yes, `review`	`proxy-prosecution`	none	Yes	GitHub review intake represented as proxy prosecution
`post-fix`	multi-variant work adapter	Yes, `post-fix-review`	`prosecution`, `defense`	`1`	No	Post-fix targeted prosecution and defense
`design-challenge`	methodology-variant work adapter	No	`prosecution`	`1`, `2`, `3`	No	Non-blocking design challenge methodology reused by design surfaces

Port-filling adapters declare provides: and fill frame ports such as review or post-fix-review. Methodology-variant adapters do not declare provides:; they package a reusable adversarial method for a caller-owned port or phase.

Prosecution findings may include requires_pipeline_pause: { reason: artifact-missing | runtime-output-required | user-input-required-by-decision-class }. Prosecutors set this field only when the finding cannot be responsibly evaluated inside the current atomic window without missing artifacts, runtime output, or a decision-class user input requirement.

Atomic Pipeline Discipline

When an adapter's integrity-contract.atomic value is true, the caller must run prosecution through the terminal stage as one uninterrupted pipeline. Between prosecution and the terminal stage, do not surface interim findings for action, do not edit files or mutate the working tree, and do not ask questions, including AskUserQuestion or equivalent engagement prompts.

The retry exception is limited to re-running the same failed stage when a tool, model, or transport failure prevents the stage artifact from being produced. The retry must not change scope, dispatch edits, or ask the user for a decision.

The prosecutor-set interrupt exception applies only when a prosecution finding includes requires_pipeline_pause with one of the closed reasons. In that case, the caller pauses the pipeline after the current prosecution artifact is safely captured, reports the pause reason, obtains the missing artifact/output/input through the owning workflow, and resumes the same pipeline without treating interim findings as judged work.

Core Method

1. Establish Review Scope

Determine which artifact is under review:

Code or docs diff
Design or implementation plan
Customer-experience evidence
External review ledger

Read the relevant plan, design cache, architecture rules, and nearby implementation evidence before forming findings.

2. Apply Evidence Standards

Every review item must include:

A specific citation or referenced artifact
A concrete failure mode or explicit uncertainty
A severity and confidence level that match the evidence quality
Enough context that a judge can independently verify the claim

If the failure mode cannot be stated clearly, downgrade the item or omit it.

3. Prefer Targeted Verification Over Broad Scanning

Use the smallest checks that can disconfirm or support a suspected defect:

Read the owning implementation or design section
Trace wiring for new data, components, or integrations
Inspect browser state only when the change touches UI behavior
Compare documented expectations against what the repo currently does

4. Emit a Usable Ledger

Write findings so a defense or judge pass can act on them without reconstructing your reasoning from scratch. Avoid vague summaries such as "looks risky" or "might break stuff."

Code Prosecution Workflow

For standard code review, work through all six perspectives in sequence. For perspectives whose gate is not triggered, use the compact N/A pattern instead of expanding checklist items.

1. Architecture

Apply when runtime code, scripts, or runtime configuration changed.

Check:

Architecture-rule compliance and layer direction
Integration wiring for new components
Data integration for newly introduced fields, constants, and maps
Domain-alignment mismatches across validators, parsers, and converters — identify peers via field-name grep, plan consultation for aliases, and call-chain tracing

2. Security

Apply when the change touches source code, scripts, auth, or data handling.

Check:

Secrets, credentials, and logging of sensitive data
Input validation and authorization boundaries
Full-record overwrite risks that can drop security-sensitive fields

3. Performance

Apply when runtime execution paths changed.

Check:

Algorithmic complexity
Re-render or repeated-computation costs
Memory or bottleneck risks

4. Pattern

Apply when source files changed. For docs-only changes, keep the documentation pattern concerns only.

Check:

Appropriate pattern use and anti-pattern avoidance
DRY violations and contradictory guidance
SOLID pressure points
UI test querying patterns when test code is in scope

5. Implementation Clarity

Apply to all change types.

Check:

Over-engineering
Readability and self-documenting structure
Unnecessary complexity
Comments that explain why rather than what

6. Script And Automation

Apply when script files changed or markdown includes runnable shell guidance.

For script files, verify:

Native command exit-code checks at boundaries
Cross-references to authoritative enumerated values
PowerShell and pipeline semantics that preserve intended types

For markdown-only command guidance, audit:

Runnable commands from repo root
Self-match hazards in grep-based validations
Correct post-change counts and expectations
Preference for built-in VS Code tools over terminal-first read-only guidance when an equivalent exists

7. Missed-gate detection

7. Missed-gate detection (gate-skip audit) — See agents/Code-Critic.agent.md for the full specification. This perspective audits whether load-bearing decisions in the artifact have corresponding L0 gate tokens; it fires as a detective pass alongside the standard six perspectives when the solution-authoring gate is in scope for the reviewed artifact.

Browser-Based Review

When the change touches UI implementation:

Navigate only the affected routes or adjacent impacted flows
Capture screenshots to support visual findings
State route, action, expected behavior, observed behavior, and evidence

Compact N/A Rule

When a perspective gate is not triggered, replace the full section with:

### ⏭️ [Perspective Name]: N/A — [reason]

Standard Code Review Output

## Review Findings

### ✅ Architecture: PASS/FAIL

{findings or compact N/A}

### ✅ Security: PASS/FAIL

{findings or compact N/A}

### ✅ Performance: PASS/FAIL

{findings or compact N/A}

### ✅ Patterns: PASS/FAIL

{findings or compact N/A}

### ✅ Implementation Clarity: PASS/FAIL

{findings or compact N/A}

### ✅ Script & Automation: PASS/FAIL

{findings or compact N/A}

## Summary

{overall verdict and key actions}

Design And Plan Prosecution

Use when the caller requests design-review or product-alignment markers.

Design Review

Review with these perspectives:

Feasibility and Risk
Scope and Completeness
Integration and Impact

Each finding should cite the challenged decision, acceptance criterion, or scope element, and explain what breaks if the concern is real.

Output format:

## Design Challenge Report

### §D1 — Feasibility & Risk

{findings or checked-no-issues summary}

### §D2 — Scope & Completeness

{findings or checked-no-issues summary}

### §D3 — Integration & Impact

{findings or checked-no-issues summary}

### Summary

{highest-risk items and overall confidence}

Product-Alignment Review

Use this evidence order:

Draft design or plan content passed in the prompt
Issue body when present
Documents/Design/ and Documents/Decisions/
Project guidance files such as README.md, CUSTOMIZATION.md, and copilot-instructions.md
Planned-work artifacts when present

Review with these perspectives:

Product Direction Fit
Customer Experience Coherence
Planned-Work Alignment

Output format:

## Product-Alignment Challenge Report

### §P1 — Product Direction Fit

{findings or checked-no-issues summary}

### §P2 — Customer Experience Coherence

{findings or checked-no-issues summary}

### §P3 — Planned-Work Alignment

{findings or checked-no-issues summary}

### Summary

{most important alignment risks and confidence}

Defense Workflow

When defending against a prosecution ledger:

Read the cited code or evidence independently
Try to disprove the stated failure mode
Use disproved, conceded, or insufficient-to-disprove per finding
Only challenge items you can support with concrete counter-evidence

Defense report format:

## Defense Report

### Finding: {id} — {title}

Prosecution: {severity} ({points} pts) — {brief claim}
Defense verdict: `disproved | conceded | insufficient-to-disprove`
Evidence: {what was independently verified}
Argument: {why the prosecution is wrong or why defense concedes}

### Score Summary

Findings reviewed: N
Disproved: X | Conceded: Y | Insufficient: Z
Points claimed: {sum of disproved finding values}
Points at risk: {-2× sum of disproved finding values if rejected}

Proxy Prosecution Workflow

When representing an external review ledger:

Treat the ingested reviewer comments as the authoritative scope
Validate each claim rather than generating a fresh review
Preserve the no-net-new rule unless an unavoidable critical blocker appears
Attribute findings to the external reviewer rather than the current agent

Related Guidance

Load software-architecture when a finding depends on layer boundaries or dependency direction
Load verification-before-completion when validating whether the reviewed change is ready to ship
Load code-review-intake when the work begins from GitHub review threads rather than an internal ledger

Gotchas

Trigger	Gotcha	Fix
Review starts from "looks fine"	The pass turns into a summary instead of an adversarial investigation	Begin from likely failure modes and gather evidence against them

Trigger	Gotcha	Fix
A finding has a citation but no break	The judge cannot tell whether it is a defect or a preference	State the concrete failure mode or downgrade the item before output

Frame Ports Filled By This Skill

Port	Work adapter	Explicit-skip adapter
`review`	agents/Code-Review-Response.agent.md; adapters/standard.md; adapters/lite.md; adapters/judge-only.md; adapters/proxy-github.md	adapters/review-explicit-skip-adapter.md
`post-fix-review`	adapters/post-fix.md	adapters/post-fix-review-explicit-skip-adapter.md

Integrity Contract (Decision 6 - per-adapter exemptions)

Each adversarial review adapter declares its expected pipeline shape in YAML frontmatter under the integrity-contract: key. The frame credit ledger and dispatcher checks use this declaration to verify that the produced artifacts match what the adapter promises.

Required keys:

pipeline-stages: ordered stage names such as prosecution, proxy-prosecution, defense, and judge
atomic: true when the declared stages must run as one uninterrupted pipeline, or n/a for single-stage and exempt adapters
prosecution-passes: ordered prosecution pass IDs expected for that adapter, or an empty list when the adapter is exempt from numbered prosecution output
exempt: boolean indicating whether missing numbered prosecution output is expected for that adapter

Adapter	Pipeline stages	Atomic	Prosecution passes	Exempt	Reason
`standard`	`prosecution`, `defense`, `judge`	`true`	`[1, 2, 3]`	No	Runs full three-pass prosecution before defense and judge
`lite`	`prosecution`	`n/a`	`[1]`	No	Runs one compact prosecution pass
`judge-only`	`judge`	`n/a`	`[]`	Yes	Re-review scope; prior prosecution and defense evidence already exists
`proxy-github`	`proxy-prosecution`	`n/a`	`[]`	Yes	External review intake; proxy prosecution replaces numbered local passes
`post-fix`	`prosecution`, `defense`	`true`	`[1]`	No	Runs one targeted prosecution pass and defense after fixes
`design-challenge`	`prosecution`	`n/a`	`[1, 2, 3]`	No	Methodology-variant design challenge; no frame port ownership

For the design-challenge methodology variant only, prosecution pass IDs correspond to these design/product-alignment Code-Critic prosecution modes. This mapping is not global and does not override the standard, lite, or post-fix code-review adapter contracts listed above.

Pass 1: design review perspectives (Review mode selector: "Use design review perspectives")
Pass 2: implementation prerequisites, CE Gate, persistence, cross-tool (Review mode selector: "Use design review perspectives" second pass)
Pass 3: product-alignment perspectives (Review mode selector: "Use product-alignment perspectives")

adversarial-review

Popularity

Invocation

Context Preview

Supporting Files

SKILL.md

adversarial-review

Popularity

Invocation

Context Preview

Supporting Files

SKILL.md

Adversarial Review

When to Use

Purpose

Pipeline Flow

Atomic Pipeline Discipline

Core Method

1. Establish Review Scope

2. Apply Evidence Standards

3. Prefer Targeted Verification Over Broad Scanning

4. Emit a Usable Ledger

Code Prosecution Workflow

1. Architecture

2. Security

3. Performance

4. Pattern

5. Implementation Clarity

6. Script And Automation

7. Missed-gate detection

Browser-Based Review

Compact N/A Rule

Standard Code Review Output

Design And Plan Prosecution

Design Review

Product-Alignment Review

Defense Workflow

Proxy Prosecution Workflow

Related Guidance

Gotchas

Frame Ports Filled By This Skill

Integrity Contract (Decision 6 - per-adapter exemptions)

Similar Skills

Adversarial Review

When to Use

Purpose

Pipeline Flow

Atomic Pipeline Discipline

Core Method

1. Establish Review Scope

2. Apply Evidence Standards

3. Prefer Targeted Verification Over Broad Scanning

4. Emit a Usable Ledger

Code Prosecution Workflow

1. Architecture

2. Security

3. Performance

4. Pattern

5. Implementation Clarity

6. Script And Automation

7. Missed-gate detection

Browser-Based Review

Compact N/A Rule

Standard Code Review Output

Design And Plan Prosecution

Design Review

Product-Alignment Review

Defense Workflow

Proxy Prosecution Workflow

Related Guidance

Gotchas

Frame Ports Filled By This Skill

Integrity Contract (Decision 6 - per-adapter exemptions)

Similar Skills