Skill

adversary

Pre-implementation red-team analysis. Use when a plan is high-risk, critical path, or expensive to reverse. Challenges plans before code is written — finds edge cases, security holes, scalability bottlenecks, error propagation risks, and integration conflicts. Catches flaws at plan time (10x cheaper than post-implementation).

Popularity

Stars

Invocation

How this skill is triggered — by the user, by Claude, or both

Slash command

/topia:adversary

User invocable

Model invocable

Inline context

Default effort

Context Preview

The summary Claude sees in its skill listing — used to decide when to auto-load this skill

Pre-implementation adversarial analysis. After a plan is approved but BEFORE code is written, adversary stress-tests the plan across 5 dimensions: edge cases, security, scalability, error propagation, and integration risk. It does NOT fix or redesign — it reports weaknesses so the plan can be hardened before implementation begins.

Supporting Files

evals.mdreferences/context-bundle-format.mdreferences/oracle-mode.md

SKILL.md

345 lines · ~4.4k tokens

Stats

LanguageJavaScript

Stars1

MaintenanceExcellent

Last CommitJun 10, 2026

Actions

View Source View Plugin View on GitHub View README

Stats

Actions

adversary

Purpose

This fills the only gap in the plan-to-ship pipeline: all other quality skills (review, preflight, sentinel) operate AFTER code exists. Catching a flaw in a plan costs minutes; catching it in implementation costs hours.

adversary MUST NOT approve a plan without at least one specific challenge per dimension analyzed. A report that says "plan looks solid" without concrete attack vectors is NOT a red-team analysis. Every finding MUST reference the specific plan section, file, or assumption it challenges.

Triggers

Called by build Phase 2.5 — after plan approved, before Phase 3 (TEST)
/topia adversary — manual red-team analysis of any plan or design document
Auto-trigger: when plan files are created in .topia/ or docs/plans/

Calls (outbound)

guardian (L2): deep security scan when adversary identifies auth/crypto/payment attack vectors in the plan
perf (L2): scalability analysis when adversary identifies potential bottleneck patterns
recon (L2): find existing code that might conflict with planned changes
docs-seeker (L3): verify framework/API assumptions in the plan are correct and current
hallucination-guard (L3): verify that APIs, packages, or patterns referenced in the plan actually exist
context-engine (L3): (oracle-mode) emit context.preview before bundle build to gate token cost
session-bridge (L3): (oracle-mode) detach protocol when target model is opus-class for non-blocking dispatch

Called By (inbound)

build (L1): Phase 2.5 — after plan approval, before TDD
plan (L2): optional post-step for critical features
team (L1): when decomposing large tasks, adversary validates the decomposition
debug (L2): (oracle-mode) listens to agent.stuck from debug after 3 disproved hypotheses
fix (L2): (oracle-mode) listens to agent.stuck from fix after 2+ failed attempts
documentation (L2): reads adversary report for leadership risk summary
User: /topia adversary direct invocation

Cross-Hub Connections

adversary ← build — plan produced → adversary challenges it → hardened plan feeds Phase 3
adversary → guardian — security attack vector identified → sentinel validates depth
adversary → perf — scalability concern raised → perf quantifies the bottleneck
adversary → scout — integration risk flagged → scout finds affected code
adversary → plan — CRITICAL findings → plan revises before implementation

Execution

Step 0: Load Context

Read the plan document (from .topia/features/<name>/plan.md, phase file, or user-specified path)
Read the requirements document if it exists (.topia/features/<name>/requirements.md from idea)
Use scout to identify existing code files that the plan will touch or depend on
Identify the plan's core assumptions — what MUST be true for this plan to work?

Step 1: Edge Case Analysis

Challenge the plan's handling of boundary conditions.

For each input/output/state transition in the plan, ask:

Empty/zero: What happens with no data, zero items, empty strings, null users?
Overflow: What happens at MAX — 10K items, 1MB payload, 1000 concurrent users?
Race conditions: What if two operations happen simultaneously? Can state become inconsistent?
Partial failure: What if step 3 of 5 fails? Is there rollback? Or orphaned state?
Invalid combinations: What input combinations are technically possible but semantically nonsensical?

EDGE_CASE_TEMPLATE:
- Scenario: [specific edge case]
- Plan assumption: [what the plan assumes]
- Attack: [how this breaks]
- Impact: [what fails — data loss, crash, wrong result, security breach]
- Remediation: [1-sentence fix suggestion]

Step 2: Security Attack Vectors

Analyze the plan for security weaknesses BEFORE any code exists.

Input trust boundaries: Where does the plan accept external input? Is validation specified?
Authentication gaps: Does the plan assume auth exists? Are there unprotected routes or actions?
Data exposure: Could the planned API responses leak sensitive fields? Are there over-fetching risks?
Privilege escalation: Can a normal user reach admin functionality through the planned flow?
Injection surfaces: Does the plan involve dynamic queries, template rendering, or shell commands?
Dependency risk: Does the plan introduce new dependencies? Are they well-maintained and trusted?

If any auth, crypto, or payment logic is in the plan: MUST call topia:guardian for deep analysis.

SECURITY_TEMPLATE:
- Vector: [attack type — OWASP category if applicable]
- Entry point: [which part of the plan is vulnerable]
- Exploit scenario: [how an attacker would use this]
- Severity: CRITICAL | HIGH | MEDIUM
- Remediation: [what the plan should specify to prevent this]

Step 3: Scalability Stress Test

Project the plan forward — what happens at 10x and 100x scale?

N+1 queries: Does the plan describe data fetching that will create N+1 database calls?
Missing pagination: Does the plan handle lists without specifying limits?
Synchronous bottlenecks: Are there blocking operations in the hot path?
Cache invalidation: If caching is planned, what happens when data changes? Stale reads?
State growth: Does the plan accumulate state (in-memory, database, file system) without cleanup?
External service limits: Does the plan account for rate limits on third-party APIs?

If bottleneck patterns detected: call topia:perf for quantitative analysis.

SCALE_TEMPLATE:
- Bottleneck: [what breaks at scale]
- Current plan: [what the plan specifies]
- At 10x: [what happens]
- At 100x: [what happens]
- Remediation: [what to add to the plan]

Step 4: Error Propagation Analysis

Trace failure paths through the planned system.

Cascade failures: If Service A fails, does the plan specify what happens to B, C, D?
Retry storms: Does the plan include retries? Could retries amplify the failure?
Silent failures: Are there operations that could fail without anyone knowing?
Inconsistent state: If a multi-step operation fails midway, is the data left in a valid state?
User experience: When things fail, what does the user see? Is there a degraded mode?
Recovery path: After failure + fix, can the system resume? Or does it require manual intervention?

ERROR_TEMPLATE:
- Failure point: [where in the plan]
- Propagation: [what else breaks]
- User impact: [what the user experiences]
- Recovery: [how to get back to good state]
- Missing in plan: [what the plan should specify]

Step 5: Integration Risk Assessment

Check for conflicts with existing code and architecture.

Use topia:recon to find all files the plan will modify or depend on
Breaking changes: Does the plan modify shared interfaces, types, or APIs that other code depends on?
Migration gaps: Does the plan require database migrations? Are they reversible?
Configuration drift: Does the plan add new environment variables, feature flags, or config files?
Test invalidation: Will existing tests break from the planned changes?
Deployment ordering: Does the plan require specific deployment sequence? (DB first, then API, then frontend?)

INTEGRATION_TEMPLATE:
- Conflict: [what clashes]
- Existing code: [file:line that would be affected]
- Plan assumption: [what the plan assumes about existing code]
- Reality: [what the existing code actually does]
- Remediation: [how to resolve the conflict]

Step 6: Verdict and Report

Synthesize all findings into an actionable report.

Before reporting, apply rigor filter:

Only report findings you can justify with specific references to the plan or codebase
Do NOT report theoretical concerns that require 3+ unlikely conditions to trigger
Prioritize findings that would cause the MOST wasted implementation time if discovered later
Consolidate related findings — "auth is underspecified" not 5 separate auth findings

Verdict logic:

Any CRITICAL finding → REVISE (plan must be updated before Phase 3)
3+ HIGH findings → REVISE
HIGH findings with clear remediations → HARDEN (add remediations to plan, then proceed)
Only MEDIUM/LOW findings → PROCEED (note findings for implementation awareness)

After reporting:

If verdict is REVISE: return to plan with findings attached as constraints
If verdict is HARDEN: present remediations to user for plan update; emit adversary.passed once remediations are accepted
If verdict is PROCEED: pass findings to build Phase 3 as implementation notes; emit adversary.passed so downstream skills (e.g. documentation) can package the hardened plan for stakeholders

Output Format

## Adversary Report: [feature/plan name]
- **Plan analyzed**: [path to plan file]
- **Dimensions checked**: [which of the 5 were relevant]
- **Findings**: [count by severity]
- **Verdict**: REVISE | HARDEN | PROCEED

### CRITICAL
- [ADV-001] [dimension]: [description with plan reference]
  - Attack: [how this breaks]
  - Remediation: [specific fix]

### HIGH
- [ADV-002] [dimension]: [description with plan reference]
  - Attack: [how this breaks]
  - Remediation: [specific fix]

### MEDIUM
- [ADV-003] [dimension]: [description]

### Strength Notes
- [what the plan does well — adversary is harsh but fair]

### Verdict
[Summary: why REVISE/HARDEN/PROCEED, what to do next]

Workflow Modes

Full Red-Team (default)

All 5 dimensions analyzed. Used for new features, architectural changes, security-sensitive plans.

Quick Challenge (for smaller plans)

Skip Steps 3-4 (scalability, error propagation). Focus on edge cases, security, and integration. Trigger: plan modifies < 3 files AND no auth/payment/data logic.

Security-Focused

Steps 2 and 5 only (security + integration). Used when guardian requests adversarial pre-analysis. Trigger: plan involves auth, crypto, payment, or user data handling.

Mode: oracle (v0.2.0)

Triggered by: agent.stuck signal — emitted by debug (after 3 disproved hypotheses) or fix (after 2+ failed attempts on the same file).

Purpose: Break confirmation-bias loops. The same agent that read auth.ts 3 times has formed a theory it cannot un-form. Oracle-mode dispatches a stateless second-model pass with explicit "no prior context" framing, breaking the semantic loop that scout's zoom-out mode (structural pivot) cannot.

When NOT to use:

Single hypothesis cycle — escalate only after 3 cycles in debug or 2 attempts in fix
Trivial single-file bugs — overhead exceeds value
When the user already knows the answer — they're trying to validate, not diagnose

Protocol:

Pre-bundle gate — emit context.preview to context-engine first; abort if action=block
Build context bundle — see references/context-bundle-format.md for exact format
Dispatch — emit oracle.dispatched signal; route via session-bridge detach if target model is opus-class (non-blocking)
Wait for response — synchronous if model is sonnet-class, polled via .topia/oracle-pending/<id>.json if opus-class
Validate response — every claim MUST cite file:line. Strip + warn on uncited claims (oracle.failed if all claims uncited)
Emit response — oracle.response carries the validated diagnosis, consumed by debug/fix to override or refine their current hypothesis

Bundle format (mandatory regex-validated):

[SYSTEM] You are Oracle, a focused one-shot problem solver. You have NO prior context — assume zero project knowledge. Cite file:line for every claim. Reject any claim you cannot ground in the provided files.

[USER] <agent stuck after N hypothesis cycles. What is the most likely root cause not yet considered?>

### File 1: <relative/path/to/file.ts>
<file content, normalized whitespace, max 4k chars per file>

### File 2: <...>
<...>

Hard caps:

Bundle ≤ 100k tokens (estimated via char count × 0.25)
Per-file ≤ 4k chars (truncate with explicit ... [truncated] marker)
Max 12 files per bundle (force caller to pTopia larger sets)

Response contract — Oracle reply MUST contain:

A primary diagnosis (1-3 sentences)
At least 1 file:line citation per claim
An action recommendation (specific edit, additional file to read, hypothesis to test)

Replies failing this contract are rejected — oracle.failed emitted, primary agent continues without second opinion.

See references/oracle-mode.md for the full protocol and integration with debug/fix.

Constraints

MUST challenge every plan — no rubber-stamping. At minimum, one finding per analyzed dimension
MUST NOT modify the plan or write code — adversary is read-only analysis
MUST reference specific plan sections or existing code for every finding
MUST escalate to sentinel when auth/crypto/payment attack vectors are identified
MUST use concrete attack scenarios, not vague warnings ("could be a problem" is NOT a finding)
MUST NOT block on MEDIUM/LOW findings — only CRITICAL and HIGH trigger REVISE verdict
MUST include Strength Notes — adversary finds weaknesses AND acknowledges what's well-designed
(oracle-mode) MUST emit context.preview BEFORE building the bundle — abort if context-engine action=block
(oracle-mode) MUST validate every Oracle reply citation against the provided files — reject uncited claims as oracle.failed

Nexus Gates

Gate	Requires	If Missing
Plan Gate	A plan document exists (from plan skill or user-provided)	Cannot run — ask for plan first
Codebase Gate	Access to existing codebase (for integration checks)	Skip Step 5, note in report

Sharp Edges

Failure Mode	Severity	Mitigation
Over-challenging — nitpicking every line of the plan	HIGH	Rigor filter: only findings you can justify with specific references. Skip theoretical 3+ condition chains
False security alarms — flagging secure patterns as vulnerable	HIGH	Call sentinel for validation before reporting security findings as CRITICAL
Analysis paralysis — too many findings block all progress	MEDIUM	Max 3 CRITICAL + 5 HIGH. If more found, consolidate or prioritize top impact
Missing context — challenging plan without understanding existing codebase	HIGH	Step 0 MUST load existing code context via scout before challenging
Scope creep — reviewing existing code quality instead of plan quality	MEDIUM	Adversary reviews THE PLAN, not the codebase. Existing code is context only
Redundancy with review/preflight — duplicating post-implementation checks	MEDIUM	Adversary operates PRE-implementation only. Never run adversary on existing code
(oracle-mode) Bundle exceeds token cap — caller didn't pTopia	HIGH	Caller MUST run `context.preview` first; adversary fails fast with `oracle.failed` instead of silently truncating signal
(oracle-mode) Oracle reply has no citations — model improvised	CRITICAL	Reject reply with `oracle.failed`. Primary agent continues without second opinion (better than acting on hallucination)
(oracle-mode) Loop: oracle reply triggers another `agent.stuck`	HIGH	Cap at 1 oracle dispatch per primary-agent stuck cycle. Subsequent stucks must escalate to user

Done When

All relevant dimensions analyzed (minimum: edge cases + security + integration)
Every finding references specific plan section or codebase file
Security-sensitive plans escalated to sentinel (or confirmed not security-relevant)
Verdict rendered: REVISE, HARDEN, or PROCEED
Findings formatted for consumption by build Phase 3 (if PROCEED) or plan (if REVISE)
Strength Notes section acknowledges well-designed aspects of the plan
(oracle-mode) If dispatched: response cited file:line for each claim, or oracle.failed emitted with rejection reason

Returns

Artifact	Format	Location
Adversary Report	Markdown	inline (stdout)
Threat findings	Structured list (CRITICAL/HIGH/MEDIUM)	inline
Risk matrix per dimension	Table	inline
Verdict + remediation list	Markdown	inline
Hardened plan notes (if PROCEED)	Text	passed to build Phase 3

Cost Profile

~4000-8000 tokens input (plan + codebase context), ~2000-3000 tokens output. Opus model for adversarial depth. Runs once per feature plan — high cost justified by preventing wasted implementation cycles.

Scope guardrail: adversary reviews THE PLAN only — never audits existing codebase quality or rewrites code.

adversary

Popularity

Invocation

Context Preview

Supporting Files

SKILL.md

adversary

Popularity

Invocation

Context Preview

Supporting Files

SKILL.md

adversary

Purpose

Triggers

Calls (outbound)

Called By (inbound)

Cross-Hub Connections

Execution

Step 0: Load Context

Step 1: Edge Case Analysis

Step 2: Security Attack Vectors

Step 3: Scalability Stress Test

Step 4: Error Propagation Analysis

Step 5: Integration Risk Assessment

Step 6: Verdict and Report

Output Format

Workflow Modes

Full Red-Team (default)

Quick Challenge (for smaller plans)

Security-Focused

Mode: oracle (v0.2.0)

Constraints

Nexus Gates

Sharp Edges

Done When

Returns

Cost Profile

Similar Skills

adversary

Purpose

Triggers

Calls (outbound)

Called By (inbound)

Cross-Hub Connections

Execution

Step 0: Load Context

Step 1: Edge Case Analysis

Step 2: Security Attack Vectors

Step 3: Scalability Stress Test

Step 4: Error Propagation Analysis

Step 5: Integration Risk Assessment

Step 6: Verdict and Report

Output Format

Workflow Modes

Full Red-Team (default)

Quick Challenge (for smaller plans)

Security-Focused

Mode: oracle (v0.2.0)

Constraints

Nexus Gates

Sharp Edges

Done When

Returns

Cost Profile

Similar Skills