Audits post-iteration behavior evidence quality in three tiers: deep evidence for stories, impacted scenarios, sentinel corpus regressions using parallel adversarial review.
How this skill is triggered — by the user, by Claude, or both
Slash command
/iterative-development:auditing-progressThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
Runs after every iteration as part of the planning cycle. Verifies behavior evidence quality in three tiers using **parallel adversarial review (PAR)** — two paired auditor subagents evaluate the same work in parallel with competitive framing.
Runs after every iteration as part of the planning cycle. Verifies behavior evidence quality in three tiers using parallel adversarial review (PAR) — two paired auditor subagents evaluate the same work in parallel with competitive framing.
The audit answers: "Does durable, reusable evidence exist at the correct seam for every externally observable behavior this iteration touched?"
Invoked by iterative-development after every running-an-iteration call, before picking the next iteration.
Read the per-epic requirement files in docs/superpowers/iterations/requirements/, docs/superpowers/iterations/behavior-scenarios.md, and docs/superpowers/iterations/behavior-corpus.md:
done:ITER-<current> and scenarios added or updated in this iteration. Audit every AC and its proof obligation thoroughly.sentinel in the behavior corpus. Compare against the pre-iteration baseline from running-an-iteration step 3.Following the PAR methodology in skills/shared/parallel-adversarial-review.md:
auditor-subagent-prompt.md. Include ALL THREE tiers:
skills/shared/par-reviewer-wrapper.mdFollowing PAR aggregation rules:
requirements/ (status pending) or flip existing stories back from done to pendingroadmap.md to add a follow-up iteration for the gapsReturn the audit result (clean or gaps) to the orchestrator. The orchestrator decides whether to loop or terminate.
| Tier | What it checks | Failure means |
|---|---|---|
| Deep evidence | Every AC + proof obligation for current iteration | Story not done, evidence too weak |
| Impacted behavior | Scenarios whose surfaces were touched | Stale or broken scenario |
| Sentinel corpus | High-value journey scenarios | Regression in previously-working behavior |
| Reads | Writes | Dispatches |
|---|---|---|
requirements/, behavior-scenarios.md, behavior-corpus.md, product code/tests | requirements/ (gaps), roadmap.md (new iteration) if gaps, behavior-scenarios.md (stale flags) | Two auditor subagents in parallel (PAR) |
skills/shared/parallel-adversarial-review.md — PAR methodologyskills/shared/par-reviewer-wrapper.md — competitive framing wrapperskills/shared/behavior-evidence-formats.md — scenario and proof obligation formatsauditor-subagent-prompt.md — auditor-specific prompt templatenpx claudepluginhub prime-radiant-inc/prime-radiant-marketplace --plugin iterative-developmentExecutes next pending iteration from iterative-development roadmap: picks iteration, decomposes code/evidence tasks, runs sentinel baseline, dispatches implementing tasks, tests impacted/sentinel scenarios, updates artifacts.
Validates AI agent claims against evidence trail in coding workflows. Catches unsubstantiated 'done', 'tests pass', 'fixed' without proof like outputs, diffs, or logs. Auto-triggers on completion keywords.
Verifies completed work with a 3-tier evidence-based process. Validates tests, linting, types, builds exist and pass, plus deep audit for milestones and PRs. Enforces no completion claims without fresh evidence.