wicked-testing

Name: wicked-testing
Author: mikeparcewski

Standalone QE library — 5-core testing surface (plan/authoring/execution/review/insight), SQLite ledger with fixed-SQL oracle, optional wicked-bus + wicked-brain integration.

npx claudepluginhub mikeparcewski/wicked-testing --plugin wicked-testing

Popularity

Stars

Med: 0 · Avg: 285

Installs

Med: 0 · Avg: 1

What's Inside

Slash Commands (7)

/wicked-testing:acceptance

/acceptance

Run the 3-agent acceptance pipeline (Writer → Executor → Reviewer) on a scenario file

/wicked-testing:authoring

/authoring

Author scenarios and generate test code — dispatches the authoring skill

/wicked-testing:execution

/execution

Run tests and capture evidence — dispatches the execution skill

/wicked-testing:insight

/insight

Query the ledger — stats, reports, flake detection, coverage gaps

/wicked-testing:plan

/plan

Generate a test strategy — dispatches the plan skill's 4-way router (strategist / risk / testability / AC-quality)

Agents (40)

acceptance-test-executor

/acceptance-test-executor

Follows structured wicked-testing test plans step-by-step, collecting evidence artifacts. Executes and captures only — does not judge or grade pass/fail. Writes evidence files to .wicked-testing/evidence/{run-id}/. Use when: acceptance test execution, evidence collection, test plan execution <example> Context: Test plan is ready and needs to be executed step by step. user: "Execute the acceptance test plan for the file upload feature." <commentary>Use acceptance-test-executor for mechanical step execution and evidence capture without judging results.</commentary> </example>

acceptance-test-reviewer

/acceptance-test-reviewer

Evaluates evidence artifacts against test plan assertions independently. CRITICAL ISOLATION: Receives ONLY evidence file paths. Never sees execution context. Catches semantic bugs that self-grading misses. Use when: acceptance test review, evidence evaluation, test verdict <example> Context: Executor produced evidence and it needs independent evaluation. user: "Review the evidence from the file upload acceptance tests and render a verdict." <commentary>Use acceptance-test-reviewer for independent, unbiased verdict on test evidence.</commentary> </example>

acceptance-test-writer

/acceptance-test-writer

Reads wicked-testing acceptance scenarios and produces structured, evidence-gated test plans. Transforms qualitative criteria into concrete, verifiable artifact requirements. Use when: acceptance testing, test plan generation, scenario verification design <example> Context: New feature scenario needs a structured test plan. user: "Write an acceptance test plan for the 'user can export data as CSV' scenario." <commentary>Use acceptance-test-writer to produce structured, evidence-gated test plans from scenarios.</commentary> </example>

code-analyzer

/code-analyzer

Static code analysis for testability, quality, and maintainability. Reviews code structure, identifies test-coverage gaps, and flags risky areas. Use when: static analysis, code-quality metrics, testability assessment, maintainability review, coverage-gap identification. Runs on arbitrary source code at any time; it does not require a spec or an active build phase.

NOT THIS WHEN:
- Reviewing acceptance criteria for SMART+T (pre-code, no implementation yet): use `requirements-quality-analyst`
- Judging whether the implementation matches a spec (post-code divergence detection): use `semantic-reviewer`
- Rendering a full acceptance verdict (writer + reviewer + executor pipeline): use `/wicked-testing:acceptance`

contract-testing-engineer

/contract-testing-engineer

API contract testing specialist. Designs and reviews consumer-driven contracts, Pact-style tests, OpenAPI contract verification, schema versioning, and breaking-change detection across service boundaries. Use when: API contract tests, CDC, Pact, OpenAPI verification, schema versioning, breaking-change detection, provider/consumer negotiation.

Skills (7)

wicked-testing:acceptance-testing

/acceptance-testing

Evidence-gated acceptance testing with three-agent separation of concerns. Writer designs test plans, Executor collects artifacts, Reviewer evaluates independently. Eliminates false positives from self-grading. Use when: "acceptance test", "verify it works", "did it pass", "run acceptance", "test this scenario", "acceptance criteria", "validate the feature", "/wicked-testing:acceptance"

wicked-testing:authoring

/authoring

Tier-1 orchestrator for producing tests. Writes scenario files, generates test code (unit / integration / E2E), creates fixtures and test data. The "make me tests" skill. Use when: "write tests", "generate test code", "author scenarios", "create a scenario file", "add fixtures", "test data setup", "automate this scenario".

wicked-testing:execution

/execution

Tier-1 orchestrator for running tests and capturing evidence. Executes scenarios, invokes framework runners, collects artifacts, and writes the run + verdict to the ledger. Use when: "run the test", "execute this scenario", "run the suite", "acceptance test this", "capture evidence", "prove it works".

wicked-testing:insight

/insight

Tier-1 orchestrator for reading the ledger. Stats, reports, flake detection, coverage gaps, historical queries. Never writes — only reads. Use when: "has this passed recently", "flake rate", "show me the last N runs", "coverage gaps", "generate a report", "stats", "exploratory session".

wicked-testing:plan

/plan

Tier-1 orchestrator for test planning. Covers test strategy, risk, testability review, and requirements quality. Dispatches specialist agents based on what the target needs. Use when: "what should I test", "test strategy", "test plan", "risk matrix", "is this testable", "are these requirements testable", "coverage strategy", "shift-left testing".

Hooks (1)

Event Hooks

1 hook across 1 event

Stats

Version: 0.6.0
Released: Jun 18, 2026
Language: JavaScript
Stars: 0
Maintenance: Excellent
Last Commit: Jun 18, 2026
Added: Apr 21, 2026

Safety Signals

Caution

Uses power tools

Uses Bash, Write, or Edit tools

README

           _      _            _       _            _   _             
 __      _(_) ___| | _____  __| |     | |_ ___  ___| |_(_)_ __   __ _ 
 \ \ /\ / / |/ __| |/ / _ \/ _` |_____| __/ _ \/ __| __| | '_ \ / _` |
  \ V  V /| | (__|   <  __/ (_| |_____| ||  __/\__ \ |_| | | | | (_| |
   \_/\_/ |_|\___|_|\_\___|\__,_|      \__\___||___/\__|_|_| |_|\__, |
                                                                 |___/

40 specialist agents. 5 coordinating skills. A 3-agent acceptance pipeline that eliminates self-grading.

npx wicked-testing install

Works with Claude Code, Gemini CLI, Cursor, Codex, and Kiro.

The Problem

When you ask an AI agent to test its own work, it grades its own homework. Self-reported PASS rates on agentic test runs sit 80%+ above human-reviewed rates. The agent that wrote the code also runs the tests and evaluates the results — there is no independence at any layer.

The industry answer has been scripted test frameworks: Playwright, pytest, k6, axe-core. But those only run what you already thought to test. They don't tell you what to test, whether the tests are any good, whether the results mean anything, or why the suite keeps failing intermittently on CI.

wicked-testing gives your AI CLI a complete QE team — from planning through execution through judgment — with enforced separation between the agent that runs tests and the agent that evaluates them.
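The separation described above can be illustrated with a minimal sketch. It is not wicked-testing's actual implementation: the function names (`executePlan`, `reviewEvidence`), the plan shape, and the evidence file format are all assumptions made for illustration, and a temporary directory stands in for the project's evidence directory. The point it shows is that the executor only captures what it observed, while the reviewer is handed nothing but evidence file paths and the plan's assertions.

```javascript
// Hypothetical sketch of executor/reviewer separation.
// All names and file formats here are illustrative assumptions.
const fs = require("fs");
const os = require("os");
const path = require("path");

// Executor: runs each step and records what it observed. It decides nothing.
function executePlan(plan, evidenceDir) {
  fs.mkdirSync(evidenceDir, { recursive: true });
  return plan.steps.map((step) => {
    const observed = step.run(); // capture only; no pass/fail here
    const file = path.join(evidenceDir, `${step.id}.json`);
    fs.writeFileSync(file, JSON.stringify({ stepId: step.id, observed }));
    return file;
  });
}

// Reviewer: receives only evidence file paths and the expected values.
// It has no access to the executor's in-memory state or logs.
function reviewEvidence(evidencePaths, assertions) {
  const results = evidencePaths.map((file) => {
    const { stepId, observed } = JSON.parse(fs.readFileSync(file, "utf8"));
    return { stepId, pass: observed === assertions[stepId] };
  });
  return { verdict: results.every((r) => r.pass) ? "PASS" : "FAIL", results };
}

// Usage with a made-up two-step plan; the second step does not match.
const plan = {
  steps: [
    { id: "login-status", run: () => 200 },
    { id: "session-cookie", run: () => "missing" },
  ],
};
const assertions = { "login-status": 200, "session-cookie": "set" };

const evidenceDir = fs.mkdtempSync(path.join(os.tmpdir(), "evidence-"));
const evidencePaths = executePlan(plan, evidenceDir);
const review = reviewEvidence(evidencePaths, assertions);
console.log(review.verdict); // prints "FAIL"
```

Because the reviewer's only inputs are file paths and assertions, a step that the executor believed was fine still fails if the captured evidence does not match.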

What You Get

claude plugins marketplace add mikeparcewski/wicked-testing
claude plugins install wicked-testing

Then:

# Generate a shift-left test strategy from your codebase
/wicked-testing:plan src/auth/ --project auth-service

# Run the 3-agent acceptance pipeline with enforced reviewer isolation
/wicked-testing:acceptance scenarios/login-positive.md

# Ask plain-English questions about your test history
/wicked-testing:insight "what was the last verdict for the login scenario?"

Under the hood: a project-local SQLite ledger, 40 specialist agents grouped into 5 Tier-1 skills, and a public event contract for wicked-garden integration.
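One way to picture a ledger that answers questions through fixed SQL only is a registry of named, parameterized queries. The sketch below is an assumption-heavy illustration, not the plugin's code: the query names, the `runs` table, and its columns are hypothetical, and no SQLite driver is invoked. It only shows how a caller can be limited to choosing a query name and supplying bound values, with the SQL text never assembled from user input.

```javascript
// Hypothetical registry of fixed queries. Query names, table name, and
// column names are illustrative assumptions, not the real ledger schema.
const NAMED_QUERIES = Object.freeze({
  last_verdict_for_scenario: {
    sql: "SELECT verdict, finished_at FROM runs WHERE scenario = ? ORDER BY finished_at DESC LIMIT 1",
    params: ["scenario"],
  },
  recent_runs: {
    sql: "SELECT id, scenario, verdict FROM runs ORDER BY finished_at DESC LIMIT ?",
    params: ["limit"],
  },
});

// Resolve a query by name. Unknown names and missing parameters are rejected,
// so the SQL text is always one of the fixed strings above.
function resolveQuery(name, args = {}) {
  if (!Object.prototype.hasOwnProperty.call(NAMED_QUERIES, name)) {
    throw new Error(`Unknown query name: ${name}`);
  }
  const { sql, params } = NAMED_QUERIES[name];
  const values = params.map((param) => {
    if (!(param in args)) {
      throw new Error(`Missing parameter "${param}" for query ${name}`);
    }
    return args[param];
  });
  // The pair would be handed to a SQLite driver's prepared statement.
  return { sql, values };
}

const query = resolveQuery("last_verdict_for_scenario", { scenario: "login-positive" });
console.log(query.sql);
console.log(query.values);
```

In this arrangement, mapping a plain-English question to a query name is the agent's job; anything that does not map to a registered name is refused rather than turned into new SQL.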

40 Agents, 5 Tier-1 Skills

Tier-1 Agents — Public Contract

The 15 Tier-1 agents form the stable integration surface. wicked-garden and other consumers depend only on these.

| Agent | Invoked By | What It Does |
| --- | --- | --- |
| `test-strategist` | `plan` | Maps codebase to test scenarios: positive, negative, edge cases |
| `testability-reviewer` | `plan` | Blocks designs that will be hard to test before a line is written |
| `requirements-quality-analyst` | `plan` | Applies SMART+T to acceptance criteria: ready-for-design or needs-iteration |
| `risk-assessor` | `plan` | Scores risks by likelihood × impact, produces a mitigation matrix |
| `test-designer` | `authoring` | Full write→execute→analyze→verdict loop from a scenario file |
| `test-automation-engineer` | `authoring` | Generates test code in the project's detected framework |
| `contract-testing-engineer` | `authoring` | Consumer-driven contract tests (Pact-style), breaking-change detection |
| `code-analyzer` | `authoring` | Static quality + testability signals, ship/fix/refactor verdict |
| `acceptance-test-writer` | `execution` | Evidence-gated test plan: every step declares expected evidence and an assertion |
| `acceptance-test-executor` | `execution` | Executes plan mechanically, captures artifacts, makes no judgment |
| `acceptance-test-reviewer` | `review` | Reads cold evidence only (`allowed-tools: Read`); never sees executor context |
| `scenario-executor` | `execution` | Runs a scenario markdown file step-by-step |
| `semantic-reviewer` | `review` | Gap Report per AC: aligned / divergent / missing |
| `production-quality-engineer` | `insight` | Post-deploy health: healthy / degraded / unhealthy + next action |
| `test-oracle` | `insight` | Plain-English questions → 12 named parameterized SQL queries. No ad-hoc SQL. |
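The likelihood × impact scoring attributed to `risk-assessor` in the table above can be sketched in a few lines. The 1 to 5 scales, the level thresholds, the sample risks, and the `scoreRisks` function name are all assumptions chosen for illustration; the agent's real scales and output format are not documented here.

```javascript
// Hypothetical risk scoring: likelihood and impact on assumed 1-5 scales.
// Thresholds for "high" / "medium" / "low" are illustrative, not the agent's.
function scoreRisks(risks) {
  return risks
    .map((risk) => {
      const score = risk.likelihood * risk.impact;
      const level = score >= 15 ? "high" : score >= 8 ? "medium" : "low";
      return { ...risk, score, level };
    })
    .sort((a, b) => b.score - a.score); // highest risk first
}

// Made-up risks for an auth service.
const matrix = scoreRisks([
  { name: "token refresh race", likelihood: 2, impact: 5 },
  { name: "password reset email delay", likelihood: 3, impact: 2 },
  { name: "session fixation", likelihood: 4, impact: 5 },
]);

for (const row of matrix) {
  console.log(`${row.name}: ${row.score} (${row.level})`);
}
```

Sorting by score puts the items that most need mitigation at the top of the matrix, which is the order a test plan would usually address them in.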

Tier-2 Specialist Agents — Internal

25 domain specialists routed by the Tier-1 skills. Because they are not part of the public contract, changes to them never break downstream consumers.
