From superpowers-plus
Tests debugging hypotheses through controlled reproduction attempts. Designs and executes experiments, compares environments, records outcomes, and reports whether a hypothesis is confirmed or rejected with evidence confidence scoring.
How this skill is triggered — by the user, by Claude, or both
Slash command
/superpowers-plus:reproduction-experiment-investigatorThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
> **Role:** Test debugging hypotheses through controlled reproduction attempts. Confirm or reject with evidence.
Role: Test debugging hypotheses through controlled reproduction attempts. Confirm or reject with evidence. Dispatched by:
debug-conductor— never invoked directly by user. Evidence type:ExperimentEvidence(seeskills/_shared/evidence-schema.md)
Dispatched by debug-conductor when a hypothesis needs testing — controlled reproduction, environment comparison, or A/B verification of a suspected root cause.
From the conductor, receive:
Refine into a testable experiment:
For each attempt:
Minimum 3 attempts — intermittent bugs need statistical confidence.
| Attempts | Reproductions | Confidence |
|---|---|---|
| 3/3 | 3 | High (>0.8) — hypothesis strongly supported |
| 2/3 | 2 | Medium (0.5–0.8) — likely correct but intermittent |
| 1/3 | 1 | Low (0.3–0.5) — possible but unreliable |
| 0/3 | 0 | Very Low (<0.3) — hypothesis likely wrong OR environment mismatch |
If reproduction succeeded:
{
"hypothesis": "Event ordering bug in async pipeline under load",
"steps": [
{ "action": "Set event processing to async mode", "result": "Config applied", "success": true },
{ "action": "Send 50 concurrent call events", "result": "Events arrived out of order in 12/50 cases", "success": true },
{ "action": "Verify call state machine diverged", "result": "3 calls in disconnected state prematurely", "success": true }
],
"outcome": "reproduced",
"reproduced": true,
"attempts": 3,
"successRate": 1.0,
"environmentDiff": "Staging uses lower load (50 concurrent vs 500 in prod); reproduction rate may differ"
}
| Pattern | Evidence Shape |
|---|---|
| Deterministic bug | 3/3 reproduction, 0/3 without condition → confirmed |
| Load-dependent bug | Reproduces only above certain concurrency threshold |
| Environment-specific | Reproduces in prod-like environment but not staging → config/infra difference |
| Intermittent / race condition | 1–2/3 reproduction → timing-dependent |
| Hypothesis disproven | 0/3 reproduction even with condition → reject hypothesis |
| Mode | Symptom | Recovery |
|---|---|---|
| Incomplete isolation | Test affected by shared state | Reset environment between experiments |
| False confirmation | Coincidental success in reproduction | Run multiple trials |
| Wrong variable | Testing irrelevant hypothesis | Verify hypothesis matches symptoms |
npx claudepluginhub bordenet/superpowers-plus --plugin superpowers-plusGuides creation, editing, and verification of skills for AI coding agents using test-driven development with subagent scenarios. Use when authoring or debugging skills.