From bugbash
Core bug-bash workflow with mission brief, intensity modes, risk triage, evidence bars, and structured outputs—running system first, code second; find and reproduce, do not fix unless asked.
How this skill is triggered — by the user, by Claude, or both
Slash command
/bugbash:bugbash-coreThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
- **Goal:** **Reproduce** misbehavior or **characterize** risk with evidence—not to own remediation.
Keep this to two minutes unless the user already specified everything.
Confirm or state assumptions for:
| Field | Question |
|---|---|
| Target | What surface or feature is in scope? |
| Environment | Local, staging URL, branch/commit? |
| Intensity | quick | standard | deep (default: standard) |
| Constraints | No prod? No billing? No deletes? Data rules? |
| Timebox | Optional wall-clock or scenario cap |
If anything safety-critical is unclear, ask once; otherwise proceed with labeled assumptions.
quick (~ smoke + sanity)standard (default)deepBefore testing randomly, rank surfaces by:
Spend deep mode’s extra time on the top of this list.
Tune depth to intensity. Cover when relevant:
deep)| Severity | Minimum evidence |
|---|---|
| S1 | Repro twice OR one repro plus strong corroboration (e.g. clear data corruption, definitive 500 + stack); state blast radius. |
| S2 | Reliable repro steps + concrete artifact (status/body/log/assertion). |
| S3 | Clear repro; one run acceptable if stable. |
| S4 | Observable issue; screenshot or short description OK. |
If you cannot meet the bar, downgrade severity or move to Suspected / flaky.
Track these apart from the main findings table:
| ID | Hypothesis | Attempts | Flake rate | Notes |
|---|
Do not merge unconfirmed flakes into S1/S2 without meeting the evidence bar.
| ID | Severity | Surface | Summary | Repro steps | Expected | Actual | Evidence | Confidence |
|---|
Severity: S1–S4 as before (critical → low). Confidence: High / Medium / Low.
Short audit trail:
If the user might file tickets or script follow-ups, end with a fenced JSON array using this shape (omit sensitive values):
[
{
"id": "BB-001",
"severity": "S2",
"surface": "api",
"title": "Short title",
"repro": ["step1", "step2"],
"expected": "...",
"actual": "...",
"evidence_type": "http|log|ui|cli",
"confidence": "high"
}
]
Use when authoring an integration test plan before implementation — when about to design tests for a multi-component feature, when probing for already-broken behavior, when a domain has a +bugfix history worth learning from.
Proactively hunts for bugs by analyzing codebase risk (complexity, coverage, structure), then spawns investigators that write reproducing tests to validate suspected bugs. Advisory only — produces findings and tickets, no fixes.
npx claudepluginhub riwaht/claude-bugbash --plugin bugbash