Skill

diagnose

Use when something is broken, tests fail unexpectedly, or behavior doesn't match expectations. Triggers on "this isn't working", "there's a bug", "why is this happening", "debug this", or "diagnose". Do NOT use for new features, performance optimization, or code quality improvements.

Popularity

Stars

Invocation

How this skill is triggered — by the user, by Claude, or both

Slash command

/workflow:diagnose [symptom or issue description]

User invocable

Model invocable

Inline context

Default effort

Argument hint[symptom or issue description]

Context Preview

The summary Claude sees in its skill listing — used to decide when to auto-load this skill

**Philosophy:** Understand WHAT is happening before deciding what to do. Evidence over assumption. Reproduce before theorizing. The right response might be a 5-line fix or a full redesign — diagnosis tells you which.

SKILL.md

609 lines · ~6k tokens(exceeds 5k compaction limit)

Stats

LanguageShell

Stars2

MaintenanceExcellent

Last CommitApr 2, 2026

Actions

View Source View Plugin View on GitHub View README

Stats

Actions

Diagnose: Symptom → Root Cause → Resolution

Philosophy: Understand WHAT is happening before deciding what to do. Evidence over assumption. Reproduce before theorizing. The right response might be a 5-line fix or a full redesign — diagnosis tells you which.

Duration targets: Fix-in-place ~15-30 minutes, targeted beads ~30-60 minutes (diagnosis + bead creation), design escalation ~20-30 minutes (diagnosis only, then hand off). Most time should be spent on Phases 1-2 (reproduce and isolate). If you're spending more time on the fix than on understanding the problem, you may be fixing a symptom, not a cause.

Why This Matters

Bugs are the most common interrupt in software development, yet most debugging time is wasted on the wrong hypothesis. Studies show developers spend 35-50% of their time debugging, but the majority of that time goes to understanding the problem, not writing the fix. The fix itself is usually small — the hard part is finding the right place to fix.

AI agents make this worse when they guess instead of investigate. Without a systematic approach, agents attempt shotgun fixes — changing code based on surface symptoms, introducing new bugs, and burning context window on dead ends. A structured diagnostic process prevents this by enforcing evidence collection before any code changes and escalating appropriately when the fix exceeds simple scope.

Trigger Conditions

Run this skill when:

Something is broken or behaving unexpectedly
User reports a bug or error
Tests are failing unexpectedly
User says "this isn't working", "there's a bug", "why is this happening"
Behavior doesn't match expectations

Do NOT use for:

New feature development → /brainstorm
Performance optimization → Different investigation pattern
Code quality improvements → /review

Stage Gates — AskUserQuestion

At every PAUSE point in this skill, call the AskUserQuestion tool to present structured options to the user. Do not present options as plain markdown text — use the tool. The YAML blocks at each PAUSE point show the exact parameters to pass.

For pattern details and examples: ../_shared/references/stage-gates.md

Fallback: Only if AskUserQuestion is not available as a tool (check your tool list), fall back to presenting options as markdown text and waiting for freeform response.

Collaborative Model

Phase 0: Context Gathering
Phase 1: Reproduce & Verify
  ── (if non-reproducible) PAUSE NR: "Can't reproduce. More details?" ──
Phase 2: Investigation & Isolation
  ── PAUSE 1: "Isolated the fault. Here's what I found." ──
Phase 3: Root Cause Analysis
Phase 4: Triage Decision (with self-review)
  ── Self-review gates presentation ──
  ── Triage fork: ──
      ├─ 5a: Fix-in-Place (isolated, simple)
      │    ── PAUSE 2a: "Here's the proposed fix. Apply?" ──
      ├─ 5b: Targeted Beads (localized, multiple files)
      │    ── PAUSE 2b: "Beads created. Run /execute?" ──
      └─ 5c: Design Required (cross-cutting/systemic)
           ── PAUSE 2c: "Needs design. Run /brainstorm?" ──

Critical Sequence

Phase 0: Context Gathering

Step 0.1 — Read the Error First:

Before doing anything else: read the COMPLETE error message, stack trace, and any linked logs. Do not pattern-match from the first line. AI agents have a specific failure mode where they see an error, match it to a common cause, and start fixing without reading the full output. Read everything. Then proceed.

Step 0.2 — Capture the Symptom:

Ask if not provided:

"What's happening?" (actual behavior)
"What did you expect?" (expected behavior)
"When did this start?" (helps narrow commits)
"Does it happen every time?" (reproducibility)

## Symptom Report
**Actual behavior:** {what's happening}
**Expected behavior:** {what should happen}
**First noticed:** {when}
**Reproducibility:** Always / Sometimes / Once
**Error messages:** {if any — quote in full}

Step 0.3 — Quick Context Scan:

Gather recent context that might be relevant:

Recent commits (especially in the affected area)
Uncommitted changes that could contribute
Recent changes in the affected files or directory

Step 0.4 — Check for Known Issues:

Search project learnings for similar symptoms — past gotchas often explain current bugs. If the project uses an issue tracker, search for existing issues matching this symptom.

Phase 1: Reproduce & Verify

This is the most critical phase. Never skip it.

Step 1.1 — Attempt Reproduction:

## Reproduction
**Branch:** {name}
**Commit:** {hash}

### Environment (include what's relevant)
- Runtime: {OS, browser/client, runtime version}
- Auth/session state: {if applicable — logged in as role X / anonymous}
- Data state: {specific records, empty DB, seeded data}
- Config: {relevant env vars, feature flags, build settings}

### Steps
1. {exact step}
2. {exact step}

### Result
- Expected: {what should happen}
- Actual: {what happens}
- Consistent: Yes / No (frequency: X/10)

Step 1.2 — Verify It's Not Already Fixed:

Check whether the issue exists on the main branch. Use a safe approach — create a temporary branch or worktree rather than stashing uncommitted work, which risks losing changes. If the issue doesn't exist on main, the regression was introduced in the current branch and git history analysis can narrow the culprit.

Step 1.3 — Handle Non-Reproducible Issues:

If you cannot reproduce after 3 genuine attempts:

## Non-Reproducible Issue
**Attempts:** {what you tried}
**Possible explanations:**
- Environment-specific
- Timing/race condition
- Data-dependent
- Already fixed in current code

**Recommended:** Ask user for more details, add logging, review for race conditions.

PAUSE (non-reproducible): Present the non-reproducible issue details (attempts made, environment info, possible explanations) as formatted markdown, then use AskUserQuestion:

AskUserQuestion:
  question: "I couldn't reproduce this issue. How should we proceed?"
  header: "Reproduce"
  multiSelect: false
  options:
    - label: "More details"
      description: "I can provide additional reproduction context or environment info."
    - label: "Add logging"
      description: "Instrument the code to capture the issue when it recurs."
    - label: "Park"
      description: "Save findings and revisit when it happens again with more data."

Phase 2: Investigation & Isolation

Evidence collection and isolation are iterative — each piece of evidence narrows scope, which tells you what to investigate next. Don't treat them as separate sequential phases.

Step 2.1 — Launch Parallel Investigation Tracks:

Track	Focus	Method
Code Path	Trace execution flow	Read affected code, follow the flow
History	What changed recently	`git log`, `git blame` on affected files
Tests	What's passing/failing	Run relevant test suite
Dependencies	Related components	Check what the affected code depends on

Step 2.2 — Code Path Analysis:

Trace execution from entry point through to where behavior diverges from expected. Use Explore agent or LSP tools (go to definition, find references) to follow the flow efficiently.

Step 2.3 — Git History Analysis:

Examine blame and recent changes on affected files. Look for suspect commits — recent changes that touch the failing code path. Example investigation commands:

git blame {affected_file}
git log --oneline -10 -- {affected_path}
git show {suspect_commit} --stat

Step 2.4 — Test Analysis:

Run tests for the affected area. Note missing test coverage for the buggy path — this is often a contributing factor.

Step 2.5 — Narrow Scope Iteratively:

Start broad, narrow systematically:

## Isolation Progress
1. Ruled out {X} → Remaining: {Y}
2. Ruled out {X} → Remaining: {Y}

**Isolated to:** {file}:{function}:{line range}

Step 2.6 — Binary Search Debugging:

If the fault location isn't obvious:

Add logging at midpoint of suspected code
Does issue occur before or after this point?
Repeat, halving the search space each time

For regressions, git bisect is highly effective:

git bisect start
git bisect bad HEAD
git bisect good {known_good_commit}
# Test at each step

Step 2.7 — Minimal Reproducing Case:

Can you reproduce with fewer steps, simpler input, or mocked dependencies? The minimal case often reveals the root cause.

Investigation Budget: Time-box Phase 2 to prevent unbounded investigation:

Simple symptoms: ~15 minutes (clear error, small surface area)
Complex symptoms: ~30 minutes (vague behavior, large surface area)
Systemic symptoms: ~45 minutes (multiple components, timing-dependent)

If the budget is exhausted without isolation, this is diagnostic information — the bug's complexity exceeds quick investigation. Present what you know and let the user decide: continue investigating or escalate.

Circuit Breaker: If 3 consecutive investigation steps yield no progress toward narrowing the fault, stop and reconsider. Your hypothesis may be wrong. Step back, review all evidence from scratch, and consider alternative explanations before continuing down the same path.

Step 2.8 — Document Evidence:

## Evidence

### Code Path
- Entry point: {where execution starts}
- Failure point: {where it goes wrong}
- Flow: A → B → C → [FAILURE] → D

### Git History
**Suspect commits:** {commits that might have introduced issue}

### Test Status
- Passing: {count} | Failing: {count}
- Missing coverage: {areas not tested}

### Dependencies
- Upstream: {what this code depends on}
- Downstream: {what depends on this code}

PAUSE 1: Present the evidence (code path, git history, test status, dependencies, isolation progress) as formatted markdown, then use AskUserQuestion:

AskUserQuestion:
  question: "I've isolated the fault to {area}. Does this align with what you're seeing?"
  header: "Isolation"
  multiSelect: false
  options:
    - label: "Accept (Recommended)"
      description: "Isolation looks correct. Proceed to root cause analysis."
    - label: "Redirect"
      description: "The fault is elsewhere — investigate a different area."
    - label: "More investigation"
      description: "Need deeper analysis before concluding."

Phase 3: Root Cause Analysis

Step 3.1 — The Diagnostic 5 Whys:

Different from brainstorm's 5 Whys — this asks "why is this happening" not "why build this":

## Root Cause Analysis (5 Whys)

**Symptom:** {the bug}

1. Why does {symptom} occur?
   → Because {immediate cause}
2. Why does {immediate cause} happen?
   → Because {deeper cause}
3. Why?
   → Because {even deeper}
4. Why?
   → Because {root cause emerging}
5. Why?
   → Because {ROOT CAUSE}

**Root Cause:** {1-2 sentence summary}

Stop when you reach a cause you can act on. Not every issue needs all 5 levels.

Step 3.2 — Challenge Your Hypothesis:

Before concluding, actively look for evidence that CONTRADICTS your root cause hypothesis. Ask: "If this were NOT the root cause, what else could explain the symptoms?" If you find contradictory evidence, revise the hypothesis before proceeding.

Step 3.3 — Classify the Root Cause:

Category	Description	Example
Logic Error	Code does wrong thing	Off-by-one, wrong condition
State Error	Unexpected state	Null, stale data, race condition
Integration Error	Components miscommunicate	Wrong API usage, contract violation
Configuration Error	Settings wrong	Wrong env var, missing config
Data Error	Bad input/data	Corrupt data, edge case input
Design Flaw	Architecture problem	Missing abstraction, wrong pattern

Step 3.4 — Contributing Factors:

Root cause is necessary but often not sufficient. What else contributed?

| Factor | How It Contributed |
|--------|-------------------|
| {missing test} | Would have caught this |
| {unclear docs} | Led to wrong assumption |
| {recent refactor} | Introduced the regression |

Phase 4: Triage Decision

Step 4.1 — Assess Scope & Complexity:

Scope	Description
Isolated	Single function/method, <20 lines affected
Localized	Single file or tightly coupled set of files
Cross-cutting	Multiple components/services affected
Systemic	Architectural flaw, affects many areas

Complexity	Description
Simple	Clear fix, one approach, minimal risk
Moderate	Clear fix but touches multiple places
Complex	Multiple valid approaches, needs design decisions
Uncertain	Root cause unclear or fix approach unknown

Step 4.2 — Apply Triage Matrix:

Scope	Complexity	→ Action
Isolated	Simple	Fix-in-Place
Isolated	Moderate	Fix-in-Place (with care)
Localized	Simple, single file	Fix-in-Place
Localized	Simple, multiple files	Targeted Beads
Localized	Moderate	Targeted Beads
Localized	Complex	Design Required
Cross-cutting	Any	Design Required
Systemic	Any	Design Required
Any	Uncertain	More Investigation or Design Required

Step 4.3 — Self-Review (gates presentation):

Before presenting the triage and resolution to the user, verify quality:

Theme 1: Evidence Quality

Symptom documented with actual vs expected?
Reproduction verified (or non-reproducibility documented)?
Git history and test status checked?

Theme 2: Isolation Quality

Fault location narrowed to specific area?
Ruled out red herrings?

Theme 3: Root Cause Quality

5 Whys completed to genuine root cause (not just correlation)?
Root cause explains ALL observed symptoms?
Hypothesis challenged with counter-evidence search?
Contributing factors identified?

Theme 4: Triage Quality

Triage decision follows the matrix?
Scope and complexity honestly assessed (not inflated or deflated)?

Theme 5: Proportionality

Is the resolution proportional to the issue? A simple off-by-one shouldn't escalate to "Design Required," and a systemic architecture flaw shouldn't be patched with a one-line fix.
If escalating, is there a genuine design decision to make, or could a simpler approach work?

If any theme fails, return to the relevant phase before proceeding.

Phase 5a: Fix-in-Place (Simple Bugs)

When: Isolated scope + Simple/Moderate complexity

Step 5a.1 — Write a Failing Test First:

Before touching the buggy code, write a test that reproduces the bug. Run it — it MUST fail (proving it catches the bug). If you can't write a failing test, reconsider whether you've truly identified the root cause.

Test scope guidance:

Unit test when the bug is in a single method's logic (off-by-one, wrong condition, null handling)
Integration test when the bug is in component interaction (wrong query, missing join, API contract mismatch)
Match existing test patterns — don't introduce a new test framework or style for a bug fix

Step 5a.2 — Propose the Fix:

## Proposed Fix

**Summary:** {1-2 sentences}
**Why this fixes it:** {connect fix to root cause}

### Changes
| File | Change |
|------|--------|
| {file} | {what changes} |

### Risk
- Regression risk: Low / Medium / High
- Side effects: {any potential}

PAUSE 2a: Present the proposed fix details (summary, changes table, risk level) as formatted markdown, then use AskUserQuestion:

AskUserQuestion:
  question: "Apply this fix? A failing test has been written to capture the bug."
  header: "Fix"
  multiSelect: false
  options:
    - label: "Apply fix (Recommended)"
      description: "Apply the code change and verify the test passes."
    - label: "Modify approach"
      description: "I want a different fix strategy."
    - label: "Escalate"
      description: "This is more complex than it looks — create beads or escalate to design."

Step 5a.3 — Apply Fix (with approval):

Make the code change
Run the regression test — it MUST now pass
Run the full relevant test suite
Verify the original symptom is resolved
Stage specific files and commit following the project's commit conventions from CLAUDE.md

Step 5a.4 — Track and Learn:

If the project uses an issue tracker, offer to create a tracked item documenting the bug and fix: "Want me to create a tracked issue for this bug fix?"

Offer learning capture: "This might be worth capturing as a learning. Run /compound?"

Phase 5b: Targeted Beads (Medium Issues)

When: Localized scope + multiple files need coordinated changes

Step 5b.1 — Save Diagnostic Context:

Create the output directory if it doesn't exist: docs/diagnosis/

Save to ${PROJECT_ROOT}/docs/diagnosis/{issue-slug}.md so beads can reference it:

## Diagnostic Context: {Issue Title}

### Root Cause
{Summary from Phase 3}

### Fix Approach
{High-level approach}

### Affected Files
| File | Required Change |
|------|-----------------|
| {file} | {change needed} |

### Verification
- Regression test: {describe test to write}
- Tests to run: {list}
- Manual verification: {steps}

Step 5b.2 — Create Beads:

Create focused beads that reference the diagnostic context document. Each bead should include a pointer to the saved diagnosis.

If the project uses an issue tracker, create a parent issue linking the beads to the diagnosed bug.

PAUSE 2b: Present the diagnostic context and created beads as formatted markdown, then use AskUserQuestion:

AskUserQuestion:
  question: "Beads created for the fix. How should we proceed?"
  header: "Beads"
  multiSelect: false
  options:
    - label: "Execute (Recommended)"
      description: "Run /execute to implement the fix beads."
    - label: "Modify beads"
      description: "Adjust bead scope or approach before executing."
    - label: "Compound first"
      description: "Capture a learning via /compound before executing."

If user selects "Compound first", run /compound with the diagnostic context, then return to PAUSE 2b to proceed with "Execute".

Phase 5c: Design Required (Complex Issues)

When: Cross-cutting/Systemic scope OR Complex/Uncertain complexity

Save diagnostic context to ${PROJECT_ROOT}/docs/diagnosis/{issue-slug}.md:

## Diagnostic Context: {Issue Title}

### Problem Discovered
**Original symptom:** {what user reported}
**Root cause:** {what we found}
**Why design is needed:** {scope/complexity justification}

### Evidence Summary
| Component | How Affected |
|-----------|--------------|
| {name} | {impact} |

### Constraints Discovered
- {constraint from investigation}

### Questions for Design Phase
- {question that emerged}

If the project uses an issue tracker, create a tracked issue linking the diagnosis to the upcoming design work.

PAUSE 2c: Present the diagnostic context and reasoning as formatted markdown, then use AskUserQuestion:

AskUserQuestion:
  question: "This issue needs design work. How should we proceed?"
  header: "Escalate"
  multiSelect: false
  options:
    - label: "Start brainstorm (Recommended)"
      description: "Run /brainstorm with diagnostic context from docs/diagnosis/{issue-slug}.md."
    - label: "Compound first"
      description: "Capture a learning via /compound before escalating."
    - label: "Park"
      description: "Save diagnosis for later."

If user selects "Compound first", run /compound with the diagnostic context, then return to PAUSE 2c to proceed with "Start brainstorm".

If user selects "Start brainstorm", pass the diagnostic context path to /brainstorm: "Starting brainstorm for {issue title}. Diagnostic context saved at docs/diagnosis/{issue-slug}.md."

Anti-Patterns

Fixing Symptoms, Not Causes — Adding a null check to prevent a crash without asking WHY the value is null. The null check masks the real bug, which will surface elsewhere. The 5 Whys exist specifically to push past the immediate symptom to the underlying cause. If your fix doesn't address the root cause from Step 3.1, it's a band-aid.

Shotgun Debugging — Making multiple speculative changes at once ("maybe it's this, and also this, and let me try this"). When the bug goes away, you don't know which change fixed it — and the other changes may introduce new bugs. Change one thing at a time, verify, then proceed. Discipline here prevents cascading uncertainty.

Guessing Without Evidence — "I think the bug is probably in the auth code" based on intuition rather than evidence. Before theorizing, collect data: git blame, logs, reproduction steps. Evidence narrows the search space; guesses expand it. Phase 2 exists to force evidence-first investigation.

Skipping Reproduction — Jumping straight to code inspection without confirming the bug exists and understanding exactly when it triggers. Without reproduction steps, you can't verify the fix works. If you can't reproduce it, say so and investigate why — that's valuable diagnostic information in itself.

Over-Engineering the Fix — A simple off-by-one bug doesn't need a module refactor. Fix the specific issue, verify it, and move on. If the surrounding code needs improvement, that's a separate brainstorm/design effort — don't mix bug fixes with refactoring. The proportionality theme in self-review catches this.

Premature Escalation — Routing to "Design Required" before actually isolating the issue. Many bugs that look complex turn out to be simple once isolated. Complete the investigation phase before deciding on triage. The triage matrix exists to make this decision systematic, not instinctive.

Confirmation Bias — Forming a hypothesis early and only looking for evidence that supports it. The most dangerous bugs are the ones that almost match a familiar pattern but have a different root cause. Step 3.2 exists to force a counter-evidence search before concluding. If you skip it, your confidence in the diagnosis is unearned.

Exit Signals

Signal	Meaning	Next Action
Fix applied	Simple bug resolved	Offer `/compound` for learning
Beads created	Medium fix ready	Proceed to `/execute`
Design required	Complex issue	Proceed to `/brainstorm`
Cannot reproduce	Insufficient info	Ask user for more details
Not a bug	Working as designed	Explain behavior to user
"park"	Save for later	Document findings, deprioritize

Exit message: "Diagnosis complete. {Resolution summary}."

Skill Version: 3.6 v3.6: Reproduction environment checklist (OS, auth state, data state, config). Investigation time budget by symptom complexity. Test scope guidance (unit vs integration). Inspired by gstack's systematic investigation patterns. v3.4: AskUserQuestion stage gates at all PAUSE points (Decision Gate pattern from stage-gates.md) v3.1: Duration targets, prose-based context gathering (no hardcoded setup commands), structured PAUSE response options at all decision points, non-reproducible path as explicit PAUSE, collaborative model shows three-path fork, proportionality theme in self-review, conditional issue tracker for bug tracking, all resolution paths offer /compound, commit format deferred to CLAUDE.md, safe git practices (no stash prescription), anti-patterns explain WHY

diagnose

Popularity

Invocation

Context Preview

SKILL.md

diagnose

Popularity

Invocation

Context Preview

SKILL.md

Diagnose: Symptom → Root Cause → Resolution

Why This Matters

Trigger Conditions

Stage Gates — AskUserQuestion

Collaborative Model

Critical Sequence

Phase 0: Context Gathering

Phase 1: Reproduce & Verify

Phase 2: Investigation & Isolation

Phase 3: Root Cause Analysis

Phase 4: Triage Decision

Phase 5a: Fix-in-Place (Simple Bugs)

Phase 5b: Targeted Beads (Medium Issues)

Phase 5c: Design Required (Complex Issues)

Anti-Patterns

Exit Signals

Similar Skills

Diagnose: Symptom → Root Cause → Resolution

Why This Matters

Trigger Conditions

Stage Gates — AskUserQuestion

Collaborative Model

Critical Sequence

Phase 0: Context Gathering

Phase 1: Reproduce & Verify

Phase 2: Investigation & Isolation

Phase 3: Root Cause Analysis

Phase 4: Triage Decision

Phase 5a: Fix-in-Place (Simple Bugs)

Phase 5b: Targeted Beads (Medium Issues)

Phase 5c: Design Required (Complex Issues)

Anti-Patterns

Exit Signals

Similar Skills