Graduated autonomous research for evaluating tools, technologies, ideas, and approaches. Use when researching any topic that requires consulting multiple sources, comparing options, or building a structured knowledge base. Triggers on "research X", "evaluate X", "compare X vs Y", "what's the best approach for X", "investigate X".
How this skill is triggered — by the user, by Claude, or both
Slash command
/agent-workflow-toolkit:deep-research-toolkitThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
Graduated, source-cited research pass that scales depth to the question.
Graduated, source-cited research pass that scales depth to the question.
systematic-debugging)DEPTH = auto # auto | quick | medium | deep
MAX_SOURCES = 50 # cap per depth level: quick=10, medium=25, deep=50
OUTPUT_DIR = docs/research/ # findings go here (or .claude/docs/.../research/ for private)
STATE_FILE = .research-state.json
Override via natural language: "research X at deep depth" or "quick research on Y."
| Level | Time | When | What happens |
|---|---|---|---|
| Quick | 5–10 min | Clear question, likely documented | Web search + skim top results. Stop if confident. |
| Medium | 15–30 min | Multiple options to compare, need structured analysis | Multiple sources, build comparison, check GitHub repos, read docs. |
| Deep | 30–60 min | Novel topic, high-stakes decision, need exhaustive coverage | Parallel subagent research, cross-reference, build knowledge base, verify claims. |
Auto mode (default): start at Quick. If confidence is LOW after Quick, escalate to Medium. If still LOW after Medium, escalate to Deep. Stop as soon as confident.
Before searching anything:
Present the framing to the user. Wait for confirmation unless explicitly told to auto-proceed.
Search external sources. Never answer from training data — it's stale and prone to fabrication. Use whichever of these tools are available in the current session, in roughly this priority order:
| Tier | Tool | When |
|---|---|---|
| Always available | WebSearch (built-in) | Broad coverage, start here |
| Always available | WebFetch (built-in) | Pulling specific URLs surfaced by search |
| Plugin (optional) | firecrawl:firecrawl-search, firecrawl:firecrawl-scrape | Higher-quality scraping when content is JS-rendered |
| Plugin (optional) | mcp__context7__resolve-library-id + query-docs | Authoritative library / framework docs |
| Tool (optional) | gh api | GitHub repo metadata, stars, recent commits, issues |
| Project | Existing docs/research/ or .claude/docs/<feature>/research/ | Avoid redoing past work |
If none of the web-capable tools are available, stop and tell the user — do not invent answers from memory.
Assess confidence:
Build a structured comparison or analysis using one of three templates.
Tool / technology evaluation template:
## [Topic] Research
### Candidates
| Tool | What it is | Stars/Adoption | Last updated | License |
|---|---|---|---|---|
### Evaluation dimensions
| Dimension | Tool A | Tool B | Tool C |
|---|---|---|---|
| Setup complexity | | | |
| Documentation quality | | | |
| Community size | | | |
| Maintenance activity | | | |
| Cost | | | |
| Fit for our use case | | | |
### Key findings
### Recommendation
### Sources
Approach / architecture evaluation template:
## [Topic] Research
### Options considered
- Option A — …
- Option B — …
- Option C — …
### Trade-offs
| Dimension | Option A | Option B | Option C |
|---|---|---|---|
### Decision factors
- What matters most for our context?
- What are the irreversible choices?
- What can we change later?
### Recommendation
Idea / feasibility investigation template:
## [Topic] Research
### Prior art
What already exists? Who's done this before?
### Technical feasibility
- Can this be built with available tools?
- What are the hard parts?
### Open questions
What we still don't know.
### Next steps
What to investigate further.
Spawn parallel subagents (the Agent tool) for independent research tracks:
Agent 1: Research track A — specific question
Agent 2: Research track B — specific question
Agent 3: Research track C — specific question
Each subagent:
When using parallel agents, set model: "opus" and run_in_background: true per project convention. After all subagents return, synthesize:
Write findings to OUTPUT_DIR/<topic-slug>.md with:
For medium / deep research that spans multiple interactions, write .research-state.json next to the findings file:
{
"topic": "...",
"depth": "medium",
"phase": 3,
"started": "2026-05-15T22:00:00Z",
"sources_consulted": 15,
"findings_file": "docs/research/topic-slug.md",
"status": "in_progress",
"next_action": "evaluate remaining 3 candidates"
}
On resumption: read state file + findings file to restore context. Continue from saved phase.
| Anti-pattern | Why it fails | Correct behavior |
|---|---|---|
| Answering from memory | Training data is stale; you may hallucinate details | Always search, even for topics you "know" |
| Single-source trust | One blog post is not truth | Cross-reference 2+ independent sources |
| Scope creep | Researching everything instead of answering the question | Stay focused on the success criteria from Phase 1 |
| Analysis paralysis | Collecting more data instead of synthesizing | Set a source cap per depth level. Stop and synthesize. |
| Recency bias | Newest is not best | Consider maturity + stability, not just publish date |
| Ignoring negatives | Only finding evidence FOR an option | Actively search for criticisms and failure cases |
docs/research/<topic-slug>.md.claude/docs/<feature>/research/<topic-slug>.mdPick based on whether the artifact is meant to be shared with the team via the repo, or kept as personal working notes.
Guides creation, editing, and verification of skills for AI coding agents using test-driven development with subagent scenarios. Use when authoring or debugging skills.
npx claudepluginhub slowcraft/agentic-playbook_features-workflow-toolkit --plugin agent-workflow-toolkit