Skill

deep-research

Conduct deep, multi-agent research on a topic and document findings in an Obsidian vault with optional goodmem ingestion. Auto-scales from single-agent (narrow topics) to full 3-tier hierarchy with Opus managers and Sonnet data collectors (broad topics). Use when the user asks to "deep research", "do comprehensive research on", "research everything about", "build a knowledge base on", or "create a reference on" a topic. Do NOT use for quick factual questions, single lookups, or casual "what is X" queries.

Invocation

How this skill is triggered — by the user, by Claude, or both

Slash command

/deep-research:deep-research

User invocable

Model invocable

Inline context

Default effort

Tool Access

This skill is limited to the following tools:

BashReadWriteGrepGlobAgentTodoWriteTaskCreateTaskUpdateWebSearchWebFetchmcp__plugin_goodmem_goodmem__goodmem_memories_retrievemcp__plugin_goodmem_goodmem__goodmem_memories_getmcp__plugin_goodmem_goodmem__goodmem_memories_create

Context Preview

The summary Claude sees in its skill listing — used to decide when to auto-load this skill

You are conducting a deep research run on the user's behalf. Your job is to decompose the topic, dispatch research agents, and produce a permanent Obsidian vault reference. You run fully autonomously — no checkpoints, no pauses. Report results at the end.

SKILL.md

429 lines · ~5.3k tokens(exceeds 5k compaction limit)

Stats

Parent stars0

MaintenanceGood

Last CommitApr 14, 2026

Actions

View Source View Plugin View on GitHub View README

Stats

Actions

Deep Research

Step 1: Parse arguments

The user passed an argument string. Parse it into:

Topic — everything before the first -- flag. Required. If empty, ask the user what to research and stop.
--path — optional vault path override (absolute). If omitted, auto-detect in Step 3.
--tier — optional forced tier (1-5). If omitted, auto-detect in Step 3.

Step 2: Reconnaissance

Query existing knowledge BEFORE planning. Run these in parallel:

GoodMem Learnings (if configured) — search for the topic using goodmem_memories_retrieve. The orchestrator's environment determines the space ID and reranker config.
Existing vault — Glob for related folders and files at the user's vault path (typically ~/vault/ or wherever Obsidian stores notes):

Glob: <vault-root>/**/*<topic-keywords>*
Glob: <vault-root>/Libraries/*/00 - Index.md
Glob: <vault-root>/Projects/*/00 - Index.md

Existing vault content check — if files already exist at the target vault path, read the MOC to understand what's covered.

Recon decision

If the topic is already well-covered (existing vault section with 5+ files and recent dates), tell the user:

"Found existing research at ( files, last updated ). Want me to update/extend it, or do a fresh deep research run?"

Wait for the user's answer. If they want an update, adjust your plan to fill gaps rather than re-research everything.

If the topic is NOT well-covered, proceed to Step 3.

Step 3: Planning

3a: Decompose the topic

Break the research topic into N non-overlapping domains. Each domain should be:

Self-contained (can be researched independently)
Non-overlapping (no two domains cover the same sub-topic)
Roughly equal in scope (each produces a file of similar size)

For each domain, produce:

A domain name (becomes the vault file title)
A numbered file name following the NN - Title.md convention
10-30 specific sub-questions to investigate
A list of likely source types (web, context7, vault, GitHub)

3b: Auto-detect tier (unless --tier forced)

Every tier dispatches an Opus manager with mandatory Sonnet collectors. No tier does solo research.

Domains	Tier	Collector floor	Behavior
1	1	2	Single manager
2	2	3	Single manager per domain
3-4	3	4	Single manager per domain
5-7	4	6	Multiple managers, intermediate synthesis files
8+	5	8	Large-scale multi-manager, intermediate synthesis, extensive cross-referencing

3b.1: Compute COLLECTOR BUDGET per domain

The tier floor is a MINIMUM, not the actual budget. Compute per domain:

questions = number of sub-questions in this domain's SCOPE
floor = tier floor from table above (2, 3, 4, 6, or 8)
computed = ceil(questions / 4)     # ~4 questions per collector
budget = max(floor, computed)
budget = min(budget, 10)           # hard cap at 10 per manager

Examples:

Tier 1, 6 questions: max(2, ceil(6/4)) = max(2, 2) = 2
Tier 2, 10 questions: max(3, ceil(10/4)) = max(3, 3) = 3
Tier 3, 16 questions: max(4, ceil(16/4)) = max(4, 4) = 4
Tier 4, 20 questions: max(6, ceil(20/4)) = max(6, 5) = 6
Tier 4, 30 questions: max(6, ceil(30/4)) = max(6, 8) = 8
Tier 5, 40 questions: min(max(8, ceil(40/4)), 10) = min(10, 10) = 10

This ensures narrow domains don't waste collectors while broad domains get proportional coverage. The budget is computed per domain, not globally — different domains may get different budgets.

3c: Auto-detect vault path (unless --path forced)

1. Check existing vault folders:
   Glob <vault-root>/Libraries/*/ and <vault-root>/Projects/*/
   If topic matches an existing folder name (case-insensitive substring)
     -> use that existing folder
2. If no match, classify:
   Known library/framework/tool -> <vault-root>/Libraries/<Name>/
   Everything else -> <vault-root>/Projects/<Sanitized Topic>/
3. If path exists with content -> write alongside, update MOC

Sanitize the topic for use as a folder name: preserve spaces (Obsidian handles them fine), remove special characters except hyphens.

3d: Create scratch directory (Tier 4+ only)

mkdir -p /tmp/deep-research-$(date +%s)

Store this path — managers will write intermediate files here.

3e: Log the plan

Use TaskCreate to log each domain as a task. This gives the user visibility into progress:

TaskCreate: "Research domain: <domain name>" for each domain
TaskCreate: "Finalize MOC cross-cutting sections + cleanup"

3f: Initialize the MOC scaffold

Before processing any domains, write the MOC skeleton at <vault path>/00 - Index.md. This becomes the index file that you update as each domain completes (step 4e) -- users and other agents can read it mid-run to see progress.

---
goodmem_ingest: true
goodmem_scope: cross-project
type: moc
topic: <topic-keyword>
date: <today YYYY-MM-DD>
updated: <today YYYY-MM-DD>
tags: [<topic>, moc]
status: in-progress
---

# <Topic>

<1-2 sentence overview -- write this now from your understanding of the topic.>

## Map of this vault section

| # | Note | What's in it | Lines |
|---|---|---|---|
| 00 | [[00 - Index]] | You are here | this |
| 01 | [[01 - <Domain 1 title>]] | _pending_ | _pending_ |
| 02 | [[02 - <Domain 2 title>]] | _pending_ | _pending_ |
| ... (one row per planned domain) | | _pending_ | _pending_ |

## Key findings

_Populated after all domains complete (step 5)._

## Gaps and open questions

_Populated after all domains complete (step 5)._

## Cross-references

_Populated after all domains complete (step 5)._

## Session provenance

- Date: <today>
- Tier: <1-5>
- Domains: <N>
- Status: in-progress

Rationale: interrupt resilience (a partial MOC is more useful than none) and visible progress in Obsidian as rows flip from _pending_ to filled.

Step 4: For each domain -- collect and synthesize

Process each domain ONE AT A TIME (sequential). For each domain, YOU (the skill orchestrator, Opus main agent) dispatch Sonnet collectors and synthesize their findings incrementally into the vault file yourself. No separate synthesizer subagent -- you ARE the synthesizer.

Why you do both dispatching AND synthesis: Subagents do NOT reliably receive the Agent tool at runtime (confirmed Claude Code platform limitation). You (the main agent running this skill) are the only agent guaranteed to have it. And since you're already Opus, there's no quality loss from synthesizing inline vs dispatching a separate Opus synthesizer.

4a: Plan ALL collector tasks upfront (commit before dispatching)

Compute COLLECTOR BUDGET per 3b.1. Break the domain's SCOPE into EXACTLY that number of non-overlapping tasks -- no more, no fewer. Each collector covers ~3-5 sub-questions.

Collector task type	What it does	Best for
Web research	3-5 WebSearch queries on specific sub-topics	Current state, blog posts, engineering posts
Library docs	context7 resolve + query for specific libraries	API syntax, config options
Vault + memory scan	Read existing vault files + goodmem retrieve	Prior learnings, existing reference docs
GitHub/community	gh CLI searches, issue scans	Open issues, release notes
Academic/specs	WebSearch for arxiv, RFCs, official specs	Foundational concepts

MANDATORY PRE-COMMITMENT: Before dispatching ANY collector, present the full collector plan as a table and commit to executing every row:

Domain: <name> -- COLLECTOR BUDGET: N
| # | Type | Task | Sub-questions covered |
|---|---|---|---|
| 1 | Web research | <specific task> | Q1, Q3, Q7 |
| 2 | Library docs | <specific task> | Q2, Q5 |
| ... (continue for all N collectors) |

Then create TaskCreate entries for each collector -- one per planned task:

TaskCreate: "Collector 1/N: <task>" (domain <domain name>)
TaskCreate: "Collector 2/N: <task>" (domain <domain name>)
... (one per collector, N total)

This pre-commitment is the enforcement mechanism. You will verify all N TaskCreate entries are completed before moving to Step 4d.

4b: Scaffold the output file

Before dispatching any collectors, write the skeleton to the OUTPUT PATH:

---
goodmem_ingest: true
goodmem_scope: cross-project
type: reference
topic: <topic-keyword>
date: <today YYYY-MM-DD>
tags: [<topic>, <sub-topics>]
---

# <DOMAIN title>

<1-2 sentence overview of the domain>

4c: Serial collector loop -- incremental synthesis (STRICTLY one dispatch per turn)

EXECUTION RULE -- overrides Claude Code's default parallel-tool-call bias for this step.

Your system prompt tells you "if you intend to call multiple tools and there are no dependencies between the calls, make all of the independent calls in the same function_calls block." That rule does NOT apply here. In this loop, each collector's briefing is DERIVED from the state of OUTPUT_PATH after the previous collector's synthesis -- so there IS a hard data dependency between dispatches, even though step 4a pre-committed the task list.

Concrete consequences:

Emit exactly ONE Agent tool call per assistant turn during this loop. Never two.
Never put an Agent call in the same function_calls block as another Agent call.
Never dispatch collector i+1 until collector i's findings have been read, integrated into OUTPUT_PATH via Edit, and the integration verified by re-reading the file.
If you find yourself composing two Agent calls together, STOP. Delete one. Run them in separate turns.

If you dispatch all N collectors without synthesis between them, you have violated this rule and the incremental-synthesis purpose of this skill is defeated: collector findings become a 12-20K-word context bomb at the end, later briefings cannot adapt to earlier gaps, and the output file is written in one rushed pass.

You MUST dispatch every collector planned in Step 4a. Stopping early because findings seem "sufficient" is a failure. The whole point of N collectors is proportional coverage -- one collector cannot substitute for the full plan.

The loop -- N iterations, one collector per iteration:

FOR i = 1 to N:

(a) Read OUTPUT_PATH first. Use the Read tool on OUTPUT_PATH before doing anything else in this iteration. For i=1 the file is just the scaffold from 4b -- read it anyway so the pattern is identical across iterations. The current file state IS the input to step (b); without this Read, you cannot compute the next briefing correctly.

(b) Compute collector i's briefing from two inputs:

The task you planned for position i in step 4a's table
The current OUTPUT_PATH content you just read -- specifically: what sub-questions are already answered, what gaps remain, what sources are already cited If earlier collectors already covered a sub-question that was on collector i's slate, pivot collector i to an adjacent gap in the same domain. The 4a plan is the minimum coverage commitment, not a verbatim script -- refinement based on accumulated file state is expected.

(d) Dispatch collector i -- this assistant turn must contain ONE Agent call and no other tool calls:

Agent({
  description: "Collect <task type> for <domain name>",
  subagent_type: "deep-research:data-collector",
  model: "sonnet",
  prompt: "You are a DATA COLLECTOR for the deep-research plugin. Execute ONE narrow data-collection task and return structured raw findings.\n\nTASK: <specific collection job from step (b)>\nSOURCES TO CHECK: <explicit queries, URLs, library IDs, or paths>\nALREADY COVERED (do not re-fetch): <bullets from step (a), or 'nothing yet' if i=1>\nMAX OUTPUT: 2000 words\n\nReturn one H2 section per source with Date, Relevance, and Claims. End with a Collection Summary. Do NOT synthesize across sources. Do NOT write files. Flag any claim from model recall with [recall]."
})

Also announce the dispatch in your assistant text so the user can see the count: Dispatching collector <i>/N for domain "<domain>": <task type> -- <task description>.

(e) When the collector returns, in the NEXT turn, integrate its findings:

Read the collector's findings from the tool return value
Apply confidence grading to each claim: [P] primary, [S] secondary, [P x N] cross-verified, [V] tool-verified, [recall] unverified
Deduplicate against the OUTPUT_PATH content you already hold from step (a)
Edit OUTPUT_PATH to append the graded findings to the appropriate sections (use Edit for targeted inserts; Write only if the file needs a full rewrite)
Do NOT dispatch another collector in this turn. Integration is the only tool work in this turn.

(f) Verify the write. Read OUTPUT_PATH again and confirm the new section landed correctly. Note any fresh gaps this collector surfaced -- they feed step (b) of iteration i+1.

(g) Mark TaskCreate entry i as completed via TaskUpdate. This and the verification Read in step (f) may be in the same turn.

(h) Only NOW may you begin iteration i+1 at step (a). Do not skip ahead.

END FOR

Anti-batching checklist -- if ANY of these is true, you violated the execution rule and must correct course:

Symptom	Fix
Two or more Agent tool calls in one assistant turn	Split -- one per turn
Dispatched collector i+1 without an Edit on OUTPUT_PATH since collector i returned	Stop. Integrate collector i first, then resume
Skipped the Read at step (a)	Read OUTPUT_PATH now; without it, the next briefing is uninformed
Reused step 4a's briefing verbatim without reflecting what's already in the file	Re-read the file, recompute gaps, adjust the briefing
Marked multiple TaskCreate entries completed in one turn	Each completion must be tied to the just-finished collector, not batched
OUTPUT_PATH was only written once (at the end) rather than N+1 times (scaffold + once per collector)	You batched. Restart the loop with integration between dispatches

Absolute prohibitions:

Do NOT stop after collector 1 because "the findings look sufficient" -- run all N
Do NOT merge two planned collectors into one dispatch to save time -- each is separate
Do NOT skip a collector because earlier ones covered "most" of its sub-questions -- gaps from your own judgment are not a substitute for parallel source coverage
Do NOT conclude the domain is complete until all N TaskCreate entries for this domain are marked completed

Pre-finalization gate: Before moving to Step 4d, verify via TaskList that all N collector tasks for this domain are completed. If any are still in_progress or pending, resume the loop at that iteration.

4d: Final pass

FIRST: Verify all N collector TaskCreate entries for this domain are marked completed. Use TaskList to check. If ANY are still pending or in_progress, go back to Step 4c and dispatch the missing ones. Do NOT proceed with final pass until every collector in your plan has returned.

After verification:

Read the full output file
Add ## Gaps and Open Questions section (sub-questions no collector answered)
Add ## References section (consolidate all sources cited)
Quality check: confidence grades on every claim, tables over prose, no AI slop
Write the final version to OUTPUT PATH
If GoodMem is configured, write ONE memory with key findings
Verify the output file: wc -l <output path> -- capture the line count for step 4e

4e: Update the MOC for this domain + force-sync to goodmem

Immediately after step 4d (before starting the next domain), update the MOC and push this domain's vault file to goodmem:

Update the MOC map row for this domain. Use Edit on <vault path>/00 - Index.md to replace the _pending_ row you scaffolded in step 3f. Change:

| NN | [[NN - <Title>]] | _pending_ | _pending_ |

to:

| NN | [[NN - <Title>]] | <one-line summary of what's in the domain file> | <line count from 4d step 7> |

Bump the MOC's updated: frontmatter field to today's date.
If GoodMem ingestion is configured, force-sync this domain's vault file into goodmem so it's queryable immediately rather than waiting for the scheduled ingest job:
```
~/.local/bin/vault-ingest-goodmem.sh
```
The ingester is idempotent (UUID5 from content hash) -- re-ingesting prior domains on every call is cheap and safe. If GoodMem is not configured, skip this step.
Mark the domain's TaskCreate entry as completed.
Begin the next domain at step 4a. If this was the final domain, proceed to step 5.

Why per-domain, not batched at end: interrupt resilience (MOC and goodmem reflect whichever domains completed), less context pressure on the final pass, and visible progress in Obsidian as each row flips from _pending_ to filled.

Step 5: Finalize the MOC (cross-cutting sections)

At this point the MOC's ## Map of this vault section table is already filled (you updated it per-domain in step 4e), every domain vault file exists, and each domain has its own goodmem memory if GoodMem is configured. What remains is the cross-cutting content that required seeing all domains first.

Edit <vault path>/00 - Index.md to replace the three _Populated after all domains complete (step 5)._ placeholders:

Fill ## Key findings -- a table of the 5-10 most important findings across ALL domains, each with evidence citation and a wikilink to the source domain file. Deduplicate findings that appear in multiple domains:
```
| Finding | Evidence | File |
|---|---|---|
| <finding> | <source citation> | [[NN - ...]] |
```
Fill ## Gaps and open questions -- aggregate unanswered sub-questions from each domain's ## Gaps and Open Questions section. Group related gaps:
```
| Gap | Source file | When to fill |
|---|---|---|
| <unanswered question> | [[NN - ...]] | <suggested trigger> |
```
Fill ## Cross-references -- wikilinks to existing vault sections adjacent to this topic. Pull from the Step 2 reconnaissance:
```
- [[../existing vault section 1]] -- <why relevant>
- [[../existing vault section 2]] -- <why relevant>
```
Flip status: frontmatter from in-progress to published. Bump updated: to today.

Fill ## Session provenance with final numbers:

- Date: <today>
- Tier: <1-5>
- Domains: <N>
- Total collectors dispatched: <sum across domains>
- Total output lines: <sum of all domain files + MOC>

Step 6: Cleanup + report

Clean up scratch dir (Tier 4+ only)

rm -rf /tmp/deep-research-<timestamp>

Report to user

Print a final summary:

Research complete.

Topic: <topic>
Vault path: <path>
Files: <N> domain files + MOC index
Total lines: <N>
Tier: <1-5> (<total> collectors)

Key findings:
- <finding 1>
- <finding 2>
- <finding 3>

Do NOT add a trailing summary or explanation beyond this block.

Mark the final "Finalize MOC cross-cutting sections + cleanup" TaskCreate entry as completed.

Error handling

Error	Action
No topic provided	Ask user and stop
goodmem retrieve fails	Continue without prior art (degrade gracefully)
A collector fails to return	Log the failure, skip that slot, note the gap in the domain file and in step 4e's map-row summary
A domain's output file missing after 4d	Log warning, note as gap in the MOC, continue with next domain
Vault path permission error	Tell user and stop
vault-ingest-goodmem.sh fails (step 4e)	Warn user that auto-ingest will pick it up on its next scheduled run; continue to next domain
All domains fail	Tell user the research failed and suggest retrying with --tier 1

deep-research

Invocation

Tool Access

Context Preview

SKILL.md

deep-research

Invocation

Tool Access

Context Preview

SKILL.md

Deep Research

Step 1: Parse arguments

Step 2: Reconnaissance

Recon decision

Step 3: Planning

3a: Decompose the topic

3b: Auto-detect tier (unless --tier forced)

3b.1: Compute COLLECTOR BUDGET per domain

3c: Auto-detect vault path (unless --path forced)

3d: Create scratch directory (Tier 4+ only)

3e: Log the plan

3f: Initialize the MOC scaffold

Step 4: For each domain -- collect and synthesize

4a: Plan ALL collector tasks upfront (commit before dispatching)

4b: Scaffold the output file

4c: Serial collector loop -- incremental synthesis (STRICTLY one dispatch per turn)

4d: Final pass

4e: Update the MOC for this domain + force-sync to goodmem

Step 5: Finalize the MOC (cross-cutting sections)

Step 6: Cleanup + report

Clean up scratch dir (Tier 4+ only)

Report to user

Error handling

Similar Skills

Deep Research

Step 1: Parse arguments

Step 2: Reconnaissance

Recon decision

Step 3: Planning

3a: Decompose the topic

3b: Auto-detect tier (unless --tier forced)

3b.1: Compute COLLECTOR BUDGET per domain

3c: Auto-detect vault path (unless --path forced)

3d: Create scratch directory (Tier 4+ only)

3e: Log the plan

3f: Initialize the MOC scaffold

Step 4: For each domain -- collect and synthesize

4a: Plan ALL collector tasks upfront (commit before dispatching)

4b: Scaffold the output file

4c: Serial collector loop -- incremental synthesis (STRICTLY one dispatch per turn)

4d: Final pass

4e: Update the MOC for this domain + force-sync to goodmem

Step 5: Finalize the MOC (cross-cutting sections)

Step 6: Cleanup + report

Clean up scratch dir (Tier 4+ only)

Report to user

Error handling

Similar Skills