From toolu
Use when a task is broad or multi-step and should be delegated across subagents rather than run inline — exploration, codebase-wide searches, parallel builds, migrations, audits, reviews. Teaches the MAIN thread to delegate well: when to spawn a subagent vs edit inline, which agent for which job, how to parallelize independent work, how to keep main context lean, and which model tier fits each job. Tells: "orchestrate this", "delegate this", "break this down", "coordinate subagents", "this is a big task", or a UserPromptSubmit nudge flagged the prompt as broad/multi-step.
How this skill is triggered — by the user, by Claude, or both
Slash command
/toolu:orchestratorThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
The main thread is the **only** orchestrator — it alone holds the Agent tool. Subagents do the work; they do not spawn their own subagents (Claude Code discourages nested delegation, and it bloats context). This skill makes the main thread good at delegating: it decides *what* to hand off, to *which* agent, on *which* model tier, and *how* to keep its own context lean so a long task does not dr...
The main thread is the only orchestrator — it alone holds the Agent tool. Subagents do the work; they do not spawn their own subagents (Claude Code discourages nested delegation, and it bloats context). This skill makes the main thread good at delegating: it decides what to hand off, to which agent, on which model tier, and how to keep its own context lean so a long task does not drown in tool output.
Trigger phrases: orchestrate this, delegate this, break this down, coordinate subagents, fan this out, this is a big/multi-step task.
| Delegate to a subagent | Do inline (main thread) |
|---|---|
| Broad/fan-out search across many files or naming conventions | A read of a file you already know |
| Reading many files to answer one question | A single-fact grep where you know the symbol |
| Independent work that can run in parallel | A trivial single-file edit |
| Review / audit of a diff or subsystem | Final synthesis + the decision |
| Anything that returns a lot of bytes you only need the conclusion of | Anything where delegation overhead > the work |
Rule of thumb: if the answer means reading across several files and you only need the conclusion, delegate it and keep the conclusion, not the file dumps. Once delegated, don't also run it yourself — wait for the result.
toolu:deep-explore — structural/architecture exploration via ast-grep (runs on Sonnet). First choice for "where/how is X done across the code".Explore — broad read-only fan-out search when you need the conclusion, not file dumps.Plan — design an implementation strategy for a non-trivial change.general-purpose — multi-step research/execution that doesn't fit a specific agent.caveman:cavecrew-investigator / -builder / -reviewer — when the caveman plugin is installed: compressed-output locate / bounded 1–2 file edit / diff review. Output is ~60% smaller, so main context lasts longer.Carry the session mandates into every subagent prompt (comemory recall/save, ast-grep first). Delegation never exempts the work.
Launch independent subagents in one message with multiple tool calls so they run concurrently — not one-at-a-time. Dependent steps wait; independent steps don't. A four-way independent search done serially wastes three-quarters of the wall-clock.
The expensive, recurring cost in a long session is input tokens re-sent every turn (see the token-efficiency report). Two rules follow:
Match the model to the job (the convention deep-explore follows):
Routing the bulk of work to Sonnet keeps the expensive frontier model for the lead thread's hard reasoning — large cost cut at near-full quality.
Subagents multiply token spend (multi-agent runs can cost ~15× a single thread; unmanaged fan-out has produced four- and five-figure single-session bills). Before a large fan-out: cap the number of parallel agents to what the task needs, give each a tight scope, and prefer one well-scoped sweep over a runaway loop. If you bound coverage (top-N, sampling), say so — don't let "covered everything" hide a silent cap.
Creates, edits, and optimizes skills for Claude Code, including drafting, evaluating with test prompts, iterating on performance, and improving skill descriptions for better triggering accuracy.
npx claudepluginhub falconiere/toolu --plugin toolu