From Anti-Hallucination Verify-First
Swarm orchestration with a non-blocking main thread. Use whenever work is heavy, long-running, or parallelizable - builds, tests, deploys, data dumps, research, codebase sweeps, multi-part features, or any operation that would otherwise make the user wait. The main conversation stays a coordinator: it decomposes work, launches background and parallel subagents for anything that can run without conflict, keeps itself free to talk to the user, then collects and synthesizes results. Trigger on multi-step tasks, 2+ independent subtasks, anything expected to take more than ~30s or produce verbose output, or any time the user signals the main thread feels blocked/slow.
How this skill is triggered — by the user, by Claude, or both
Slash command
/anti-hall:orchestrationThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
The main conversation is a **coordinator, not a worker**. Its scarcest resources are
The main conversation is a coordinator, not a worker. Its scarcest resources are the user's attention and its own context window. So: push heavy work out to subagents, run independent work in parallel, prefer the background, and keep the main thread free to think and talk. The user should rarely watch the main thread block on a long operation.
run_in_background: true) — and the main thread continues.Agent calls so they
run concurrently. Don't serialize what can run at once.isolation: "worktree") and integrate the results afterward.Agent calls): independent investigations
or transforms across non-overlapping targets.Match the model to the job (resolve to the newest in each tier at runtime):
../MODEL-POLICY.md).Don't send a planning problem to Haiku, or a log-tail to Opus.
For a big, multi-stream effort, use a tiered hierarchy so no single context holds everything:
L1 Main coordinator (this conversation)
decomposes, partitions, launches, collects, decides, talks to the user.
Does NOT do exploration/implementation/heavy ops directly.
L2 Sub-orchestrators (one per independent stream; optional)
own a worktree or a subsystem; spawn L3 workers; report up.
L3 Workers
do the actual Bash/edits/tests/research; return tight summaries.
For peer coordination among long-lived streams, a team (TeamCreate +
SendMessage) beats one-shot Agent calls. Use plain Agent calls for scope-
isolated, fire-and-collect work.
A subagent only has the context you hand it. Give it exactly enough — no less, no filler. Every brief states:
file:line citations) — NOT a transcript.Precise and concise both matter: a vague brief produces wandering, wasteful work; a bloated brief wastes tokens and buries the goal. Aim for the minimum that makes the task unambiguous.
When the codex CLI / plugin is available, treat it as a second worker pool, not a
fallback. Split independent slices between Claude subagents and Codex so they run
concurrently and you get cross-model diversity (different training, different blind
spots). Good splits:
../MODEL-POLICY.md roster — Opus + Codex).Builds, test runs, deploys, pushes, migrations, dumps, installs — their raw output is garbage for the main context window. NEVER run them inline in the main conversation.
Co-Authored-By trailer
and no "Generated with " line. The work is the owner's; the assistant takes no
authorship credit. The plugin's git-guard PreToolUse hook blocks commits that
include such trailers.git-guard hook blocks
git push --force / -f / --force-with-lease.npx claudepluginhub talas9/anti-hall --plugin anti-hallProvides UI/UX resources: 50+ styles, color palettes, font pairings, guidelines, charts for web/mobile across React, Next.js, Vue, Svelte, Tailwind, React Native, Flutter. Aids planning, building, reviewing interfaces.
Fetches up-to-date documentation from Context7 for libraries and frameworks like React, Next.js, Prisma. Use for setup questions, API references, and code examples.