Skill

harness-setup

Build, extend, and maintain a project's agentic harness — the agents, skills, and orchestrator under .claude/. This skill writes files. Use it to set up, scaffold, extend, rebuild, or sync a harness, to add or change an agent or skill, or to apply a review context from harness-review; on request it also discovers and registers fitting MCP/plugin tools. When a project runs both a repo-native tracker and a human tracker (Jira, Linear, GitHub Issues), it also generates the dual-tracker sync — use it for 'keep the trackers in sync', 'sync issues to Jira/Linear', or 'tracker sync setup'. For read-only assessment of an existing harness, use harness-review — this skill is the writer, that one is the reader. Not for authoring a single standalone skill or plugin (use plugin-dev or skill-creator), or one-shot automation recommendations (use claude-code-setup). Not for choosing a project's spec-driven development system or issue tracker — use spec-advisor or tracker-advisor.

Invocation

How this skill is triggered — by the user, by Claude, or both

Slash command

/agentic-harness:harness-setup

User invocable

Model invocable

Inline context

Default effort

Configuration

Modelinherit

Context Preview

The summary Claude sees in its skill listing — used to decide when to auto-load this skill

This skill builds and maintains a project-local **agentic harness**: a team of agent

Supporting Files

references/agent-design-patterns.mdreferences/maintenance.mdreferences/orchestrator-template.mdreferences/skill-writing-guide.mdreferences/team-examples.mdreferences/tool-discovery.mdreferences/tracker-sync-template.md

SKILL.md

397 lines · ~6.8k tokens(exceeds 5k compaction limit)

Stats

LanguageShell

Stars0

MaintenanceExcellent

Last CommitJun 13, 2026

Actions

View Source View Plugin View on GitHub View README

Stats

Actions

Harness setup — build and maintain the agent team

This skill builds and maintains a project-local agentic harness: a team of agent definitions, the skills those agents use, an orchestrator that coordinates them, and a pointer in the project's CLAUDE.md. It writes files. It does not do the project's domain work — it builds the agents and skills that do.

Read the model first: ${CLAUDE_PLUGIN_ROOT}/shared/harness-model.md defines the three parts (agent = who, skill = how, orchestrator = when/order) and why they stay separate.

This skill vs harness-review

harness-setup is the writer — every change to the harness goes through it. harness-review is the reader — it assesses an existing harness and writes nothing. When a request is "assess / review / audit / how well is it used," that is harness-review. When a request creates or changes anything, it is this skill. After harness-review hands off a review context, this skill is what acts on it.

Step 0: Orient before writing

Always check the current state first, then pick the path. Do not start generating until the plan is confirmed.

Take the user's starting context, if any. This skill accepts an optional context at the start — domain notes, constraints, tools already in use, or a review context from harness-review. Read it and fold it into everything below.
Read .claude/agents/, .claude/skills/, and the harness section of CLAUDE.md.
Branch on what you find:
- New build — no harness, or empty agent/skill dirs → run Steps 1–7 in full.
- Extend — a harness exists and the request adds or changes an agent or skill → run only the needed steps, per the extension matrix in references/maintenance.md.
- Apply a review context — harness-review produced a prioritized list → work it as an interactive improvement pass (see references/maintenance.md).
- Sync — the files and the CLAUDE.md record disagree → reconcile and record the correction.
Present the plan and confirm it before generating. Tool research and tool maintenance are part of a good harness, so make them part of the plan you present every run — don't wait to be asked:
- Always ask whether to run tool discovery (Step 1b), on a new build or an extension.
- On an existing harness (extend / apply-review-context / sync), also ask whether to run a tool-maintenance review of the registered tools.md (see references/maintenance.md). Record the answers. Asking is the default; a "no" is a fine answer, but a silent skip is not. Running happens only on a yes — see Step 1b. This confirms the approach; the concrete list of files and tools is approved separately at Step 2b, before anything is written.
Account for the project's process layers. A harness is the who/how/when of the work; a project may also follow process layers — a spec process (what to build) and an issue tracker (what work is ready and in what state). They are complementary to the harness, so check each advisor area — scan with ${CLAUDE_PLUGIN_ROOT}/shared/detection-signatures.md:

Process area Advisor skill Coordination map
Spec process (SDD) spec-advisor ${CLAUDE_PLUGIN_ROOT}/shared/sdd-coordination.md
Issue tracking tracker-advisor ${CLAUDE_PLUGIN_ROOT}/shared/tracker-coordination.md

The same gate applies to every area:
- No system, project looks like software → offer to run the area's advisor skill, which advises what fits and delegates setup to the chosen system's own installer. Offering is the default; running is gated on a yes, and nothing is installed without the advisor's own per-system approval. If the user installs one, fold its coordination into the plan below. If the advisor reports an installer failure, treat the area as having no system — record no coordination context; the advisor leaves the retry path with the user.
- A system is present (detected, or just installed) → do not re-recommend and do not install. Identify the system and version, look up its row in the area's coordination map, and record a coordination context — for SDD {system, version, owned-segment, activation, auto-invokable, hand-back contract, write-back rule}; for a tracker {tracker, version, entry point, ready-work query, create convention, status write-back, auto-invokable} — to carry into Step 2 and Step 5. This is the lightweight read harness-setup needs to bake the coordination into the orchestrator; the advisor skills still own recommending and installing. The protocol behind every map is ${CLAUDE_PLUGIN_ROOT}/shared/coordination-protocol.md.
- Both an agentic tracker and a human tracker present or declared (a repo-native tracker like Beads plus Jira, Linear, or GitHub Issues — the human-tracker usage signals are in ${CLAUDE_PLUGIN_ROOT}/shared/detection-signatures.md) → also offer the dual-tracker sync sub-step: generate the project's tracker-sync skill, agent, and sync config per ${CLAUDE_PLUGIN_ROOT}/shared/tracker-sync-protocol.md, using references/tracker-sync-template.md. Offer it only on this dual-tracker condition — never when one tracker or none is present. Before generating any sync artifact, run the sync preflight: validate the recorded tracker coordination context by actually calling both entry points — the ready-work query on the agentic side, human-tracker role resolution plus a cheap ping on the SaaS side. A failed validation stops the sub-step and corrects the context first; never generate against a stale context.
Record the answer for each area either way. A future advisor adds a row to the table; the gate itself does not change.

Process area	Advisor skill	Coordination map
Spec process (SDD)	`spec-advisor`	`${CLAUDE_PLUGIN_ROOT}/shared/sdd-coordination.md`
Issue tracking	`tracker-advisor`	`${CLAUDE_PLUGIN_ROOT}/shared/tracker-coordination.md`

Step 1: Analyze the domain

Identify the domain and the core task types (creation, validation, editing, analysis).
Explore the codebase — tech stack, data models, key modules — so agents and skills fit the project rather than a generic template.
Check for conflicts or overlap with any existing agents and skills from Step 0.
Read the user's technical level from the conversation and match your wording to it. Explain a term like "assertion" or "schema" when the cues suggest it is unfamiliar.

Step 1b: Discover tools — always offered, run on acceptance

Step 0 always offers tool research as part of the plan; run this step when the user accepts. It can also be triggered on its own later, against an existing harness. Offering is the default; running is gated on that yes — and adopting any individual candidate is gated on a separate, explicit per-tool yes (Step 3). Those two gates are the safeguard: the user is always asked, and nothing is installed behind their back.

This skill proposes nothing of its own — the candidates come from a live search, not a built-in catalog:

Hand a search-optimized subagent (general-purpose, with web search) a tight context — the project's domain, stack, and task types from Step 1 — and have it find candidate MCP servers and plugins that fit. Also inspect the local and session configuration for tools already available, so you don't propose what is already there.
Present each candidate with the role it would fill, what it does, and its trade-off. The user accepts or rejects each one explicitly. Adopt only what is accepted.
Accepted tools become install/register rows in the Step 2b manifest; once that is approved, register them by role in the tools registry under the orchestrator — a tools.md file in .claude/skills/{domain}-orchestrator/references/. It is a lookup of role → preferred tool → alternative (for when the preferred one is unavailable) → status. Agents and skills reference a tool by its role, never by a hard tool name, so the harness falls back to the alternative when a tool is missing.

The subagent's context template, the acceptance flow, and the registry schema are in references/tool-discovery.md. Registered tools are reviewed periodically — see references/maintenance.md.

Step 2: Choose the execution mode and the architecture pattern

Execution mode. Default to an agent team when two or more agents genuinely need to exchange information mid-task; fall back to subagents when they do not, or when the experimental team tools are unavailable. The team-tools caveat and the mechanical fallback mapping are in ${CLAUDE_PLUGIN_ROOT}/shared/execution-modes.md — read it and decide the mode before designing the team, because the mode shapes the agent definitions and the orchestrator.

Architecture pattern. Decompose the work into areas of expertise and pick a structure. The six patterns — Pipeline, Fan-out/Fan-in, Expert Pool, Producer-Reviewer, Supervisor, Hierarchical Delegation — with their fit and their team-mode suitability are in references/agent-design-patterns.md. Composite patterns are common; the same reference covers them.

Split agents along four axes — expertise, parallelism, context, reusability. The criteria table is in references/agent-design-patterns.md. Prefer a few focused agents over many thin ones; coordination cost grows with team size.

Coordinate with the process layers. If Step 0 recorded a coordination context, decide with the user how the orchestrator and each present system compose — they must work together without overlap and with minimal friction, not run in parallel.

Spec process (SDD). Using ${CLAUDE_PLUGIN_ROOT}/shared/sdd-coordination.md, settle: which phases the orchestrator delegates to the SDD (the spec/plan/decompose segment it owns) versus owns (typically execution, integration, and a cross-boundary QA pass); and whether activation is auto-invokable (the orchestrator calls the SDD's CLI/MCP entry point with a contextual prompt) or human-gated (it emits the prompt and pauses for the user, as with an IDE or an approval step).
Issue tracker. Using ${CLAUDE_PLUGIN_ROOT}/shared/tracker-coordination.md, settle: which phases touch the tracker (phase 0 pulls ready work or creates the issue; integrate writes status back); whether access is auto-invokable (CLI / configured MCP) or human-gated (a SaaS UI with no configured access); and the one-status-owner rule when Taskmaster or an SDD task artifact co-exists with the tracker — each item's work state has exactly one owning system.

Fold the decisions into the Step 2b manifest so they are approved before any write.

Step 2b: Approve the change manifest — required before any write

Before creating, updating, or deleting anything — and before installing or uninstalling any tool — present a single explicit change manifest and get the user's formal approval. This is mandatory on every path: new build, extend, apply-review-context, sync. Nothing is written to .claude/ or CLAUDE.md, and no tool is installed or removed, until the user approves it.

This is not the Step 0 plan confirmation. Step 0 agrees the approach before the design exists; the manifest is the concrete, itemized list of exactly what this run will touch, produced once the design is settled. Writes and installs change the user's repository and environment and are awkward to undo — one explicit sign-off on the exact list is what keeps the run from making a change the user did not expect.

Present it as concrete items, each labelled with its action and target:

Action	Target
create / update / remove	`.claude/agents/{name}.md` (one row per agent)
create / update / remove	`.claude/skills/{name}/` (one row per skill)
create / update	`.claude/skills/{domain}-orchestrator/`
update	`CLAUDE.md` (harness pointer + change-history row)
install / uninstall	`{role} -> {tool}` (only if tool discovery or maintenance proposed it)
register / unregister schedule	`{venue} — {cadence} — {mode}` (one row per scheduled run; environment-level — approving it changes the user's machine, not just the repo)

List only the rows that apply. If the user amends the list — drops an agent, declines a tool, renames a skill — revise and present it again; the approval is of the final list. Once approved, carry out exactly what was approved in Steps 3–6 — no extra files, no extra installs.

Step 3: Generate the agent definitions

Write every agent as a file under .claude/agents/{name}.md — including agents that use a built-in type (general-purpose, Explore, Plan). Put the built-in type in the spawn call; put the role, principles, and protocol in the file. The reason is in the harness model: a role defined only inline is not reusable next session and carries no collaboration contract.

Each agent file states: core role, working principles, input/output protocol, error handling, and collaboration. In team mode, add a team communication protocol section — who it messages, who messages it, and what it claims from the shared task list. The definition template and worked agent files are in references/agent-design-patterns.md and references/team-examples.md.

Model. Default each agent to model: inherit so it follows the session's model. A harness's quality tracks its agents' reasoning, so for a role that depends on judgment rather than throughput, pin the strongest reasoning model explicitly — by its current dated id (e.g. claude-opus-4-8), not a bare opus alias that ages.

If the team includes a QA agent. Use the general-purpose type (Explore is read-only and cannot run validation). Make its core method cross-boundary comparison — read both sides of a contract together (the producer and the consumer), not each in isolation — and run it incrementally as each module lands, not once at the end. The full methodology, boundary-bug patterns, and a QA agent template are in the qa-agent-guide reference under the harness-review skill.

Step 4: Create the skills

Create each skill the agents use at .claude/skills/{name}/SKILL.md. The authoring guide — description writing, body principles, progressive disclosure, data-schema standards — is in references/skill-writing-guide.md. The essentials:

Description. It is the only trigger mechanism. Write it to be specific about what the skill does and when it should fire, slightly pushy to offset conservative triggering, and worded to stay clear of skills that should not fire on the same request.
Body. Explain the why rather than issuing bare "ALWAYS/NEVER" rules — an agent that understands the reason handles edge cases correctly. Keep it lean (aim under 500 lines; move detail to references/). Generalize to the principle instead of overfitting to one example. Write imperatively.
Progressive disclosure. Metadata is always in context; the body loads on trigger; references/ load only when needed. Split large or domain-specific detail into references/ so only the relevant file loads.
Linking. One agent uses one or more skills; a skill may be shared across agents. The skill holds how; the agent holds who.

Step 5: Build the orchestrator and register the pointer

The orchestrator is a skill whose subject is the team: which agents take part, what each produces, how outputs flow, and how failures are handled. Templates for team, subagent, and hybrid modes — with data-passing, error handling, and test scenarios — are in references/orchestrator-template.md.

Build into the orchestrator:

An intake-and-triage first phase, because the orchestrator is the repo's entry point and runs for every prompt: it triages first — a trivial or out-of-domain request is answered directly and stops there; in-domain work then goes through the context check (initial run vs. follow-up vs. partial re-run, branching on whether _agents_workspace/ already exists). This triage is what makes the CLAUDE.md hard gate practical — it routes every prompt without spinning up a team for trivia.
An entry-point description that opens by stating the orchestrator is the entry point for the domain (invoke before responding to any domain request), then carries follow-up trigger keywords ("re-run", "update", "modify", "supplement", "improve the previous result", and everyday domain phrasings). The description backs the CLAUDE.md directive; without the follow-up keywords the harness goes unused after its first run.
Data-passing stated explicitly, matched to the mode — see ${CLAUDE_PLUGIN_ROOT}/shared/execution-modes.md.
Error handling that does not assume success: retry once, then proceed without the missing result and note the omission; never delete conflicting data — record it with its source.
The tools registry, when tool discovery (Step 1b) has run: it lives in this orchestrator's references/ directory as tools.md, and agents and skills reference tools by role from it.
SDD coordination, when Step 0 recorded an SDD coordination context: splice the addenda from references/orchestrator-template.md (SDD coordination section) into the orchestrator's phase 0, prepare, and integrate phases, inlining the system's concrete artifact paths and entry point — the orchestrator cannot read the shared file at runtime. Mark each phase as delegated (→ SDD: {system}) or orchestrator-owned. The model is in ${CLAUDE_PLUGIN_ROOT}/shared/sdd-coordination.md.
Tracker coordination, when Step 0 recorded a tracker coordination context: splice the addenda from references/orchestrator-template.md (Tracker coordination section) into the orchestrator's phase 0 and integrate phases, inlining the tracker's concrete commands (the ready-work query, the create command, the status write-back). The model is in ${CLAUDE_PLUGIN_ROOT}/shared/tracker-coordination.md.
Tracker-sync write-through, when the dual-tracker sync sub-step generated artifacts: splice Addendum T4 (Tracker coordination section of references/orchestrator-template.md) immediately after T3, so the integrate phase invokes the generated tracker-sync skill in scoped mode for the items it just wrote back. The sync model is in ${CLAUDE_PLUGIN_ROOT}/shared/tracker-sync-protocol.md.

When extending rather than building new, modify the existing orchestrator — do not create a second one. Reflect a new agent in the team composition, task assignment, data flow, and trigger keywords.

Verify generation before declaring it complete. After writing the generated artifacts — the orchestrator and, when the sync sub-step ran, the sync skill, agent, and sync config — grep every written file for unsubstituted {PLACEHOLDER} tokens and for ${CLAUDE_PLUGIN_ROOT} references. Any hit fails the run: fix the file and re-verify before moving to Step 6. Generated files must be self-contained — a leaked placeholder or plugin path surfaces later inside the target project, at worst in a scheduled headless run that fails with nobody watching.

Then register the pointer in the project's CLAUDE.md: goal, the entry-point directive (the hard gate that makes the orchestrator the single entry point — every prompt routes through it before any response), and the change-history table — and nothing the file system already holds. The convention and the template are in ${CLAUDE_PLUGIN_ROOT}/shared/claude-md-pointer.md.

Step 6: Record the change in history

Every write to the harness appends a row to the change-history table in CLAUDE.md (Date | Change | Target | Reason). This is a required step, not optional — it is how evolution stays visible and regressions stay catchable. The table format is in ${CLAUDE_PLUGIN_ROOT}/shared/claude-md-pointer.md.

Step 7: Capture feedback

A harness is a system that keeps changing, not a one-time artifact. After a run, offer the user the chance to feed back ("anything to improve in the result, the team, or the workflow?"). If there is none, move on — do not force it. When there is, route it to the right place: output quality → the relevant skill; missing role → a new agent definition; wrong order → the orchestrator; missing trigger → a description. The routing table and the evolution triggers (recurring feedback, repeated failures, the user bypassing the orchestrator) are in references/maintenance.md.

Some feedback is not about this harness but about the tool that built it — a confusing step, a missing capability, a bug in the plugin itself. That belongs upstream, not in the project's files. When the feedback is of that kind, offer to run harness-feedback: it runs a short kaizen retrospective and, with the user's explicit consent, files a privacy-safe GitHub issue on the agentic-harness repo. Offering is the default; a "no" is fine, and nothing is filed without approval.

Deliverable checklist

Before calling a setup or change complete:

References

references/agent-design-patterns.md — execution-mode comparison, the six architecture patterns, agent-split criteria, the agent-definition template.
references/team-examples.md — worked agent teams across generic domains, with full sample agent files.
references/orchestrator-template.md — orchestrator templates by mode, with data-passing, error handling, and test scenarios.
references/skill-writing-guide.md — skill authoring: descriptions, body style, progressive disclosure, data-schema standards.
references/maintenance.md — extending an existing harness (the extension matrix), applying a review context, syncing drift, feedback routing, and periodic tool review.
references/tool-discovery.md — the optional, on-request tool-discovery step: the search subagent's context, the explicit-acceptance flow, and the tools.md registry schema.
references/tracker-sync-template.md — what the dual-tracker sync sub-step generates: the tracker-sync skill, agent, and sync-config templates, the elicitation guidance, and the schedule-registration block per venue.
${CLAUDE_PLUGIN_ROOT}/shared/harness-model.md, ${CLAUDE_PLUGIN_ROOT}/shared/execution-modes.md, ${CLAUDE_PLUGIN_ROOT}/shared/claude-md-pointer.md — shared concepts.
${CLAUDE_PLUGIN_ROOT}/shared/detection-signatures.md — how to recognise an installed SDD system or issue tracker (shared with spec-advisor and tracker-advisor); ${CLAUDE_PLUGIN_ROOT}/shared/coordination-protocol.md — the generic coordination protocol; ${CLAUDE_PLUGIN_ROOT}/shared/sdd-coordination.md and ${CLAUDE_PLUGIN_ROOT}/shared/tracker-coordination.md — the per-area instances with their per-system coordination maps; ${CLAUDE_PLUGIN_ROOT}/shared/tracker-sync-protocol.md — the dual-tracker sync model (lanes, state store, fingerprints, concurrency, failure rules, per-SaaS map).

harness-setup

Invocation

Configuration

Context Preview

Supporting Files

SKILL.md

harness-setup

Invocation

Configuration

Context Preview

Supporting Files

SKILL.md

Harness setup — build and maintain the agent team

This skill vs harness-review

Step 0: Orient before writing

Step 1: Analyze the domain

Step 1b: Discover tools — always offered, run on acceptance

Step 2: Choose the execution mode and the architecture pattern

Step 2b: Approve the change manifest — required before any write

Step 3: Generate the agent definitions

Step 4: Create the skills

Step 5: Build the orchestrator and register the pointer

Step 6: Record the change in history

Step 7: Capture feedback

Deliverable checklist

References

Similar Skills

Harness setup — build and maintain the agent team

This skill vs harness-review

Step 0: Orient before writing

Step 1: Analyze the domain

Step 1b: Discover tools — always offered, run on acceptance

Step 2: Choose the execution mode and the architecture pattern

Step 2b: Approve the change manifest — required before any write

Step 3: Generate the agent definitions

Step 4: Create the skills

Step 5: Build the orchestrator and register the pointer

Step 6: Record the change in history

Step 7: Capture feedback

Deliverable checklist

References

Similar Skills