By Alan-IFT
Harness Engineering toolkit for Claude Code: 7-agent pipeline (PM/Analyst/Architect/Gate/Developer/Reviewer/QA) shipped as plugin-native agents (harness-kit:<name>), Claude-native, fullstack & backend Developer partitioning, verify_all gates. Ships init / adopt / verify / status skills.
Reviews developer's code against requirement and design - not just code style, but completeness and design fidelity. Stage 5 of the Harness pipeline. Independent perspective - finds what the author cannot see.
The only agent that writes production code. Implements the approved design exactly, runs verify_all before declaring done. Stage 4 of the Harness pipeline. Updates dev-map when project structure changes.
Last checkpoint before development starts. Reviews requirement + design holistically for completeness, feasibility, and risk. Stage 3 of the Harness pipeline. Independent verifier - never trusts upstream blindly.
Project manager that routes work between specialist agents through a fixed 7-stage pipeline. Use this when starting any new feature or bug fix - it owns task lifecycle, stage transitions, and rollback decisions. Never makes professional judgments itself.
Validates the implementation against user-observable behavior - not just unit tests, but end-to-end correctness, regressions, edge cases. Stage 6 of the Harness pipeline. Owns the automated test suite long-term.
Adopt Harness Engineering into an existing project. Scans the repo, extracts conventions, proposes a plan, and (with user confirmation) applies it non-destructively. Use this when a project already has code but no Harness setup.
Run a list of tasks (T-01...T-NN) through the 7-stage pipeline in sequence. Reads docs/batches/<batch-id>/BATCH_PLAN.md and dispatches pm-orchestrator per task via the Task tool, so each task gets its own isolated context and the batch orchestrator itself only accumulates per-task summaries. Use when you have multiple tasks (from /harness-plan decomposition, an accumulated backlog, post-checkup integrations, or an external list) and want fire-and-forget execution instead of /harness × N. Stops on strong signals only: verify_all FAIL, pm-orchestrator FAIL verdict, 3 same-stage rollbacks, intervention.md STOP, safety hook block.
Switch or set a harness project's decision / escalation MODE — Mode 1 (human decides every judgment call, the safe default), Mode 2 (the AI decides per the project's PRESET rubric and escalates only the red lines), or Mode 3 (the AI decides per YOUR OWN custom rubric). Surgically rewrites ONLY the "Active mode" line of .harness/rules/25-decision-policy.md; on a first switch to Mode 3 it collects your custom decision prompts and writes them into the rubric's Custom section. Non-destructive (clean-git gated, .bak per edited file), idempotent (re-picking the current mode is a clean no-op). Use to change HOW MUCH the AI decides on its own versus asks you first — "switch decision mode", "let the AI decide on its own", "make it ask me first / 人工决策", "切换决策模式", "让 AI 自己拿主意", "改成人工决策", "用我自己的决策规则". NOT for editing the rubric's CONTENT only (edit .harness/decision-rubric.md directly), NOT for output-language (/harness-language), NOT for layout/version upgrades (/harness-upgrade).
Research / feasibility mode. Light-weight requirement analysis + a free-form findings document. No design, no Gate Review, no code. Use when "can we even do X?" is the question and you need evidence, not a vetted plan. Skips most of the 7-agent pipeline by design.
Open-ended goal mode. Runs Developer + QA in a loop within a stated budget (time or iteration count) until a success criterion is met or the budget is exhausted. Use for "keep refactoring until verify_all passes", "improve test coverage to 80%", "reduce build time below 30s" — tasks without a clean single-shot definition.
Uses power tools
Uses Bash, Write, or Edit tools
Own this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimOwn this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimBased on adoption, maintenance, documentation, and repository signals. Not a security audit or endorsement.
English · 简体中文
Harness Engineering toolkit for Claude Code — a Claude Code Plugin (15 skills + 8 framework agents + project templates) that brings disciplined AI-driven development to fullstack and backend projects. Claude-native (the framework agents ship as plugin agents — no per-project copy).
Goal: humans only do "describe the requirement" and "step in when AI can't"; everything else — 7-agent pipeline, verify gates, structured documents — runs automatically.
This is a Claude Code Plugin packaging that gives any project fifteen AI skills:
Pipeline skills (six task shapes; the AI picks the right one from your natural-language description)
/harness-kit:harness — full 7-stage pipeline (RA → SA → GR → Dev → CR → QA → Delivery). Use for real feature / bug / refactor work./harness-kit:harness-plan — design-only mode: runs RA + SA + GR, stops with a verdict before any code is written. Use to vet a design./harness-kit:harness-explore — research / feasibility mode: light RA + a findings.md with citations. No design, no code. Use for "can we even do X?"/harness-kit:harness-goal — open-ended Dev + QA loop bounded by a measurable success criterion and a budget. Use for "keep improving until coverage > 80%" type tasks./harness-kit:harness-batch — runs T-01…T-NN sequentially through pm-orchestrator, each task in its own sub-agent context. Stops on strong signals (verify_all FAIL, pm-orchestrator FAIL, intervention STOP, safety hook block). Use for /harness-plan decompositions, accumulated backlogs, post-checkup sweeps, or imported task lists — instead of invoking /harness N times./harness-kit:harness-stream — like batch, but the task pool stays alive: it re-reads BATCH_PLAN.md every iteration, so tasks you add mid-run (in chat or by appending to the pool / an ADD intervention) get planned and executed without re-invoking. Best-effort completion (a failed task is marked and skipped, the stream keeps going) with the same hard-safety stops as batch. A task that needs human input (clarification, a human-reserved decision, or authorization for a safety-critical action) is deferred (a distinct needs-human status — set aside, its dependents blocked) and surfaced together at stream end in STREAM_REPORT.md's ## Needs your input section, so the stream never sits stopped waiting on you. Use when you want to fire requirements as they occur to you and only watch results. Complex multi-part requirements are auto-decomposed at ingest into dependency-staged sub-task rows (simple ones stay one row; rows you author — ADD / hand-written — run as-written). Ambient mode: just invoke with no pool-id — a default pool (docs/batches/default/) is auto-created and a UserPromptSubmit hook (gated by .harness/ambient.flag) makes every chat message a heartbeat that folds requirements into the pool and drains it; no /loop, no re-invocation, no keyword. It is session-scoped — a SessionStart hook auto-clears the flag each new session, so re-invoke /harness-stream to resume.Setup skills
/harness-kit:harness-init — bootstrap Harness skeleton in a new project (asks 6 questions, generates .harness/ + .claude/ + AI-GUIDE.md + stub CLAUDE.md / copilot-instructions.md in ~30s)/harness-kit:harness-adopt — non-invasively add Harness to an existing project (detects stack, extracts conventions, prompts before applying)/harness-kit:harness-upgrade — bring an already-initialized but stale project up to the current plugin layout (relocate scripts to .harness/scripts/, content-refresh depth-sensitive scripts for correct root derivation, re-install the pre-commit hook, rewire settings, regenerate verify_all while preserving your B.* checks — dry-run preview, idempotent, proven with a green verify_all)/harness-kit:harness-language — set, switch (English ↔ Chinese), or refresh a project's output-language policy by surgically rewriting only the three policy surfaces (.harness/rules/00-core.md section + CLAUDE.md line + .github/copilot-instructions.md line) to the target language's current canonical text. Self-bootstraps the text from the plugin templates (so an old project can pull a refreshed policy), non-destructive, idempotent, dry-run preview, .bak per file.npx claudepluginhub alan-ift/harness-kit --plugin harness-kitComprehensive feature development workflow with specialized agents for codebase exploration, architecture design, and quality review
Comprehensive skill pack with 66 specialized skills for full-stack developers: 12 language experts (Python, TypeScript, Go, Rust, C++, Swift, Kotlin, C#, PHP, Java, SQL, JavaScript), 10 backend frameworks, 6 frontend/mobile, plus infrastructure, DevOps, security, and testing. Features progressive disclosure architecture for 50% faster loading.
Harness-native ECC operator layer - 67 agents, 271 skills, 92 legacy command shims, reusable hooks, rules, selective install profiles, and production-ready workflows for Claude Code, Codex, OpenCode, Cursor, and related agent harnesses
Tools to maintain and improve CLAUDE.md files - audit quality, capture session learnings, and keep project memory current.
Access thousands of AI prompts and skills directly in your AI coding assistant. Search prompts, discover skills, save your own, and improve prompts with AI.
Develop, test, build, and deploy Godot 4.x games with Claude Code. Includes GdUnit4 testing, web/desktop exports, CI/CD pipelines, and deployment to Vercel/GitHub Pages/itch.io.