harness

Name: harness
Author: xuzhijie-ownself


Domain-blind core harness with 6 agents, GAN-like adversarial evaluation, profile system, sprint contracts, and quantified 0-5 grading

npx claudepluginhub xuzhijie-ownself/harness --plugin harness

Popularity

Stars

Med: 0·Avg: 285

Installs

Med: 0·Avg: 1

What's Inside

Slash Commands (6)

postmortem

/postmortem

Generate a postmortem report for the current harness run. Reads evaluation artifacts, metrics, events, and feature history to produce .harness/postmortem.md with Timeline, Score Trends, Failure Analysis, Process Compliance, and Recommendations sections.

release

/release

Create a release checkpoint -- version bump, changelog, and git tag.

reset

/reset

Write a structured handoff file and checkpoint the current session. Use when context is filling or work needs to pause. The next /session resumes from the handoff automatically. Implements Variant B (Reset-Based Compatibility).

run

/run

Run the harness in continuous coordinator-driven mode. Advances sprint rounds automatically until all required features pass or a blocker stops the run. Use when you want unattended progress.

session

/session

Run one supervised sprint round for a harness project. Selects the next failing required feature, negotiates a sprint contract with evaluator review, implements it, and evaluates it. Waits for user confirmation between contract review and implementation.

Agents (6)

coordinator

/coordinator

Advance sprint rounds automatically in continuous mode. Selects the next failing feature, dispatches generator and evaluator, updates state.json, and pauses when pause rules fire. Spawn only when execution mode is continuous.

Evaluator Agent

/evaluator

> Thin wrapper -- edit `plugins/harness/skills/harness/roles/evaluator.md` instead.

Generator Agent

/generator

> Thin wrapper -- edit `plugins/harness/skills/harness/roles/generator.md` instead.

initializer

/initializer

Set up a harness scaffold. Creates .harness/features.json, .harness/progress.md, and .harness/init.md. Spawn once at project start.

planner

/planner

Expand an underspecified project goal into a product spec with a finite required feature set and an explicit execution strategy. Spawn when the user's prompt is too short to define a full app.

Skills (1)

harness

/harness

Designs and runs Anthropic-style long-running application harnesses for autonomous coding. Use when turning a short prompt into a multi-agent workflow, dispatching initializer/planner/generator/evaluator/coordinator roles, tracking completion through a machine-readable feature list, negotiating sprint contracts before coding, making incremental progress on failing features across sessions, or running until the required feature set passes. Also activate for /start, /session, /run, /reset commands, context reset with handoff files, supervised vs continuous execution modes, or questions about Anthropic harness design patterns and context anxiety.

Hooks (1)

Event Hooks

Bash

3 hooks across 1 event

Stats

Version: 2.2.2

Language: JavaScript

Stars: 0

Forks: 1

Maintenance: Excellent

License: MIT

Last Commit: Apr 6, 2026

Added: Apr 4, 2026


Available In

harness

Safety Signals

Caution

Executes bash commands

Hook triggers when Bash tool is used

Uses power tools

Uses Bash, Write, or Edit tools

Has parse errors

Some configuration could not be fully parsed

README

Harness

Multi-agent sprint orchestration middleware for long-running application development. Two-plugin architecture: a domain-blind core handles all orchestration, and domain skill suites provide profile-specific knowledge.

6 agents: initializer, planner, generator, evaluator, coordinator, releaser.
Dual-runtime: works with both Claude Code and OpenAI Codex CLI.
Two plugins: harness (core) + harness-sdlc-suite (software delivery domain skills).

Methodology: PDCA (Plan-Do-Check-Act) + two innovations:

Evaluator-never-edits-code -- tool-access-level role purity
Authenticity gate -- binary overlay catching generic/template AI output

The core harness can be used standalone with a custom profile -- no domain skill suite required. Domain suites add pre-built profiles with evaluation criteria, artifact taxonomies, and verification procedures.


Install

Option 1: Claude Code Marketplace (Recommended)

# Installs both core harness and SDLC suite
claude plugin install harness@harness

Then reload:

/reload-plugins

Option 2: Git Clone + Local Install

# Clone into your project
git clone https://github.com/xuzhijie-ownself/harness.git plugins/harness-repo

# Install (Mac / Linux / Git Bash)
bash plugins/harness-repo/install.sh

# Install (Windows CMD)
plugins\harness-repo\install.bat

The install script copies:

Core skill from plugins/harness/skills/harness/
Domain skills from plugins/harness-sdlc-suite/skills/ (6 skills)
Agents, commands, and hooks from the core plugin

Option 3: Codex CLI

If using OpenAI Codex CLI, the plugin auto-loads from the .codex-plugin/plugin.json manifest:

# Clone the repo into your project
git clone https://github.com/xuzhijie-ownself/harness.git plugins/harness-repo

# Codex detects .codex-plugin/plugin.json automatically
# Skills from both plugins are loaded via dual skill paths
codex  # start codex in the project directory

Uninstall

# Claude Code marketplace
claude plugin uninstall harness

# Local install (removes core + all domain skills)
bash plugins/harness-repo/install.sh --uninstall

Update

# Marketplace
claude plugin update harness

# Local
cd plugins/harness-repo && git pull && bash install.sh

Commands

Command	Purpose
`/harness:start`	Scaffold harness for a new project (run once)
`/harness:session`	Run one supervised sprint round
`/harness:run`	Continuous coordinator-driven loop (unattended)
`/harness:reset`	Checkpoint + handoff when context fills (Variant B)
`/harness:release`	Version bump, changelog, and git tag
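Both `/harness:session` and `/harness:run` are described as working on the next failing required feature. The following is a minimal Python sketch of that selection and of a continuous loop that stops when all required features pass or a blocker appears. The `Feature` fields, the `run_round` callback, the outcome strings, and the `max_rounds` cap are all hypothetical; the real `.harness/features.json` schema and coordinator pause rules are not shown in this listing.

```python
from dataclasses import dataclass


@dataclass
class Feature:
    # Hypothetical shape; the real .harness/features.json schema may differ.
    name: str
    required: bool
    passes: bool


def next_failing_required(features):
    """Return the first required feature that has not passed yet, or None."""
    for feature in features:
        if feature.required and not feature.passes:
            return feature
    return None


def run_continuous(features, run_round, max_rounds=20):
    """Advance rounds until all required features pass or a blocker stops the run.

    run_round(feature) stands in for one contract -> implement -> evaluate
    round and returns "pass", "fail", or "blocked" (assumed outcome names).
    """
    rounds = 0
    while rounds < max_rounds:
        feature = next_failing_required(features)
        if feature is None:
            return ("done", rounds)
        rounds += 1
        outcome = run_round(feature)
        if outcome == "blocked":
            return ("blocked", rounds)
        feature.passes = outcome == "pass"
    if next_failing_required(features) is None:
        return ("done", rounds)
    return ("round_limit", rounds)


features = [
    Feature("login", required=True, passes=False),
    Feature("dark-mode", required=False, passes=False),
    Feature("export", required=True, passes=False),
]
status = run_continuous(features, run_round=lambda feature: "pass")
```

In this sketch, optional features such as `dark-mode` never block completion; a supervised session would correspond to a single iteration of the loop with a user confirmation step in between.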

Roles

Agent	Spawned by	Reference
initializer	`/harness:start`	`plugins/harness/skills/harness/roles/initializer.md`
planner	`/harness:start`	`plugins/harness/skills/harness/roles/planner.md`
generator	`/harness:session`, coordinator	`plugins/harness/skills/harness/roles/generator.md`
evaluator	`/harness:session`, coordinator	`plugins/harness/skills/harness/roles/evaluator.md`
coordinator	`/harness:run`	`plugins/harness/skills/harness/roles/coordinator.md`
releaser	`/harness:release`, coordinator	`plugins/harness/skills/harness/roles/releaser.md`

The harness follows an adversarial PDCA pattern: the generator produces artifacts (Do) and the evaluator grades them (Check). The generator cannot self-approve; the evaluator cannot edit product artifacts. This separation prevents the common failure mode where a model is too lenient grading its own work. Agent files are thin YAML wrappers -- all instructions live in role files as the single source of truth.
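The separation described above can be pictured as two checks: a per-role tool allowlist and a rule that a role never grades its own output. This is only an illustrative Python sketch. The tool names echo those mentioned in the listing (Bash, Write, Edit) plus an assumed `Read`, and `ROLE_TOOLS`, `check_tool_use`, and `record_verdict` are hypothetical names, not the plugin's actual configuration or API.

```python
class RoleViolation(Exception):
    """Raised when a role steps outside its assumed permissions."""


# Hypothetical tool grants; the real agent wrappers may list different tools.
ROLE_TOOLS = {
    "generator": {"Read", "Write", "Edit", "Bash"},
    "evaluator": {"Read", "Bash"},  # no Write/Edit: the evaluator never edits code
}


def check_tool_use(role, tool):
    """Reject a tool call that is not in the role's allowlist."""
    if tool not in ROLE_TOOLS[role]:
        raise RoleViolation(f"{role} may not use {tool}")


def record_verdict(author_role, grader_role, verdict):
    """Record a grade, refusing self-approval."""
    if author_role == grader_role:
        raise RoleViolation(f"{author_role} cannot grade its own work")
    return {"author": author_role, "grader": grader_role, "verdict": verdict}


check_tool_use("generator", "Edit")  # allowed in this sketch
result = record_verdict("generator", "evaluator", "fail")
```

Enforcing the rule at the tool-access level, rather than by instruction alone, means a lenient evaluator still has no way to patch the artifact it is grading.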

Authenticity Gate

Every evaluation round applies a binary pass/fail Authenticity Gate after domain criteria scoring. This catches technically-competent-but-generic output -- artifacts that score adequately on domain criteria yet show no evidence of project-specific decision-making.

Dimension	What it checks
`internal_consistency`	Artifacts share consistent conventions -- structure, terminology, style form a unified whole
`intentionality`	Evidence of project-specific decisions, not unmodified defaults or template output
`craft`	Technical fundamentals correct -- hierarchy, structure, naming, formatting
`fitness_for_purpose`	Deliverables usable by target audience without additional explanation

The gate is dual-side: generators apply a pre-implementation checklist (prevention), evaluators apply a post-grading gate (detection). Any dimension failure fails the round regardless of domain scores.
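As a rough illustration of the binary overlay, the Python sketch below combines domain scores with the four authenticity dimensions from the table. The 0-5 scale comes from the plugin description; the `pass_threshold` of 3, the function name `evaluate_round`, and the result fields are assumptions made for this example, not the harness's actual evaluation format.

```python
AUTHENTICITY_DIMENSIONS = (
    "internal_consistency",
    "intentionality",
    "craft",
    "fitness_for_purpose",
)


def evaluate_round(domain_scores, authenticity, pass_threshold=3):
    """Combine 0-5 domain scores with the binary authenticity gate.

    domain_scores: criterion name -> score on a 0-5 scale.
    authenticity: dimension name -> True (pass) or False (fail).
    pass_threshold is an assumed cut-off, not taken from the harness.
    """
    missing = [d for d in AUTHENTICITY_DIMENSIONS if d not in authenticity]
    if missing:
        raise ValueError(f"ungraded authenticity dimensions: {missing}")

    failed = [d for d in AUTHENTICITY_DIMENSIONS if not authenticity[d]]
    domain_ok = all(score >= pass_threshold for score in domain_scores.values())
    return {
        "domain_ok": domain_ok,
        "failed_dimensions": failed,
        # Any failed dimension fails the round, whatever the domain scores are.
        "passed": domain_ok and not failed,
    }


verdict = evaluate_round(
    domain_scores={"functionality": 5, "code_quality": 4},
    authenticity={
        "internal_consistency": True,
        "intentionality": False,  # e.g. unmodified template output
        "craft": True,
        "fitness_for_purpose": True,
    },
)
```

Here the domain scores alone would pass, but the failed `intentionality` check fails the round, which is the case the gate exists to catch.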


Similar Plugins

harness-session

scaffolding

claude-code-harness

egregore

harness-claude

orchestrator-supaconductor

More by xuzhijie-ownself

harness-sdlc-suite
