By Dfintz
Project-agnostic AI agent harness: workflow stage machine with five architectural gates, convergence/workflow/autoresearch-style experiment loops, local-LLM loop agents (Ollama/LM Studio), committed memory, a read-only MCP server, and a live metrics dashboard.
> Use when: You need to track and triage new AI engineering ideas over time for possible adoption in
Alignment-first interrogation that closes the gap between what a human wants and what the agent builds, BEFORE planning or code. The agent relentlessly interviews the user one question at a time until every branch of the decision tree is resolved, challenges the plan against the project's shared language (CONTEXT.md) and prior decisions (ADRs), and records new terminology and hard decisions inline. Pairs with the harness stage machine (grill → Architect Brief → plan-review → Implement). USE WHEN the user asks to grill me, interrogate the plan, align before building, pin down requirements, resolve ambiguity, build a shared language / glossary, write or update an ADR, or says "I'm not sure exactly what I want yet."
Project-agnostic AI agent harness for driving non-trivial work to done. Provides a workflow stage machine (Understand → Architect → Architect-Challenge → Implement → Review-Breadth → Review-Depth → Feedback) with five architectural gates, plus three loop kinds: convergence (iterate until lint/type/build/test are green), workflow (rubric-graded passes like review-fix / feature-cycle / ci-green), and autoresearch-style experiment loops (hill-climb a numeric metric, keep-if-improved else revert). Loop agents can run on a local model via Ollama or LM Studio. Includes committed memory (lessons + Architecture Briefs), an optional knowledge-graph, a read-only MCP server, and a live metrics dashboard. USE WHEN the user asks to run a harness loop, iterate until checks pass, fix build/type/lint/test failures, drive CI to green, keep reviewing until clean, optimize a numeric metric (lint warnings, bundle size, coverage) with a local LLM, plan a non-trivial change through gated stages, record or recall a hard-won lesson, or set up the harness in a repository.
Own this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimOwn this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimBased on adoption, maintenance, documentation, and repository signals. Not a security audit or endorsement.
A project-agnostic harness that gives any AI coding agent (Claude Code, GitHub Copilot, Codex, Cursor, Gemini, …) a consistent operating contract: what to load, what sequence to follow, and how to iterate until done — plus autonomous, metric-driven self-improvement loops that can run on a local LLM, and a live metrics dashboard.
Extracted as a clean, reusable kit. See CREDITS.md for the prior work it builds on,
and HARNESS_CARD.md for the one-page control/agency/runtime design summary.
New here? Run
node scripts/harness/doctor.mjs(ornpm run harness:doctor). It checks your runtime, shows what's available, runs the self-tests, and prints the exact MCP setup for your editor or agent — Claude Code, Cursor, VS Code, Windsurf, Cline, Zed, JetBrains, or a plain terminal. Per-environment recipes:docs/ENVIRONMENTS.md.
The kit is packaged as an Agent Skill and a Claude Code plugin, so it installs into 70+ agents (Claude Code, Codex, Cursor, GitHub Copilot, Gemini CLI, Windsurf, Cline, …) without copying folders by hand.
# Any of 70+ agents, via the open Agent Skills CLI (-g installs globally for your user):
npx skills add <owner>/harness-kit -g
# A specific agent (or several):
npx skills add <owner>/harness-kit -g -a github-copilot -a claude-code
# Or from a local checkout of this kit:
npx skills add ./harness-kit --list # discover, then add --skill harness to install
# Claude Code, via the native plugin marketplace (auto-updates):
/plugin marketplace add <owner>/harness-kit
/plugin install harness-kit
Two layers, on purpose. The skill above is the playbook — it teaches the agent the harness
contract (stages, gates, loops, memory) and is enough for guidance in any repo. The runnable
engine (the scripts/harness/*.mjs loop runners, dashboard, and MCP server) ships with the kit
files; get it by either installing the Claude Code plugin (bundles everything) or adopting the
kit scaffold per SETUP.md. Replace <owner>/harness-kit with wherever you publish this
kit.
| Capability | Where | Notes |
|---|---|---|
| Workflow stage machine | .github/harness/HARNESS.md, .github/instructions/ | Understand → Architect → Architect Challenge (cross-model) → Implement → Review (breadth+depth) → Feedback, with 5 architectural gates |
| Convergence loops | .github/harness/loops/, run-loop.mjs | Iterate until checks (lint/type/build/test) go green |
| Workflow loops | same | Rubric-graded passes (review-fix, feature-cycle, ci-green) |
| Experiment loops (autoresearch-style) | run-experiment.mjs, experiment-loop.mjs | Hill-climb a numeric metric; keep-if-improved, else revert |
| Local-LLM agents | ollama-agent.mjs, ollama-apply-agent.mjs | Drive loops with a local model via Ollama or LM Studio (--provider) |
| Memory | .github/harness/memory/ | Committed lessons + Architecture Briefs (structure only — no lessons shipped) |
| Knowledge graph | graph-refresh-loop.mjs | Optional structural memory (needs the Understand-Anything plugin) |
| MCP server | mcp-server.mjs | Exposes 15 graph/memory/vector + loop/report tools over MCP (.vscode/mcp.json registers it) |
npx claudepluginhub dfintz/fintz-harness-kit --plugin harness-kitHarness-native ECC operator layer - 67 agents, 271 skills, 92 legacy command shims, reusable hooks, rules, selective install profiles, and production-ready workflows for Claude Code, Codex, OpenCode, Cursor, and related agent harnesses
Reliable automation, in-depth debugging, and performance analysis in Chrome using Chrome DevTools and Puppeteer
Plugin that includes the Figma MCP server and Skills for common workflows
Persistent file-based planning for AI coding agents. Crash-proof markdown plans (task_plan.md, findings.md, progress.md) that survive context loss and /clear, with an opt-in completion gate and multi-agent shared state. Manus-style. Works with Claude Code, Codex CLI, Cursor, Kiro, OpenCode and 60+ agents via the SKILL.md standard. Includes Arabic, German, Spanish, and Chinese (Simplified and Traditional).
v9.44.1 — Patch release for Gemini environment/version detection and qwen auth gating. Run /octo:setup.
AI-powered development tools for code review, research, design, and workflow automation.