Developer toolkit for the harness project. Currently provides skill-toolkit for creating, auditing, and improving Claude Agent Skills.
A Claude Code plugin that helps you create, audit, and improve Claude Agent Skills.
One omni skill, three modes, one rubric. Ships as a single skill-toolkit skill whose body is a thin router; each mode's real instructions live under skills/skill-toolkit/references/ and are loaded only when that mode runs (progressive disclosure).
| Mode | Purpose | Writes files? |
|---|---|---|
create | Scaffold a new SKILL.md under .claude/skills/<name>/, walk through intent capture, self-check against the rubric | Yes |
audit | Run a 20-rule rubric against an existing SKILL.md and emit a severity-tagged findings report | No (read-only) |
improve | Consume an audit report and apply minimal-diff fixes, then re-audit | Yes |
All three modes share the same validation rubric (references/rubric.md) and the same frontmatter spec (references/frontmatter-schema.md), so create/audit/improve can never disagree about what "valid" means.
Twenty rules (R01–R20) covering:
name, description, argument-hint, disable-model-invocation, user-invocable, model, effort, allowed-tools, shell, context, agent, hooks, paths)See skills/skill-toolkit/references/rubric.md for the full text.
git clone <this-repo> harness
cd harness
claude --plugin-dir .
Inside Claude Code:
/reload-plugins
You should see skill-toolkit listed under the harness-toolkit plugin. Invoke it via:
"help me create a skill for X" or "audit this SKILL.md" — Claude matches your intent against the description./harness-toolkit:skill-toolkit create <name> (or audit/improve)./harness-toolkit:skill-toolkit create my-new-skill
The skill asks up to 5 intent questions, scaffolds .claude/skills/my-new-skill/SKILL.md from assets/skeleton.md, fills the frontmatter, self-checks against the rubric, and reminds you to /reload-plugins.
/harness-toolkit:skill-toolkit audit .claude/skills/my-new-skill/SKILL.md
Emits a markdown report grouped by severity. Audit mode is strictly read-only — it will never edit your files.
/harness-toolkit:skill-toolkit improve .claude/skills/my-new-skill/SKILL.md --report=<paste or path>
Applies minimal Edit diffs keyed to the audit findings, re-audits, reports residuals.
Short version: Anthropic's own skill-creator is a Skill, not a Subagent. When the authors of both primitives picked one for this exact use case, that's the tiebreaker. Long version is in the plan file at /root/.claude/plans/vast-drifting-harp.md — seven ordered reasons covering file mutation, the audit→improve loop, auto-trigger, progressive disclosure, community consensus, and the escape hatch (context: fork preserves subagent benefits inside a Skill if you need isolation).
harness/
├── LICENSE
├── README.md # this file
├── .claude-plugin/
│ └── plugin.json
└── skills/
└── skill-toolkit/
├── SKILL.md # thin router
├── references/
│ ├── rubric.md # R01–R20 (the single source of truth)
│ ├── frontmatter-schema.md # per-field spec (all 13 fields)
│ ├── mode-create.md # create workflow
│ ├── mode-audit.md # audit workflow (read-only)
│ ├── mode-improve.md # improve workflow
│ ├── anti-patterns.md # bad-vs-good examples
│ └── report-template.md # audit report scaffold
└── assets/
├── skeleton.md # blank SKILL.md template
└── example-skill.md # minimal rubric-passing example
skill-creator — reference implementation this toolkit is modeled on.nyosegawa/skill-auditor, affaan-m/everything-claude-code/skill-stocktake, obra/superpowers/writing-skills.See LICENSE.
Own this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimOwn this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimBased on adoption, maintenance, documentation, and repository signals. Not a security audit or endorsement.
npx claudepluginhub jason-hchsieh/marketplace --plugin harness-toolkitInline code review with MCP integration — add comments in the browser via SSH port forward, let Claude Code read and fix them via MCP tools.
FEP/World-Models scaffolding for stable LLM coding agents: predict-before-act, inspectable belief state, and surprise-triggered reflection.
Send push notifications from Claude Code sessions via ntfy
Ultimate Claude Code skill creator. Design, scaffold, build, review, evolve, and publish production-grade AI agent skills following the Agent Skills open standard and 3-layer architecture.
Create and validate production-grade agent skills with 100-point marketplace grading
Professional skill creation with TDD workflow. Features dual-mode (fast/full), behavioral validation, and automated quality gates for 9.0/10+ scores.
Open collection of AI agent skills — reusable, framework-agnostic SKILL.md packages
Self-evolving skill engine for Claude Code. Creates, scores, repairs, and hardens skills autonomously through recursive improvement cycles.
Create and manage Claude Code skills, plugins, subagents, and hooks. Use when building new skills, validating existing skills, testing skills empirically, creating plugins, converting projects to plugins, creating hooks, or managing plugin automation. Includes /skills-toolkit:skill-composer, /skills-toolkit:skill-refiner, /skills-toolkit:skill-tester, /skills-toolkit:plugin-creator, /skills-toolkit:subagent-creator, /skills-toolkit:hook-creator, and /skills-toolkit:ask-user-question skills.