Slash Command

/SKILL

From agent-harness-kit

Run Mini SWE-bench style harness regression tasks and A/B comparisons to measure harness improvement objectively.
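
To make the idea of an A/B comparison concrete, here is a minimal sketch of comparing two runs of the same task set. It is not the logic of `bench-compare.mjs`, whose input and output formats are not shown on this page. The function name `comparePassRates` and the `{ task, passed }` result shape are assumptions made purely for illustration.

```javascript
// Hypothetical result shape: one entry per task, with a boolean `passed`.
// This is an assumed format, NOT the one produced by bench-runner.mjs.
function comparePassRates(baseline, candidate) {
  // Index each run by task name so the two runs can be matched up.
  const index = (results) => new Map(results.map((r) => [r.task, r.passed]));
  const a = index(baseline);
  const b = index(candidate);

  const fixed = [];     // failed in baseline, passes in candidate
  const regressed = []; // passed in baseline, fails in candidate
  for (const [task, passedBefore] of a) {
    if (!b.has(task)) continue; // only compare tasks present in both runs
    const passedAfter = b.get(task);
    if (!passedBefore && passedAfter) fixed.push(task);
    if (passedBefore && !passedAfter) regressed.push(task);
  }

  // Pass rate = passing tasks / total tasks in that run (0 for an empty run).
  const rate = (m) =>
    m.size === 0 ? 0 : [...m.values()].filter(Boolean).length / m.size;

  return { baselineRate: rate(a), candidateRate: rate(b), fixed, regressed };
}

// Made-up example data for two runs over the same four tasks.
const baseline = [
  { task: "t1", passed: true },
  { task: "t2", passed: false },
  { task: "t3", passed: true },
  { task: "t4", passed: false },
];
const candidate = [
  { task: "t1", passed: true },
  { task: "t2", passed: true },
  { task: "t3", passed: false },
  { task: "t4", passed: true },
];

const report = comparePassRates(baseline, candidate);
console.log(report);
```

Reporting the per-task `fixed` and `regressed` lists alongside the aggregate rates matters because a higher overall pass rate can still hide individual regressions, as with `t3` in this made-up data.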

Invocation

How this command is triggered — by the user, by Claude, or both

Slash command

/agent-harness-kit:SKILL

Model invocable

No pre-commands

Configuration

Namespace: templates/.claude/skills/benchmark-suite/

Tool Access

This command is limited to the following tools:

Read

Bash(node .harness/scripts/bench-runner.mjs:*)

Bash(node .harness/scripts/bench-compare.mjs:*)

Context Preview

The summary Claude sees in its command listing — used to decide when to auto-load this command

# Benchmark Suite

Use this when evaluating whether a harness change improved or regressed behavior.

## Commands



## Output contract

Command Content

29 lines · ~182 tokens

Stats

Language: JavaScript

Stars: 3

Maintenance: Excellent

Last Commit: Jun 11, 2026
