From catalyst
Use when you need to score or grade an artifact against a rubric with a fresh-context evaluator that never saw how the artifact was made — the anti-self-grade guarantee (generator ≠ evaluator). Ships 6 bundled domains (code-quality, ui-design, prose, security, performance, accessibility), each dispatched through a shared brief builder that enforces the separation. Composes with handoff PIPELINE mode's evaluator stages. Trigger phrases: "dispatch an evaluator", "fresh-context evaluator", "anti-self-grade", "pick a rubric", "score this artifact", "rate the quality", "/evaluator-library", plus domain forms like "code-quality review" or "prose review". Use this skill liberally whenever a generated artifact needs an independent quality score, whenever a pipeline stage is an evaluator, or whenever you are about to grade your own output — under-triggering here lets biased self-grades slip through.
How this skill is triggered — by the user, by Claude, or both
Slash command
/catalyst:evaluator-libraryThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
Reusable, dispatch-by-name evaluators for common subjective domains. Solves the problem that every PIPELINE-mode user was hand-rolling rubrics, leading to inconsistent quality and frequent anti-self-grade violations.
Reusable, dispatch-by-name evaluators for common subjective domains. Solves the problem that every PIPELINE-mode user was hand-rolling rubrics, leading to inconsistent quality and frequent anti-self-grade violations.
The library does not run the evaluator itself — it builds a brief and the caller (you, or handoff PIPELINE mode) dispatches a fresh Agent with that brief. This preserves the architectural invariant that the evaluator subagent never sees the generator's transcript.
| Domain | Axes |
|---|---|
code-quality | correctness, readability, maintainability, test_coverage |
ui-design | coherence, originality, craft, functionality |
prose | clarity, accuracy, brevity, hook |
security | input_validation, authn_authz, secrets_handling, owasp_coverage |
performance | algorithmic, allocation, io, blocking_calls |
accessibility | semantic_html, aria, keyboard_nav, contrast |
All rubrics score 1-5 per axis with anchor descriptions. Pass threshold defaults to ≥4 on all axes; configurable via .claude/evaluator-library.json.
User / PIPELINE → /evaluator-library run <domain> <artifact>
↓
Claude interprets commands/evaluator-library.md
↓
Claude runs scripts/dispatch-evaluator.sh → brief (stdout)
↓
Claude dispatches Agent subagent with brief (fresh context)
↓
Evaluator writes .claude/eval-reports/<domain>-<ts>.md
↓
User / orchestrator reads report → decides next step
The brief enforces the anti-self-grade rule by listing it in the Forbidden section. The brief does NOT include any transcript path, session ID, or prior conversation content.
.claude/evaluator-library.json:
{
"pass_threshold": 4
}
The only config field honored by the dispatcher today is pass_threshold (default 4). User-supplied rubric overrides live at .claude/evaluator-library/evaluators/<domain>.md — the dispatcher checks the user path first, falls back to plugin-bundled rubric.
| Command | What it does |
|---|---|
/evaluator-library list | Show all available domains (bundled + user overrides) |
/evaluator-library run <domain> <artifact> | Dispatch evaluator with fresh context |
/evaluator-library run <domain> <artifact> --contract <path> | Include a sprint contract in the brief (PIPELINE mode) |
/evaluator-library show-rubric <domain> | Print the rubric body — for inspection or copying into a custom rubric |
evaluators/<name>.md, then add a SKILL entry.handoff PIPELINE mode invokes evaluator-library at Synthesize precondition stages. Anti-self-grade rule is reinforced.session-health will flag a recovery-spiral at session end if you repeatedly re-dispatch the same evaluator on the same artifact without acting on the verdict — that's a signal you're stuck.pipeline-templates bundled templates reference evaluator-library by name for their evaluator stages.Assumes evaluator subagents need explicit rubrics. May relax if future models internalize domain rubrics with looser prompting. Review annually per Catalyst convention.
npx claudepluginhub adrijshikhar/catalyst --plugin catalystProvides UI/UX resources: 50+ styles, color palettes, font pairings, guidelines, charts for web/mobile across React, Next.js, Vue, Svelte, Tailwind, React Native, Flutter. Aids planning, building, reviewing interfaces.
Fetches up-to-date documentation from Context7 for libraries and frameworks like React, Next.js, Prisma. Use for setup questions, API references, and code examples.