From test-quality-tools
Turn a test-quality scorecard JSON into a self-contained interactive HTML readout — hero stats, a per-axis Win/Tie/Loss distribution chart, and the full per-suite matrix. Use when the user wants to visualize benchmark/scorecard results, regenerate the results dashboards in docs/, or render a {baselines, arms} scorecard (as emitted by the benchmark's score_quality.py / score_cross_language.py, or the test-quality skill's scorer) as a shareable web page. Produces one standalone .html file (Chart.js via CDN, no build step).
How this skill is triggered — by the user, by Claude, or both
Slash command
/test-quality-tools:results-dashboardThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
Render a **scorecard JSON** into one self-contained HTML page you can open in a
Render a scorecard JSON into one self-contained HTML page you can open in a
browser or commit under docs/. The input is the {baselines, arms} shape the
benchmark's scorers emit (see references/scorecard-json-shape.md); the output
is a dark-themed readout with:
lang.gen over base
with ✓ win / ✗ loss / = tie / · n-a, plus the W/L/T tally and a better? badge.<scorecard>.json."score_quality.py / score_cross_language.py (or the
test-quality skill's score.py --json …), to publish the numbers.arms array (and
usually baselines). If the user points at a results-*-scorecard.json,
use it directly. If they only have a raw tests dir, run the relevant scorer
first to produce the JSON, then feed it here.--title and a one-line
--subtitle. Default title is derived from the filename.python <skill>/scripts/build_dashboard.py <scorecard>.json \
-o docs/<name>.html --title "…" --subtitle "…"
"present": false render as an absent row;
null axis values render as ·. Axis keys are read from the first present
arm, so the columns adapt to whichever axes the scorer measured.docs/*.html dashboards in this repo are curated, richer
variants of the same idea — treat this skill's output as the reproducible base.Provides a checklist for code reviews covering functionality, security, performance, maintainability, tests, and quality. Use for pull requests, audits, team standards, and developer training.
npx claudepluginhub rollinsio/beyond-test-coverage --plugin test-quality-tools