From preflight-ux-review
Use when comparing UX risk findings against known product failures, benchmark issues, support evidence, telemetry, or scorer notes. Tracks hits, misses, false positives, precision, recall, and calibration limits.
Slash command
/preflight-ux-review:calibrate-ux-findings
Use this skill when predictions need to be checked against known outcomes.
The goal is calibration: identify what the review caught, missed, and over-predicted.
If known issues are weakly sourced, mark the calibration as provisional.
If the current workspace is a Preflight UX checkout, or PREFLIGHT_UX_REPO
points to one, prefer the repository scoring artifacts:
- `python3 scripts/preflight_ux.py locate`.
- `python3 scripts/preflight_ux.py validate`.
- `python3 scripts/preflight_ux.py uxpanel score --run <run.json> --benchmark <benchmark-dir> --out <score.json>`.
- `python3 scripts/preflight_ux.py uxpanel compare-scores <score-a.json> <score-b.json>`.
- `calibration/false_positives.md`, `calibration/misses.md`, and scorer notes when they exist.

If no checkout is available, calculate the calibration manually from the supplied findings and known issues, and state that repo scoring was not run.
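When repo scoring is unavailable, the manual calculation might look something like the sketch below. It is not the repository scorer: the `Calibration` class, the `calibrate` function, and the issue identifiers are all hypothetical, and it assumes predictions and known issues have already been reduced to comparable identifiers.

```python
from dataclasses import dataclass


@dataclass
class Calibration:
    """Hypothetical container for manual calibration counts."""
    hits: int             # known issues matched by a prediction
    misses: int           # known issues with no matching prediction
    false_positives: int  # predictions that match no known issue

    @property
    def precision(self):
        # Share of predictions that correspond to a known issue.
        predicted = self.hits + self.false_positives
        return self.hits / predicted if predicted else None

    @property
    def recall(self):
        # Share of known issues that the review predicted.
        known = self.hits + self.misses
        return self.hits / known if known else None


def calibrate(predicted, known):
    """Compare two collections of already-normalized issue identifiers."""
    predicted, known = set(predicted), set(known)
    return Calibration(
        hits=len(predicted & known),
        misses=len(known - predicted),
        false_positives=len(predicted - known),
    )


# Illustrative identifiers only; they are not taken from any benchmark.
result = calibrate(
    predicted={"checkout-error-copy", "hidden-cancel-flow", "slow-search"},
    known={"checkout-error-copy", "hidden-cancel-flow", "broken-password-reset"},
)
print(result.hits, result.misses, result.false_positives)
print(result.precision, result.recall)
```

Precision and recall return `None` when their denominator is zero, so an empty prediction list or an empty known-issue list is reported as undefined and not as a score of zero.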
Use class-level scoring when only issue categories are reliable.
Use issue-level scoring when each known issue is specific enough to match one prediction to one documented failure.
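The difference between the two scoring levels can be illustrated with a small sketch. The record shapes, identifiers, and function names below are assumptions made for illustration; the repository scorer may represent matches differently.

```python
# Hypothetical known issues: (issue_id, issue_class)
known = [
    ("K1", "navigation"),
    ("K2", "error-recovery"),
    ("K3", "navigation"),
]

# Hypothetical predictions: (prediction_id, issue_class, matched_known_id or None)
predictions = [
    ("P1", "navigation", "K1"),
    ("P2", "error-recovery", None),  # right class, no specific documented failure
    ("P3", "onboarding", None),
]


def issue_level_hits(known, predictions):
    """Known issues that some prediction was matched to one-to-one."""
    known_ids = {issue_id for issue_id, _ in known}
    return {match for _, _, match in predictions if match in known_ids}


def class_level_hits(known, predictions):
    """Issue classes that appear in both the known issues and the predictions."""
    known_classes = {issue_class for _, issue_class in known}
    predicted_classes = {issue_class for _, issue_class, _ in predictions}
    return known_classes & predicted_classes


issue_recall = len(issue_level_hits(known, predictions)) / len(known)
class_recall = len(class_level_hits(known, predictions)) / len({c for _, c in known})
print(issue_recall, class_recall)
```

In this example, class-level recall is perfect while issue-level recall is one in three, so a calibration report should say which level was used.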
Return:
| Known issue | Issue class | Match | Prediction | Rationale |
|---|---|---|---|---|
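As a rough sketch, rows for the table above could be generated as follows. The `render_rows` helper, the example row, and the `hit` value in the Match column are assumptions; the skill does not specify the allowed Match values.

```python
def render_rows(rows):
    """Render (known issue, issue class, match, prediction, rationale) tuples as a table."""
    lines = [
        "| Known issue | Issue class | Match | Prediction | Rationale |",
        "|---|---|---|---|---|",
    ]
    for row in rows:
        # Escape pipes so free-text cells cannot break the column layout.
        cells = [str(cell).replace("|", "\\|") for cell in row]
        lines.append("| " + " | ".join(cells) + " |")
    return "\n".join(lines)


# Illustrative row only; not taken from any real review or benchmark.
table = render_rows([
    ("Unclear checkout error copy", "error-recovery", "hit", "P1",
     "Prediction names the same screen and failure"),
])
print(table)
```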
Then summarize:
npx claudepluginhub sparckix/preflight-ux --plugin preflight-ux-review