From u22a8
Score content against scoring model traits. Use when the user wants to measure how content performs on specific traits.
Slash command
/u22a8:evaluate
This skill is limited to the following tools:
A scoring model is a standard of judgment, learned from examples, that can score any content. Each model has traits — distinct dimensions of quality — and scores content 0–100 per trait. Scores are deterministic and instant.
Score content against specific traits. Measurement only — no edits.
Identify content to score. Read from a file path ($ARGUMENTS), a URL, a selection, or conversation context. URLs (http/https) can be passed directly to score — they are fetched and text-extracted automatically. If unclear, ask the user what content to evaluate.
Determine the model. If the user specifies a model, use it. Otherwise, call list_models to see what's available and either pick the most relevant one based on context (e.g. a README → u22a8.compelling-readme) or present the options and let the user choose.
Identify traits. Use list_traits to show the model's traits. Suggest relevant ones based on context, but confirm the user's choice. Do NOT silently evaluate all traits unless the user explicitly asks for "all".
Score the content. Call score with the model handle, content text, and specified traits.
Present results clearly. For each trait, show the score, the zone label, and the confidence, plus the headroom when one is reported.
Report composite. Show the composite score. If headroom > 0, mention which trait is the bottleneck.
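The first step above separates URLs, which go to score unchanged, from file paths, which have to be read first. A minimal sketch of that decision follows. The function name `resolve_content` and its return shape are hypothetical and not part of the skill, and treating anything that is neither a URL nor a readable file as "unclear" is an assumption.

```python
from pathlib import Path
from urllib.parse import urlparse


def resolve_content(argument: str) -> tuple[str, str]:
    """Classify an argument as a URL, readable file text, or unclear.

    Hypothetical helper: the (kind, payload) return shape is an
    illustration only, not something the skill defines.
    """
    parsed = urlparse(argument)
    # http/https URLs are passed through untouched; the source says
    # score fetches and text-extracts them itself.
    if parsed.scheme in ("http", "https") and parsed.netloc:
        return ("url", argument)
    path = Path(argument)
    if path.is_file():
        return ("text", path.read_text(encoding="utf-8"))
    # Anything else: ask the user what content to evaluate.
    return ("unclear", argument)
```

An "unclear" result corresponds to the case where the skill should ask the user what to evaluate rather than guess.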
Use the zone and breaks from the detail field — these are data-driven from the model, not fixed ranges. The breaks define three thresholds (developing, solid, strong) that separate four labels:
Strong: the score reaches breaks.strong
Solid: between breaks.solid and breaks.strong
Developing: between breaks.developing and breaks.solid
Lowest label: below breaks.developing
Present these labels rather than raw numbers when interpreting results.
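The three thresholds described above can be turned into a label with a simple comparison chain. This is only a sketch under stated assumptions: the `breaks` dictionary shape, the inclusive lower bounds, and the placeholder name for the lowest label are all assumptions, and the threshold values are made up for illustration. In practice, prefer the zone that the detail field already reports.

```python
def zone_label(score: float, breaks: dict[str, float]) -> str:
    """Map a 0-100 score to a label using model-supplied breaks.

    Assumption: each threshold is an inclusive lower bound.
    """
    if score >= breaks["strong"]:
        return "Strong"
    if score >= breaks["solid"]:
        return "Solid"
    if score >= breaks["developing"]:
        return "Developing"
    # Placeholder: the source does not name the lowest label.
    return "Below developing"


# Made-up thresholds for illustration; real ones come from the model.
example_breaks = {"developing": 40, "solid": 57, "strong": 80}
```

With these illustrative breaks, a score of 72 maps to "Solid" and a score of 45 maps to "Developing".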
score. The scoring handles any necessary processing internally.
detail field.
User: "Evaluate my README"
u22a8.compelling-readme is the natural fit; use it directly
list_traits to show available traits, confirm with user
score with chosen traits
Clarity: 72 — Solid (confidence: high)
Flow: 45 — Developing (confidence: moderate, headroom: 12)
Composite: 56
Bottleneck: Flow (12 points from solid)
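A report in the shape of the example above could be assembled roughly as follows. The dictionary keys (`trait`, `score`, `zone`, `confidence`, `headroom`) are a hypothetical stand-in for whatever score actually returns, the composite is passed in rather than computed because the source does not say how it is derived, and choosing the lowest-scoring trait with headroom as the bottleneck is an assumption.

```python
def format_report(traits: list[dict], composite: int) -> str:
    """Render per-trait lines, the composite, and an optional bottleneck.

    Hypothetical input shape; not the skill's real response format.
    """
    lines = []
    for t in traits:
        extras = [f"confidence: {t['confidence']}"]
        if t.get("headroom", 0) > 0:
            extras.append(f"headroom: {t['headroom']}")
        # \u2014 reproduces the dash used in the example output.
        lines.append(
            f"{t['trait']}: {t['score']} \u2014 {t['zone']} ({', '.join(extras)})"
        )
    lines.append(f"Composite: {composite}")
    # Assumption: the bottleneck is the lowest-scoring trait with headroom.
    limited = [t for t in traits if t.get("headroom", 0) > 0]
    if limited:
        worst = min(limited, key=lambda t: t["score"])
        lines.append(f"Bottleneck: {worst['trait']} (headroom: {worst['headroom']})")
    return "\n".join(lines)


sample = [
    {"trait": "Clarity", "score": 72, "zone": "Solid", "confidence": "high"},
    {"trait": "Flow", "score": 45, "zone": "Developing",
     "confidence": "moderate", "headroom": 12},
]
report = format_report(sample, composite=56)
```

The bottleneck line is omitted entirely when no trait reports headroom above zero.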
npx claudepluginhub u22a8/plugins --plugin u22a8