Skill

bias-fairness-assessment

Assess an AI model, feature, or dataset for bias and fairness across groups — representational and allocative harms, disparate performance, and skewed refusals — using appropriate fairness metrics, and recommend mitigations. Use when evaluating whether an AI system treats people equitably.

Popularity

Parent stars

Invocation

How this skill is triggered — by the user, by Claude, or both

Slash command

/ai-safety:bias-fairness-assessment

User invocable

Model invocable

Inline context

Default effort

Context Preview

The summary Claude sees in its skill listing — used to decide when to auto-load this skill

A fairness assessment that identifies where the system performs or behaves

SKILL.md

53 lines · ~618 tokens

Stats

Parent stars1

MaintenanceGood

Last CommitMay 31, 2026

Actions

View Source View Plugin View on GitHub View README

Stats

Actions

Goal

A fairness assessment that identifies where the system performs or behaves disparately across groups, quantifies it with suitable metrics, and proposes mitigations — distinguishing the type of harm.

Frame the harm type first

Allocative — the system influences resource/opportunity decisions (hiring, lending, housing, moderation, healthcare). Fairness of outcomes matters most.
Representational — the system depicts/describes groups (stereotyping, erasure, demeaning content, quality-of-service gaps). Fairness of treatment.

Steps

Identify groups and context — protected/sensitive attributes relevant to the use case and jurisdiction; note intersectional groups. Define what "fair" means here (it's context-dependent and metrics can conflict).
Choose metrics to match the harm:
- Allocative: demographic parity, equalized odds, equal opportunity, predictive parity, calibration — pick by which error is most harmful; they trade off.
- Representational/generative: disparate refusal rate, sentiment/toxicity skew, stereotype rate, quality (accuracy/helpfulness) parity across groups.
Measure on a representative, consented dataset (watch for unrepresentative or skewed data — that's itself a finding). Report gaps with confidence.
Diagnose sources — data imbalance, label bias, proxy features, objective mismatch, feedback loops.
Recommend mitigations — data/representation fixes, reweighting/constraints, threshold adjustment, post-processing, human review, scope limits — and note residual disparity and metric trade-offs.

Output

A fairness report: harm type · groups · metric · measured disparity · likely source · mitigation · residual/trade-off. Use security-reporting; visualize gaps with security-diagramming:infographic.

Notes

There is no single "fair" — metrics conflict and the right choice depends on which error harms people most in this context. State the chosen definition and why. Beware proxies: removing a protected attribute doesn't remove bias carried by correlated features.

bias-fairness-assessment

Popularity

Invocation

Context Preview

SKILL.md

bias-fairness-assessment

Popularity

Invocation

Context Preview

SKILL.md

Goal

Frame the harm type first

Steps

Output

Notes

Similar Skills

Goal

Frame the harm type first

Steps

Output

Notes

Similar Skills