From ai-safety
Assess an AI model, feature, or dataset for bias and fairness across groups — representational and allocative harms, disparate performance, and skewed refusals — using appropriate fairness metrics, and recommend mitigations. Use when evaluating whether an AI system treats people equitably.
How this skill is triggered — by the user, by Claude, or both
Slash command
/ai-safety:bias-fairness-assessmentThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
A fairness assessment that identifies where the system performs or behaves
A fairness assessment that identifies where the system performs or behaves disparately across groups, quantifies it with suitable metrics, and proposes mitigations — distinguishing the type of harm.
A fairness report: harm type · groups · metric · measured disparity · likely source
· mitigation · residual/trade-off. Use security-reporting; visualize gaps with
security-diagramming:infographic.
There is no single "fair" — metrics conflict and the right choice depends on which error harms people most in this context. State the chosen definition and why. Beware proxies: removing a protected attribute doesn't remove bias carried by correlated features.
npx claudepluginhub jassics/awesome-claude-security --plugin ai-safetyProvides CDSS development patterns for drug interaction checking, dose validation, clinical scoring (NEWS2, qSOFA), and alert classification integrated into EMR workflows.