Use when scoping or commissioning a psychological audit of an AI/ML personnel assessment — to define which claims the audit will evaluate (validity, utility, lack of bias), establish the auditor's stance and credibility (internal / external / independent), decide formative vs. summative timing and the audience, and settle data/documentation access and disclosure terms. Triggers: "audit an AI hiring tool", "plan an algorithm audit", "bias audit scope", "internal vs external vs independent auditor", "formative vs summative audit", "NYC Local Law 144 bias audit", "what claims should the audit test".
How this skill is triggered — by the user, by Claude, or both
Slash command
/ai-personnel-assessment:ai-audit-planningThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
Scope the **psychological audit** before doing it. A psychological audit is *an impartially conducted
Scope the psychological audit before doing it. A psychological audit is an impartially conducted conceptual and empirical evaluation of claims about psychological characteristics — traits, attitudes, emotions, behaviors, and other quantities not directly observable — as measured or predicted by algorithms. The target characteristics are set by the developers, and may not correspond to what the algorithm ultimately measures or predicts — so the audit must surface that gap.
An audit must be both conceptually grounded in psychometric theory and empirically tested, using a research design appropriate to the claims and interpreted according to modern standards of test evaluation.
Audits can evaluate claims of any type, but are most valuable for:
ai-fairness-lenses first.Write each developer claim as an explicit, testable statement (e.g., "this algorithm predicts job performance equally well for all groups using appropriate modeling techniques") — then design to evaluate it. In the focal example, that exact claim is questionable and should be audited.
Auditors come in three types; none is automatically credible:
It's intuitive to rank these by credibility, but access to data and documentation varies greatly even within a type (NDAs can restrict how deeply anyone can probe). So evaluate every auditor — regardless of source — on: their stated measurement standards and definitions of bias, whether they applied them in due course, and the terms of access and nondisclosure (what may be withheld at the company's discretion should itself be disclosed). Claims that "an algorithmic system has been audited and is therefore credible" should be viewed skeptically.
Prefer building auditing into the development lifecycle, not just bolting it on afterward.
The audit's purpose is often driven by its intended audience. Plan to produce results in
multiple formats for all relevant audiences (a precise technical report and a layperson-friendly
summary for those whose predictions are affected) — see ai-audit-reporting. Decide the release
policy up front: in general, unless there's a compelling, transparently stated reason, audits in
the public interest should be released; withholding risks the credibility of the audit, the company,
and the auditors.
Confirm what data, documentation, and code you can examine, and record access limits as findings in their own right. The audit will span the 12 components across three categories (see the collection README and Table 1):
ai-input-data-and-design-audit, ai-model-development-audit,
ai-model-outputs-auditai-claims-and-stakeholder-auditai-audit-meta-componentsTable 1 is not exhaustive — a framework for the most important issues. When a claim can't be fully evaluated within it, ask additional questions tied to relevant professional standards.
Some laws mandate audits (e.g., NYC Local Law 144 requires an annual "bias audit" of automated employment decision tools; many bills leave "bias" undefined so the term stays adaptable). A compliance audit and a scientifically meaningful psychological audit are not the same — satisfying a statute's checklist does not establish validity or lack of bias under Lens 1/3. Scope both explicitly and don't let one masquerade as the other. (Coordinate with counsel on jurisdictional law.)
ai-fairness-lenses).ai-fairness-lensesai-fairness-lenses (prerequisite) · ai-audit-reporting · all model/stakeholder/meta audit skills ·
validation-planning · technical-validation-report
Source: Landers & Behrend (2023), "Designing an Effective Psychological Audit"; auditor types & credibility; formative auditing; "Major Components of an Audit" / Table 1.
npx claudepluginhub openmatter-network/agent-io-skills --plugin ai-personnel-assessmentCreates, edits, and optimizes skills for Claude Code, including drafting, evaluating with test prompts, iterating on performance, and improving skill descriptions for better triggering accuracy.