From agent-discipline
Audit a coding agent's configuration for operating discipline against the Agent Discipline Standard. Use when asked to "audit my setup", "score my CLAUDE.md", "check my agent config", "how disciplined is my config", "valuta la mia config", "audit agent discipline", or to assess whether a CLAUDE.md / AGENTS.md / .cursorrules and its harness enforce good engineering practices. Produces a 0-100 score, per-criterion evidence, and concrete diffs.
How this skill is triggered — by the user, by Claude, or both
Slash command
/agent-discipline:audit-setupThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
You are auditing the **operating discipline** of a coding agent's configuration: whether it actually forces disciplined engineering, and whether the critical rules are *mechanically enforced* rather than just written down. You do **not** check syntax (stale model IDs, broken paths) — that is the job of `agnix`/`cclint`; say so if the user expects it.
You are auditing the operating discipline of a coding agent's configuration: whether it actually forces disciplined engineering, and whether the critical rules are mechanically enforced rather than just written down. You do not check syntax (stale model IDs, broken paths) — that is the job of agnix/cclint; say so if the user expects it.
Read STANDARD.md — the single source of truth for all criteria, weights, and scoring rules. Look for it, in order:
../../STANDARD.md relative to this skill)../STANDARD.md) — the standalone-install case.Apply the criteria as written in STANDARD.md, not from this file. This skill is the procedure; the standard is the rubric. If the two ever disagree, the standard wins.
Two things get audited:
CLAUDE.md (project and global), AGENTS.md, .cursorrules, system prompt, or equivalent. Read all that exist; merge them — a rule satisfied in the global config counts for the project too.settings.json, settings.local.json, plugin hooks.json, managed settings — wherever hooks and permissions.deny live. This is required for the EN-* (Enforcement) criteria. If you cannot access it, score EN-* as 0 and say explicitly that enforcement could not be verified (do not guess).State which files you found before scoring. If a config is missing entirely, that is itself the finding.
Follow the two scoring kinds defined in STANDARD.md:
AD-* (behavioral): score = weight × (0.5 × presence + 0.5 × quality).
EN-* (enforcement): walk the binary checklist against the harness config. Each covered item earns its fixed share. No model judgment — either the hook/deny entry exists or it doesn't. Cite the exact entry.Stability discipline: the AD-* presence half and all of EN-* are deterministic — they must not vary between runs. Only the AD-* quality half carries judgment, and it is bounded per-criterion. If you find yourself about to award a wildly different score than a plain reading justifies, re-read the criterion. The headline number should be reproducible within a few points.
Sum the weighted criterion scores to a 0–100 total. Assign the band from STANDARD.md (🔴 Risky / 🟠 Workable / 🟡 Solid / 🟢 Battle-tested).
Output in this shape:
# Agent Discipline Audit
**Score: <N>/100 — <band emoji> <band>**
Audited: <files found>
## Scores by category
| Criterion | Score | Evidence |
|---|---|---|
| AD-1 Anti-loop | 9/9 | "after 2 identical failures, stop" — global CLAUDE.md:14 |
| ... | ... | ... |
| EN-1 Destructive guardrail | 3/5 | deny rm -rf, push --force; missing cloud-delete, branch -D |
## Top gaps — concrete diffs
For each low criterion, the smallest change that raises it:
### AD-13 Scope discipline — 0/7 → ~6/7
Add to CLAUDE.md:
> Stay within the requested scope. No unrequested features, no opportunistic
> refactors outside the task. Propose before expanding scope; prefer the
> smallest diff that solves it.
### EN-1 Destructive guardrail — 3/5 → 5/5
Add to settings.json `permissions.deny`:
> "Bash(git branch -D *)", "Bash(gcloud * delete*)", "Bash(firebase * delete*)"
Rules for the report:
agnix for the deterministic checks this standard deliberately skips.If the user asks you to apply the diffs (not just report), edit the config files directly, then re-run the audit to confirm the new score. Treat config edits as you would any change: small, reversible, and confirmed before destructive overwrites.
Provides behavioral guidelines to reduce common LLM coding mistakes, focusing on simplicity, surgical changes, assumption surfacing, and verifiable success criteria.
Searches, retrieves, and installs Agent Skills from prompts.chat registry using MCP tools like search_skills and get_skill. Activates for finding skills, browsing catalogs, or extending Claude.
npx claudepluginhub spinabuilds/agent-discipline --plugin agent-discipline