Assess the AI readiness of a codebase and produce an autonomy maturity map. Trigger phrases: "assess readiness", "how AI-friendly is this codebase", "autonomy maturity", "readiness score", "AI readiness assessment", "how ready is my codebase for AI agents"
How this skill is triggered — by the user, by Claude, or both
Slash command
/codebase-ai-readiness:assess-readiness [path-to-codebase] (defaults to current directory)[path-to-codebase] (defaults to current directory)This skill is limited to the following tools:
The summary Claude sees in its skill listing — used to decide when to auto-load this skill
Produce an autonomy maturity map for the target codebase. The output is a `readiness-report.md` file containing a numeric score, category breakdown, workflow artifact summary, collaboration effectiveness assessment, recommended autonomy level, blockers, and a prioritized roadmap.
references/agent-contributor-model.mdreferences/autonomy-levels.mdreferences/category-definitions-agent.mdreferences/category-definitions.mdreferences/collaboration-metrics.mdreferences/compound-engineering.mdreferences/context-engineering.mdreferences/feedforward-surfaces.mdreferences/report-template.mdreferences/scoring-rubric.mdreferences/workflow-artifacts.mdProduce an autonomy maturity map for the target codebase. The output is a readiness-report.md file containing a numeric score, category breakdown, workflow artifact summary, collaboration effectiveness assessment, recommended autonomy level, blockers, and a prioritized roadmap.
Use Glob and Grep to understand the project:
package.json, Cargo.toml, go.mod, pyproject.toml, pom.xml, *.csproj).github/workflows/, .gitlab-ci.yml, Jenkinsfile, .circleci/).pre-commit-config.yaml, .husky/, lefthook.yml, .git/hooks/)test/, tests/, __tests__/, *_test.go, *_spec.rb)README.md, docs/, CHANGELOG.md, ADR/, adr/)tsconfig.json, mypy.ini, pyrightconfig.json, .strict)Dockerfile, docker-compose.yml, .devcontainer/, flake.nix, mise.toml)cdk.json, cdktf.json, terraform/, *.tf, Pulumi.yaml, template.yaml (SAM), *.bicep, cloudformation/)*.proto, openapi.*, *.schema.json, swagger.*)AGENTS.md, CLAUDE.md, .cursorrules, .cursor/rules/)docs/plans/, docs/exec-plans/, PLANS.md)docs/requirements/, docs/specs/, requirements/), design (docs/design/, design/), review learnings (docs/reviews/, docs/learnings/) — load references/workflow-artifacts.md.env.example, config.schema.json)CODEOWNERS, OWNERS)eslint-plugin-boundaries in config, deptry, madge, ArchUnit, structural tests)plop, hygen, cookiecutter, .template files).claude/skills/, agent skill directories)hooks.json, post-tool-use automation)wc -l, excluding vendored/generated/build dirs), not by eyeballing a few. Report the share of files under 300 lines (the "good" target in category 2.15) and flag outliers over 500 lines. The two thresholds are distinct: 300 is the per-file comprehension target; 500 marks a file large enough to warrant splitting.Evaluate 15 categories. Load references/category-definitions.md and references/category-definitions-agent.md for detailed signals. Also load references/agent-contributor-model.md for the framing principles.
2.1 Structure and modularity
2.2 Documentation
2.3 Testable boundaries
2.4 CI reliability
flaky labels)--onlyChanged, Launchable) — keeps feedback fast as agent-generated test volume grows2.5 Typing strength
strict: true in tsconfig, --strict in mypy)any, type: ignore, as unknown)2.6 Deterministic environment and deployment
.env.example, .env.template)2.7 Architecture decisions
2.8 Machine-readable intent
git log (closing keywords like Closes #123, bare #123, or tracker keys). If git history is unavailable, mark the PR-link signal "not inspectable" rather than scoring it absent.2.9 Progressive context disclosure
2.10 Hidden state and magic
.env.example, config schema)2.11 Repository-scale reasoning
2.12 Failure mode legibility
2.13 Feedforward surfaces
Load references/feedforward-surfaces.md for detailed scoring signals.
--no-verify; branch protection enforces checks server-side)2.14 Compound engineering readiness
Load references/compound-engineering.md and references/workflow-artifacts.md for detailed scoring signals.
2.15 Context engineering friendliness
Load references/context-engineering.md for detailed scoring signals.
Load references/scoring-rubric.md for scoring criteria.
Assign each category a score from 0-100:
Use the category weights from the scoring rubric to compute a weighted average (0-100).
Load references/autonomy-levels.md and map the overall score to L0-L5.
For the current autonomy level, identify what specifically prevents moving to the next level. Be concrete: name missing files, missing configurations, weak categories.
Load references/collaboration-metrics.md. Estimate or mark "not measured" for first-pass
acceptance, iteration cycles per task, and post-merge rework. Note infrastructure enablers
(PR templates, labels, review rubrics, learning docs). Produce 2-4 recommendations to start
or improve tracking aligned with the recommended autonomy level.
Produce a prioritized list of 5-10 recommended actions. For each action:
Write readiness-report.md in the codebase root. Load references/report-template.md for the
required sections and tables.
npx claudepluginhub krokoko/cairn --plugin codebase-ai-readinessCreates, edits, and optimizes skills for Claude Code, including drafting, evaluating with test prompts, iterating on performance, and improving skill descriptions for better triggering accuracy.