claudekit

Name: claudekit
Author: duthaho

Execute structured, evidence-driven engineering workflows in Claude Code across 5 phases (Investigate, Design, Implement, Verify, Ship) using 15 skills and 8 specialist agents that enforce TDD, root-cause analysis, architecture/UX/security reviews, dependency audits, PR preparation, and incremental shipping with pasted test outputs, logs, and diffs as proof before advancing.

npx claudepluginhub duthaho/claudekit --plugin claudekit

Popularity

Stars: Top 10% (Med: 0 · Avg: 285)
Installs: Top 1% (Med: 0 · Avg: 1)

What's Inside

Agents (8)

architect

/architect

Use when reviewing the architecture dimension of a written plan. Dispatched primarily by plan-review-architecture (via plan-review). Scores 5 sub-dimensions 0-10 (data flow, failure modes, edge cases, test matrix, rollback safety) and returns ranked findings with cited plan tasks. <example> Context: A plan has been written and is about to be implemented. user: "Run plan-review on the cache-invalidation plan." assistant: "Dispatching the architect agent to score the architecture dimension while the experience-reviewer runs in parallel." </example> <example> Context: A migration plan needs an architecture-only pass. user: "I just need an arch review on this — skip the UX review." assistant: "Dispatching the architect agent directly." </example>

code-reviewer

/code-reviewer

Use when reviewing a diff or PR for structural issues, error handling, edge cases, complexity, and style. Dispatched primarily by code-review-loop. Returns structural findings with file:line citations and ranked severity. Pairs with security-auditor for sensitive paths. <example> Context: A PR is ready for first-pass review. user: "Review my charge-endpoint PR before I tag humans." assistant: "Dispatching the code-reviewer agent to find structural issues, error-handling gaps, and complexity hotspots." </example> <example> Context: A refactor PR needs a sanity check. user: "Sanity-check this refactor PR." assistant: "Dispatching the code-reviewer to confirm behavior preservation and look for unintended changes." </example>

experience-reviewer

/experience-reviewer

Use when reviewing the experience dimension of a written plan (UX + DX). Dispatched primarily by plan-review-experience (via plan-review). Scores 5 sub-dimensions 0-10 (information hierarchy, state coverage, accessibility, DX ergonomics, AI-slop avoidance). <example> Context: A plan with both UI and API changes needs review. user: "Run plan-review on the dashboard plan." assistant: "Dispatching the experience-reviewer agent in parallel with the architect to cover UX and DX in one pass." </example> <example> Context: A new public API surface is being added. user: "Review the DX of the new webhook API plan." assistant: "Dispatching the experience-reviewer to score DX ergonomics, error copy, and discoverability." </example>

investigator

/investigator

Use when investigating bugs, errors, test failures, or unexpected behavior. Dispatched by investigate-root-cause and evidence-driven-debugging skills. Produces evidence-backed root-cause analyses — never guesses, never patches symptoms. <example> Context: An API endpoint is returning intermittent 500s. user: "The /api/users endpoint is throwing 500s sometimes." assistant: "Dispatching the investigator agent to gather evidence, write a hypothesis, and prove or refute it before any fix." </example> <example> Context: Tests passed locally but fail in CI. user: "My tests pass locally but CI is red." assistant: "Dispatching the investigator to find the env diff between local and CI and produce a hypothesis." </example>

planner

/planner

Use when decomposing a spec into an executable plan. Dispatched primarily by the write-plan skill. Produces a numbered task list with file paths, exact test commands, dependency annotations, acceptance criteria per task, and a Risks section. <example> Context: An approved spec exists; implementation hasn't started. user: "Turn the auth-rotation spec into a plan we can execute." assistant: "Dispatching the planner agent to produce a numbered task list with file paths, test commands, and rollback notes." </example> <example> Context: A previous plan was rejected during plan-review for being too vague. user: "Re-plan the migration; the reviewers said it had no acceptance criteria." assistant: "Dispatching the planner agent to rebuild the plan with falsifiable acceptance lines per task." </example>

Skills (15)

audit-dependencies

/audit-dependencies

Use when investigating dependency bloat, security advisories, supply-chain risk, upgrade planning, or before adding a new third-party package. Activate for keywords like "deps", "dependencies", "package.json", "requirements.txt", "Cargo.toml", "audit", "CVE", "stale package", "do we use", "what depends on", "transitive dep". Produces a written audit with import-graph evidence — never trust scanner output without verifying call sites.
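
As a minimal sketch of the call-site evidence this skill asks for, limited to Python sources: walk the syntax tree and record every line that imports a given package. The function name `find_import_sites` and the sample source are hypothetical and not part of claudekit.

```python
import ast


def find_import_sites(source: str, package: str) -> list[int]:
    """Return sorted line numbers where `package` (or a submodule) is imported."""
    sites = []
    for node in ast.walk(ast.parse(source)):
        if isinstance(node, ast.Import):
            names = [alias.name for alias in node.names]
        elif isinstance(node, ast.ImportFrom) and node.level == 0:
            # Relative imports (level > 0) never refer to a third-party package.
            names = [node.module or ""]
        else:
            continue
        if any(n == package or n.startswith(package + ".") for n in names):
            sites.append(node.lineno)
    return sorted(sites)


# Hypothetical file contents used only for illustration.
SAMPLE = """import os
import requests
from requests.adapters import HTTPAdapter
from . import requests_shim
"""

print(find_import_sites(SAMPLE, "requests"))  # [2, 3]
```

An empty result for a package that a scanner flags is a hint that the dependency may be unused in that file, though dynamic imports would escape this check.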

code-review-loop

/code-review-loop

Use when opening a PR for review or when receiving review feedback. Activate for keywords like "code review", "PR review", "request review", "review feedback", "address comments", "reviewer said". Covers both ends of the loop: preparing a reviewable PR and acting on feedback rigorously. Always engage with every comment -- never dismiss feedback by silently ignoring it.

evidence-driven-debugging

/evidence-driven-debugging

Use during active debugging when you have a hypothesis to test or need to instrument a running system. Activate for keywords like "debug", "instrument", "log", "trace", "breakpoint", "what's happening at runtime", "production behavior". Pairs with investigate-root-cause as its hands-on phase. Always record what you observed -- never debug entirely "in your head" without leaving evidence behind.

incremental-shipping

/incremental-shipping

Use when implementing a non-trivial feature, migration, or refactor that would otherwise be a single large change. Activate for keywords like "feature flag", "incremental", "vertical slice", "migration", "rollout", "behind a flag", "ship small". Enforces vertical slicing, feature-flagged rollout, and refactor-with-evidence (behavior-preserving changes proved by test/perf deltas). Always ship the smallest reversible change -- never bundle unrelated improvements.

init

/init

Interactive setup wizard for claudekit. Scaffolds rules, hooks, and MCP server configs into the user's project. Run /claudekit:init to configure. Use when setting up a new project with claudekit or reconfiguring an existing one.

Output Styles (5)

Brainstorm

brainstorm

keeps coding instructions

Creative exploration mode — divergent thinking, multiple alternatives, structured trade-offs before any code

Review

review

keeps coding instructions

Critical analysis mode — find issues first, severity-tagged findings, actionable suggestions

Deep Research

deep-research

keeps coding instructions

Thorough investigation mode — completeness over speed, evidence-cited, confidence levels named

Implementation

implementation

keeps coding instructions

Code-focused execution mode — minimal prose, action-oriented updates, follow established patterns

Token Efficient

token-efficient

keeps coding instructions

Compressed output mode — minimal prose, code-first, no preambles

Stats

Version: 4.0.0
Language: JavaScript
Stars: 93
Forks: 55
Copy clicks: 7
Maintenance: Excellent
License: MIT
Last Commit: May 7, 2026
Added: Apr 21, 2026

Available In

claudekit-dev90 claudekit-marketplace

Safety Signals

Caution

Uses power tools

Uses Bash, Write, or Edit tools

README

Claude Kit

A verification-first engineering toolkit for Claude Code. Built for senior ICs and tech leads who already know how to ship production code — and want a workflow that keeps the discipline tight without getting in the way.

15 skills, 8 agents, one philosophy: every claim has evidence. No "tests pass, trust me." No "it works in my IDE." No "I think the cache is stale." Skills produce artifacts you could paste into a code review.

What makes claudekit different

Rationalizations tables in every skill. The excuses an engineer makes to skip a step ("I see the problem, let me just patch it") are documented in the skill itself, with rebuttals. The skill refuses to be skipped silently.
Evidence requirements at every checkpoint. Each phase produces a specific artifact. If the artifact doesn't exist, the phase wasn't completed.
Pre-completion gates. verification-gate runs before any "done" claim — runs the tests, checks the negative path, exercises the change in a non-IDE environment, cross-checks the original ask.
No founder voice. No "ambitious vision," no "10x outcomes," no "delight." Engineering analogies, real file paths, real commands.
Plan-review pipeline as the headline. Two parallel reviewers (architecture + experience) score 5 sub-dimensions each, consolidate into one fix gate. Catches structural issues before code.
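
The plan-review consolidation described in the last point can be sketched as follows. The sub-dimension names come from the architect and experience-reviewer descriptions; the pass threshold of 7 and the rule that any sub-dimension below it blocks implementation are assumptions for illustration, since the README does not say how the fix gate decides.

```python
ARCHITECTURE = ("data flow", "failure modes", "edge cases",
                "test matrix", "rollback safety")
EXPERIENCE = ("information hierarchy", "state coverage", "accessibility",
              "DX ergonomics", "AI-slop avoidance")

PASS_THRESHOLD = 7  # hypothetical cutoff, not taken from claudekit


def fix_gate(arch_scores: dict[str, int],
             exp_scores: dict[str, int]) -> list[tuple[str, int]]:
    """Merge both reviewers' scores and return failing sub-dimensions, worst first."""
    for expected, scores in ((ARCHITECTURE, arch_scores), (EXPERIENCE, exp_scores)):
        if set(scores) != set(expected):
            raise ValueError(f"expected scores for exactly: {', '.join(expected)}")
        if not all(0 <= s <= 10 for s in scores.values()):
            raise ValueError("scores must be in the range 0-10")
    merged = {**arch_scores, **exp_scores}
    failing = [(name, score) for name, score in merged.items()
               if score < PASS_THRESHOLD]
    return sorted(failing, key=lambda item: item[1])


# Made-up scores for illustration only.
arch = {"data flow": 9, "failure modes": 4, "edge cases": 8,
        "test matrix": 7, "rollback safety": 6}
exp = {"information hierarchy": 8, "state coverage": 8, "accessibility": 5,
       "DX ergonomics": 8, "AI-slop avoidance": 8}

print(fix_gate(arch, exp))
# [('failure modes', 4), ('accessibility', 5), ('rollback safety', 6)]
```

An empty list would mean the plan clears the gate under these assumed rules.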

Install

/plugin marketplace add duthaho/claudekit-marketplace
/plugin install claudekit
/claudekit:init

/claudekit:init interactively scaffolds rules, hooks, and MCP server configs into your project's .claude/ directory. Output styles ship with the plugin and are auto-discovered by Claude Code (no init step required).

The 5-phase spine

Phase	Skills	What's enforced
Investigate	`investigate-root-cause`, `map-codebase`, `audit-dependencies`	Every claim about the system has a `<file:line>` citation. No memory-based assertions.
Design	`shape-spec`, `write-plan`, `plan-review`, `plan-review-architecture`, `plan-review-experience`	Plans have file paths, exact test commands, falsifiable acceptance criteria, named rollbacks. Reviewed before implementation.
Implement	`test-first`, `incremental-shipping`	Red-green-refactor with pasted runner output. Vertical slices behind feature flags. Refactors prove behavior preservation with test/perf deltas.
Verify	`verification-gate`, `evidence-driven-debugging`	Mandatory pre-completion gate. Active debugging keeps a paper trail.
Ship	`code-review-loop`, `release-and-changelog`	Reviewable PRs with verification evidence pasted. Atomic releases with diff-built changelogs.
Setup (off-spine)	`init`	One-time scaffolding wizard for project-level config.

All 15 skills are user-invocable as /claudekit:<name>.
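
The table above can be read as a plain mapping from phase to skills. A minimal sketch, with skill names copied from the table and the names `PHASES` and `slash_commands` chosen here for illustration:

```python
PHASES = {
    "Investigate": ["investigate-root-cause", "map-codebase", "audit-dependencies"],
    "Design": ["shape-spec", "write-plan", "plan-review",
               "plan-review-architecture", "plan-review-experience"],
    "Implement": ["test-first", "incremental-shipping"],
    "Verify": ["verification-gate", "evidence-driven-debugging"],
    "Ship": ["code-review-loop", "release-and-changelog"],
    "Setup (off-spine)": ["init"],
}


def slash_commands(phases: dict[str, list[str]]) -> list[str]:
    """Build the /claudekit:<name> invocation for every skill, in phase order."""
    return [f"/claudekit:{skill}" for skills in phases.values() for skill in skills]


commands = slash_commands(PHASES)
print(len(commands))  # 15
print(commands[0])    # /claudekit:investigate-root-cause
```

The count of 15 matches the skill total stated in the README, with `init` counted as the off-spine setup skill.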

Output styles (5)

Five Claude Code output styles ship with the plugin. They're auto-discovered by Claude Code — no init step required. Switch via /config or by setting outputStyle in .claude/settings.local.json.

Style	When to use
Brainstorm	Creative exploration — divergent thinking, multiple alternatives, structured trade-offs before any code
Deep Research	Thorough investigation — completeness over speed, evidence-cited findings with confidence levels
Implementation	Code-focused execution — minimal prose, action-oriented updates, follow established patterns
Review	Critical analysis — find issues first, severity-tagged findings, actionable suggestions
Token Efficient	Compressed output — minimal prose, code-first, no preambles

All styles use keep-coding-instructions: true, so Claude's default coding/testing/verification discipline still applies underneath.
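
For the settings-file route mentioned above, a minimal sketch that sets `outputStyle` in `.claude/settings.local.json` while preserving any keys already present. The helper name `set_output_style` is hypothetical, and the value `"Review"` assumes the style is referenced by its display name; the exact identifier Claude Code expects for a plugin-provided style may differ.

```python
import json
import tempfile
from pathlib import Path


def set_output_style(project_root: Path, style: str) -> dict:
    """Set `outputStyle` in .claude/settings.local.json, keeping other keys intact."""
    settings_path = project_root / ".claude" / "settings.local.json"
    settings = {}
    if settings_path.exists():
        settings = json.loads(settings_path.read_text(encoding="utf-8"))
    settings["outputStyle"] = style  # assumed value format, see lead-in
    settings_path.parent.mkdir(parents=True, exist_ok=True)
    settings_path.write_text(json.dumps(settings, indent=2) + "\n", encoding="utf-8")
    return settings


# Demonstrate against a throwaway directory rather than a real project.
with tempfile.TemporaryDirectory() as tmp:
    print(set_output_style(Path(tmp), "Review"))  # {'outputStyle': 'Review'}
```

Reading the file before writing matters here because `settings.local.json` may already hold unrelated local settings.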

The 8-agent roster

Each agent has a clear job and a named dispatcher; investigator and scout are each dispatched by two skills. No agent-bloat.

Agent	Job	Dispatched by
`claudekit:planner`	Decompose specs into executable plans	`write-plan`
`claudekit:architect`	Score architecture dimension of a plan	`plan-review-architecture`
`claudekit:experience-reviewer`	Score UX + DX dimension of a plan	`plan-review-experience`
`claudekit:investigator`	Root-cause investigation with evidence chain	`investigate-root-cause`, `evidence-driven-debugging`
`claudekit:tester`	Design and write tests with red-green discipline	`test-first`
`claudekit:code-reviewer`	Pre-merge structural review of diffs	`code-review-loop`
`claudekit:security-auditor`	OWASP-aligned review of sensitive paths	`code-review-loop` (sensitive paths)
`claudekit:scout`	Codebase mapping and dependency audits	`map-codebase`, `audit-dependencies`
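
The roster can also be read in the other direction, from skill to the agents it dispatches. A minimal sketch with the pairs copied from the table above; the names `DISPATCHERS` and `agents_for` are illustrative, not part of claudekit:

```python
DISPATCHERS = {
    "planner": ["write-plan"],
    "architect": ["plan-review-architecture"],
    "experience-reviewer": ["plan-review-experience"],
    "investigator": ["investigate-root-cause", "evidence-driven-debugging"],
    "tester": ["test-first"],
    "code-reviewer": ["code-review-loop"],
    # Per the table, dispatched by code-review-loop only for sensitive paths.
    "security-auditor": ["code-review-loop"],
    "scout": ["map-codebase", "audit-dependencies"],
}


def agents_for(skill: str) -> list[str]:
    """Return the agents a given skill dispatches, sorted by name."""
    return sorted(agent for agent, skills in DISPATCHERS.items() if skill in skills)


print(agents_for("code-review-loop"))  # ['code-reviewer', 'security-auditor']
print(agents_for("map-codebase"))      # ['scout']
```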

What `/claudekit:init` configures
