Skill

dev-process

THE SINGLE ORCHESTRATOR for all implementation work. This skill should be used ANY TIME the user asks to implement, build, or execute a plan that involves multi-file changes or non-trivial features. This is the DEFAULT workflow for all implementation work beyond single-file fixes. You do NOT need to be explicitly told to use it — if the task involves architecture, multiple files, agent teams, or a plan to execute, USE THIS SKILL. This skill orchestrates sub-skills — do NOT use them independently for implementation work: - `brainstorming` — design exploration (Phase 1) - `test-driven-development` — TDD per requirement (Phase 3) - `subagent-driven-development` — parallel execution engine (Phase 3) - `verification-before-completion` — verification enforcement (Phase 5) It also integrates with beads for persistent requirement tracking. Every requirement becomes a beads issue with acceptance criteria, tracked from creation through verified closure. Examples of when this skill activates: - "Go implement this" - "Build this feature" - "Execute this plan" - "Implement the architecture" - "Go build it" - "Use the dev process" - "Follow the standard pipeline" - "Implement this" - "Build this" - Any request to implement a PRD, epic, or multi-step plan

Popularity

Parent stars

Invocation

How this skill is triggered — by the user, by Claude, or both

Slash command

/dev-process:dev-process

User invocable

Model invocable

Inline context

Default effort

Context Preview

The summary Claude sees in its skill listing — used to decide when to auto-load this skill

Spec-driven, gate-reviewed development pipeline for rigorous implementation.

Supporting Files

references/architecture-template.mdreferences/task-template.md

SKILL.md

517 lines · ~4.8k tokens

Stats

LanguagePython

Parent stars1

MaintenanceGood

Last CommitMar 31, 2026

Actions

View Source View Plugin View on GitHub View README

Stats

Actions

Structured Development Process

Spec-driven, gate-reviewed development pipeline for rigorous implementation.

When to Use

Multi-file features or fixes
Work that needs architecture before coding
Changes requiring security or compliance review
Any work where "just code it" leads to rework

When NOT to Use

Single-file bug fixes
Quick config changes
Pure research/exploration tasks

The Pipeline

Phase 1: Spec → Phase 2: Spec Review → Phase 3: Implement → Phase 4: Code Review → Phase 5: Test → Phase 6: Verdict

Phase 1: Architecture Spec

Write a detailed spec BEFORE any code. This phase uses parallel codebase exploration to inform the design.

Step 0a: Design Exploration (When Starting From Scratch)

If the feature is greenfield or the approach is unclear, invoke the brainstorming skill first for design exploration. Let brainstorming run its full cycle (diverge → converge → decide), then carry the chosen approach into the spec below. Skip this if you already have a clear PRD or plan to execute.

Step 0b: Verify Current State (MANDATORY)

Before writing ANY spec, you MUST verify the current state of the system you're targeting. Specs written from memory are specs written against fiction. This step has been skipped in 5 out of 5 recent incidents, causing features to target dead code paths every time.

Run these checks and include the OUTPUT in your spec:

Trace the actual data flow — open the app, check browser DevTools / network requests, or grep the codebase to find which endpoints and code paths are ACTUALLY used
Verify the target files exist and are active — grep -r for imports, route registrations, or function calls that prove the code you're targeting is live (not legacy)
Check for recent changes — git log --oneline -10 [target files] to see if the area was recently refactored (your mental model may be stale)

Include a ## Current State Verification section in every spec with actual command output. If you cannot produce this evidence, you are not ready to write the spec.

Step 1: Understand the Requirements

Read and extract requirements from the source (PRD, epic, issue, or user request):

Goal and acceptance criteria
Scope boundaries (in/out)
Constraints and requirements

Step 2: Explore Codebase (Parallel Subagents)

Launch 3 Explore subagents in parallel using the Task tool:

Task 1 - Pattern Discovery:
"Find existing features similar to [feature type]. Look for patterns, file structures,
and conventions to follow. Output: Reference files to use as templates."

Task 2 - Integration Points:
"Find where [feature] would integrate with existing code. Look for APIs, data models,
entry points. Output: Files to modify, connection points."

Task 3 - Test Patterns:
"Find how tests are structured in this codebase. Look for unit, integration, e2e patterns.
Output: Test file locations, testing conventions."

Wait for all 3 to complete, then synthesize findings.

Step 3: Design Architecture

Launch a Plan subagent with the requirements and exploration findings. The subagent outputs:

Technical approach (how to solve it)
Patterns to follow (reference files)
Implementation legs (each sized for one agent session)

Step 4: Write the Spec

The spec MUST include:

# [Feature/Fix Name] — Architecture Spec

## Goal
[1-2 sentences: what we're building and why]

## Scope
- **In scope**: [list]
- **Out of scope**: [list]

## Reference Patterns
| Purpose | File | Notes |
|---------|------|-------|
| Similar feature | src/path/to/file | Follow this structure |
| Test pattern | tests/path/to/test | Use this testing approach |

## Files to Modify/Create
| File | Action | Purpose |
|------|--------|---------|
| path/to/file | Create/Modify | What changes |

## Technical Design

### [Component 1]
- **Goal**: [what this component does]
- **Logic**: [how it works, pseudocode or description]
- **Types**: [new types/interfaces needed]
- **Edge cases**: [what could go wrong]

### [Component 2]
[repeat pattern]

## Implementation Legs

### Leg 1: [Name]
[Brief description of what this leg accomplishes]

### Leg 2: [Name]
[Brief description]

## Dependencies & Parallelization

Leg 1 ──┬──> Leg 3 │ Leg 2 ──┘


[Explain what can run in parallel and what's sequential]

## Current State Verification
[MANDATORY — actual command output proving this spec targets live code paths]
- Data flow trace: [which endpoints/functions are actually called]
- Target file verification: [grep output showing files are imported/used]
- Recent changes: [git log of target area]

## Dependency Verification
[MANDATORY — for each external dependency referenced in the spec]
| Dependency | Assumed Capability | Verified At Source? | Source Path |
|---|---|---|---|
| @pkg/name | method X exists | YES/NO | node_modules/@pkg/name/dist/file.js:L123 |
[Phase 2 reviewer will REJECT specs without this section. The GATE beads issue
cannot be closed without VERIFIED: evidence citing these source paths.]

## Acceptance Criteria (Executable)
[MANDATORY — each criterion is a testable assertion, not prose]
For each criterion, write it as: "When [action], then [observable result]"
Example:
- When user loads conversation view, DOM contains `.tool-group` elements (count > 0)
- When user clicks tool group header, expanded content becomes visible
These MUST be convertible to Playwright assertions. Prose-only criteria are rejected at review.

## Test Requirements

For each component:
- Unit tests: [count and description]
- Integration tests: [if applicable]
- E2E tests: [Playwright tests that verify acceptance criteria against REAL data, not mocks]

## Security Considerations
[If applicable: bypass vectors, injection risks, access control]

Save the spec as a doc in the project's docs/ folder or as a task in your tracking system.

See references/architecture-template.md for a lighter-weight template.

Step 5: Create Beads Issues From Requirements

After the spec is approved, create a beads issue for EACH requirement/acceptance criterion:

bd create --title='R1: [requirement name]' --description='[acceptance criteria — testable assertion]' --type=feature
bd create --title='R2: [requirement name]' --description='[acceptance criteria]' --type=feature
# ... one per requirement

# MANDATORY: Create the dependency verification gate
bd create --title='GATE: Dependencies verified at source' \
  --description='Verify all external dependency assumptions against actual library source (node_modules/), not wrapper code. Close with VERIFIED: evidence citing source file paths.' \
  --type=task --priority=1

This creates the tracking trail. Every requirement gets a beads issue that must be individually verified and closed before the feature can ship. The GATE issue enforces that dependency assumptions were checked against real library source — not just our wrapper code. Do NOT skip this step.

The GATE issue is NOT optional. The verify-before-close hook will BLOCK closing any R-prefixed requirement issue unless a GATE: issue has been closed with VERIFIED: evidence. If you skip creating the GATE issue, you will be blocked at close time. Three separate agents read wrapper code instead of library source and missed a critical API — this gate prevents that (CAPA-8).

Phase 2: Spec Review Gate

STOP. Do not proceed to implementation without review.

Launch a Plan subagent to review the spec:

"Review this architecture spec for completeness. Check:
1. Are all files identified? Any missing?
2. Are edge cases covered?
3. Is the test plan adequate?
4. Are dependencies correctly mapped?
5. Any security concerns missed?
6. Are dependency assumptions verified AT SOURCE? For each external dependency
   referenced in the spec, was the ACTUAL library source (node_modules/pkg/dist/*)
   inspected — not just our wrapper code? Are source file paths cited as evidence?
   If the spec references dependency capabilities without citing library source → REJECT.
Output: APPROVE with notes, or REJECT with specific issues."

If REJECTED: fix the spec and re-review. If APPROVED: complete the dependency verification gate before proceeding to Phase 3.

Dependency Verification (Close the GATE Issue)

After spec approval, you MUST close the GATE issue before starting implementation. The verify-before-close hook will block closing ANY R-prefixed requirement issue until the GATE issue is closed. This is architectural enforcement, not a suggestion.

For each external dependency referenced in the spec:

Read the actual library source — node_modules/pkg/dist/*, NOT your wrapper code. If the dependency is inside a container, exec in and read the source there.
List the methods/APIs you found — what does the library actually expose?
Compare to spec assumptions — does the library support what the spec assumes?
Look for capabilities the spec missed — are there better-suited APIs you didn't know about?

Then close the GATE issue with evidence:

bd update <gate-id> --notes='VERIFIED: [date] | Dependencies checked: [list] | Source paths: [node_modules/ paths where capabilities confirmed]'
bd close <gate-id>

If the library does NOT support what the spec assumed → update the spec and re-review.

Phase 3: Implementation

Use agent teams for parallel work. Follow the dependency diagram from the spec.

Sub-Skill Integration

Use the test-driven-development skill for each requirement. Write the test first, then the implementation. This is not optional — every requirement gets TDD treatment.
Use the subagent-driven-development skill to parallelize work across independent requirements. Each implementation leg runs as a subagent with isolated scope.

Progress Tracking

After each requirement is implemented (code + tests written):

Update its beads issue: bd update <id> --status=in_progress
Git checkpoint: commit the requirement with a clear message (e.g., "Implement R3: [name]")
Do NOT batch commits — one commit per requirement keeps the trail clean

Team Setup

Create a team (if not already in one)
Create tasks for each implementation leg
Set up dependencies between tasks
Spawn implementation agents for each independent leg

Implementation Agent Instructions Template

Implement [Leg Name] per the architecture spec.

Spec location: [path]
Your scope: [specific section of the spec]

Rules:
- Follow the spec exactly — don't improvise
- Write tests alongside code (TDD preferred)
- Commit when your leg is complete
- Report: what you built, test results, any deviations from spec

Parallelization Rules

Independent legs run simultaneously
Dependent legs wait for blockers to complete
Each agent gets isolated scope — no overlapping file edits
If two legs touch the same file, they must be sequential

Phase 4: Code Review Gate

STOP. All implementation must be reviewed before merging.

Launch a code review agent:

"Review the implementation against the architecture spec.

Spec: [path]
Changes: [git diff or file list]

Check per file:
1. Spec compliance — does it match the design?
2. Code quality — clean, readable, no dead code
3. Test coverage — are all spec'd tests present and passing?
4. Security — any bypass vectors or injection risks?

Output format:
### [filename]
| Requirement | Status |
|-------------|--------|
| [from spec] | PASS / FAIL |

### Security Review
[explicit checks]

### Verdict
APPROVE / REJECT
Severity: [CRITICAL: X, HIGH: X, MEDIUM: X, LOW: X]"

If REJECTED: fix issues and re-review. If APPROVED: proceed to Phase 5.

Phase 5: Testing + Independent Verification

Invoke the verification-before-completion skill to enforce this entire phase. Do not skip it or claim verification happened without running through the skill's checklist.

5a: Run Tests (Builder)

Unit tests: Run the full test suite — report BEFORE and AFTER counts (e.g., "was 118, now 130" — if count didn't increase, new tests weren't added or aren't running)
Integration tests: If applicable
Regression check: Ensure existing functionality isn't broken

5b: Independent Verification (NOT the Builder)

The agent who built it CANNOT verify it. This is non-negotiable. In 5 out of 5 recent incidents, self-certification led to shipping broken features.

Spawn a separate verification agent (use the independent-verifier agent if available, or launch a fresh Agent with read-only + Playwright access):

"Verify this feature works from the USER's perspective.

Spec: [path to spec with acceptance criteria]
App URL: [the URL the user actually uses]

For EACH acceptance criterion in the spec:
1. Load the app the way the user loads it (not a test harness)
2. Perform the user action described
3. Check the observable result matches the criterion
4. Take a screenshot as evidence

Report:
| Acceptance Criterion | Result | Evidence |
|---------------------|--------|----------|
| [criterion] | PASS/FAIL | [screenshot or DOM check] |

You have NO access to edit code. You can only read, browse, and report.
If ANY criterion fails, report FAIL — do not attempt to fix."

The independent verifier also updates beads as it verifies each requirement:

# For each verified requirement:
bd close <id> --reason='Verified: [evidence description — screenshot path or DOM assertion result]'

If the verifier reports FAIL → return to Phase 3 with the failure details. If the verifier reports PASS → proceed to Phase 5c (if plugin work) or Phase 6.

5c: Plugin Verification (If Work Involves Plugins)

Publishing a plugin to a marketplace and verifying files exist is NOT verification. The plugin must actually LOAD in a fresh Claude Code session. This step has been missed repeatedly — "pushed to GitHub" was treated as "done" when the plugin wasn't even installed.

Skip this section if the work doesn't involve Claude Code plugins. For plugin work:

For NEW plugins (not yet installed):

Launch a fresh Claude Code session (use the tmux skill if available):

SESSION="agent-verify-plugin-$(openssl rand -hex 2)"
tmux new-session -d -s "$SESSION" -c <workdir> "claude --dangerously-skip-permissions"

Install the plugin: send /plugin install <name> to the session
Reload: send /reload-plugins
Verify the skill appears: ask the session to list skills containing the plugin name
Clean up the tmux session

For UPDATED plugins (version bump):

Launch a fresh Claude Code session (new sessions fetch updated marketplace)
Run /doctor — check for plugin errors
Run /skills — verify the skill is listed with the correct plugin name
Invoke the skill with a test prompt to confirm it activates correctly
Clean up the tmux session

Verification evidence must include:

The /skills or skill list output showing the plugin loaded
Or the skill activation output showing it triggered correctly
Plugin version confirmed (cache shows correct version)

Do NOT accept "marketplace.json updated" or "files pushed" as verification. The plugin must load.

Phase 6: Verdict & Ship

Beads Gate (MANDATORY)

Before evaluating the pre-ship checklist, check beads:

bd list --status=open

If ANY beads issues for this feature remain open, you are NOT done. Do NOT claim completion. Go back to the phase where the open requirement needs work.

Pre-ship checklist (all must be YES):

Spec includes Current State Verification with command output?
Acceptance criteria are executable (not prose)?
Test count increased? (before: ___, after: ___)
Independent verifier (not builder) confirmed feature works?
Verifier evidence (screenshots/DOM checks) included in report?
All beads issues for this feature are closed? (bd list --status=open returns none)

If all YES:

Final commit with clean message
Close/update completed tasks in your tracking system
Save learnings to claude-mem for future reference

Present beads summary to user as proof of completion:

bd list  # Show all issues — should all be closed with verification evidence

Report summary to user WITH verifier evidence AND beads trail attached

If any NO:

Document what failed and why
Create fix tasks
Loop back to the appropriate phase

Quick Reference

Phase	Gate	Who	Output
1. Spec	Current state verified?	Lead + Explore agents	Architecture doc with evidence
2. Review	Spec complete? Criteria executable?	Plan agent	APPROVE/REJECT
3. Implement	—	Implementation agents	Code + tests
4. Review	Spec compliant?	Review agent	APPROVE/REJECT
5a. Test	Tests pass? Count increased?	Builder	Test results
5b. Verify	Feature works for user?	Independent verifier (NOT builder)	Evidence
6. Ship	All gates + verifier evidence?	Lead	Done or iterate

Tips

Don't skip gates even when it feels like overkill — the 10 minutes saved by skipping review costs hours of rework
Spec changes during implementation are OK but must be documented and re-reviewed
Small scope is better — break large features into multiple dev-process cycles
Claude CLI for LLM calls: Use CLAUDECODE= claude --print not Anthropic SDK

Additional Resources

Reference Files

references/architecture-template.md — Lighter-weight architecture spec template
references/task-template.md — Task description format with step-by-step instructions

dev-process

Popularity

Invocation

Context Preview

Supporting Files

SKILL.md

dev-process

Popularity

Invocation

Context Preview

Supporting Files

SKILL.md

Structured Development Process

When to Use

When NOT to Use

The Pipeline

Phase 1: Architecture Spec

Step 0a: Design Exploration (When Starting From Scratch)

Step 0b: Verify Current State (MANDATORY)

Step 1: Understand the Requirements

Step 2: Explore Codebase (Parallel Subagents)

Step 3: Design Architecture

Step 4: Write the Spec

Step 5: Create Beads Issues From Requirements

Phase 2: Spec Review Gate

Dependency Verification (Close the GATE Issue)

Phase 3: Implementation

Sub-Skill Integration

Progress Tracking

Team Setup

Implementation Agent Instructions Template

Parallelization Rules

Phase 4: Code Review Gate

Phase 5: Testing + Independent Verification

5a: Run Tests (Builder)

5b: Independent Verification (NOT the Builder)

5c: Plugin Verification (If Work Involves Plugins)

Phase 6: Verdict & Ship

Beads Gate (MANDATORY)

Pre-ship checklist (all must be YES):

Quick Reference

Tips

Additional Resources

Reference Files

Similar Skills

Structured Development Process

When to Use

When NOT to Use

The Pipeline

Phase 1: Architecture Spec

Step 0a: Design Exploration (When Starting From Scratch)

Step 0b: Verify Current State (MANDATORY)

Step 1: Understand the Requirements

Step 2: Explore Codebase (Parallel Subagents)

Step 3: Design Architecture

Step 4: Write the Spec

Step 5: Create Beads Issues From Requirements

Phase 2: Spec Review Gate

Dependency Verification (Close the GATE Issue)

Phase 3: Implementation

Sub-Skill Integration

Progress Tracking

Team Setup

Implementation Agent Instructions Template

Parallelization Rules

Phase 4: Code Review Gate

Phase 5: Testing + Independent Verification

5a: Run Tests (Builder)

5b: Independent Verification (NOT the Builder)

5c: Plugin Verification (If Work Involves Plugins)

Phase 6: Verdict & Ship

Beads Gate (MANDATORY)

Pre-ship checklist (all must be YES):

Quick Reference

Tips

Additional Resources

Reference Files

Similar Skills