Name: sd0x-dev-flow
Author: sd0xdev

Stats

Actions

Available In

Tags

sd0x-dev-flow

The harness layer for Claude Code.

Quality gates that AI can't skip. A reference implementation of AI Agent Harness Engineering for Claude Code — hook-enforced dual review, state-machine gates that survive context compaction, and fail-closed safety where it counts.

96 bundled · 96 public skills · 15 agents — ~4% of Claude's context window

What This Harness Does

Harness engineering is the discipline of engineering everything around the LLM — tool loops, context management, hooks, state machines, safety layers — as opposed to training the model itself. Mitchell Hashimoto coined the term in Feb 2026; Anthropic engineering and Martin Fowler have published on it; arXiv 2603.05344 formalizes it.

sd0x-dev-flow is a reference implementation. Each row below maps a canonical harness sub-problem to concrete code you can study:

Harness sub-problem

sd0x-dev-flow implementation

Code evidence

Tool loop control

/codex-review-fast → /precommit auto-loop with sentinel-driven transitions

rules/auto-loop.md + hooks/post-tool-review-state.sh

Sentinel-driven state machine

✅ Ready / ⛔ Blocked / ✅ All Pass gate markers parsed into durable state

scripts/emit-review-gate.sh (producer) + hooks/post-tool-review-state.sh (parser)

Context recovery across compaction

[AUTO_LOOP_RESUME] stdout injection after SessionStart(compact)

hooks/post-compact-auto-loop.sh

Lifecycle interceptors

5 hook event types dispatched to 8 scripts: PreToolUse / PostToolUse / Stop / SessionStart / UserPromptSubmit

hooks/ (8 scripts) + .claude/settings.json

Capability-based tool gating

Skill frontmatter allowed-tools — e.g., /ask has no Edit/Write

86 of 95 public skills declare allowed-tools

Defense-in-depth safety

5 layers: pre-edit-guard → commit-msg-guard → pre-push-gate → stop-guard → sidecar fail-closed marker

scripts/pre-push-gate.sh + scripts/commit-msg-guard.sh + hooks/stop-guard.sh

Generator-evaluator split

Dual review: Codex (primary) + Claude (secondary) dispatched in parallel on every review cycle

rules/codex-invocation.md + rules/auto-loop.md (Dual Review Mode)

Incremental progress tracking

iteration_history.current_round + max_rounds + convergence plateau detection

rules/auto-loop.md (exit conditions + strategic reset)

Human-in-the-loop safety gates

/dev/tty confirmation + AskUserQuestion for destructive ops

scripts/pre-push-gate.sh + skills/push-ci/SKILL.md

Self-improvement loop

Correction → record lesson → promote to rule after 3+ recurrences

rules/self-improvement.md

Most harness projects cover 2–4 of these. sd0x-dev-flow covers all 10 — which makes the code useful as a study target, not just a tool.

Why sd0x-dev-flow?

Without guardrails

With sd0x-dev-flow

AI skips review when context is long

Hook-enforced: stop-guard blocks incomplete reviews

Single reviewer misses issues

Dual dispatch: Codex + secondary in parallel

"Fixed it" without re-verification

Auto-loop: fix → re-review → pass → continue

Review state lost after compact

State tracking: SessionStart hook re-injects

Quick Start

# Install plugin /plugin marketplace add sd0xdev/sd0x-dev-flow /plugin install sd0x-dev-flow@sd0xdev-marketplace # Configure your project /project-setup

One command auto-detects framework, package manager, database, entrypoints, and scripts. Installs a subset of rules and hooks; the full plugin bundles 14 rules + 9 hooks.

sd0x-dev-flow

The harness layer for Claude Code.

96 bundled · 96 public skills · 15 agents — ~4% of Claude's context window

What This Harness Does

Harness engineering is the discipline of engineering everything around the LLM — tool loops, context management, hooks, state machines, safety layers — as opposed to training the model itself. Mitchell Hashimoto coined the term in Feb 2026; Anthropic engineering and Martin Fowler have published on it; arXiv 2603.05344 formalizes it.

sd0x-dev-flow is a reference implementation. Each row below maps a canonical harness sub-problem to concrete code you can study:

#	Harness sub-problem	sd0x-dev-flow implementation	Code evidence
1	Tool loop control	`/codex-review-fast` → `/precommit` auto-loop with sentinel-driven transitions	`rules/auto-loop.md` + `hooks/post-tool-review-state.sh`
2	Sentinel-driven state machine	`✅ Ready` / `⛔ Blocked` / `✅ All Pass` gate markers parsed into durable state	`scripts/emit-review-gate.sh` (producer) + `hooks/post-tool-review-state.sh` (parser)
3	Context recovery across compaction	`[AUTO_LOOP_RESUME]` stdout injection after SessionStart(compact)	`hooks/post-compact-auto-loop.sh`
4	Lifecycle interceptors	5 hook event types dispatched to 8 scripts: PreToolUse / PostToolUse / Stop / SessionStart / UserPromptSubmit	`hooks/` (8 scripts) + `.claude/settings.json`
5	Capability-based tool gating	Skill frontmatter `allowed-tools` — e.g., `/ask` has no Edit/Write	86 of 95 public skills declare `allowed-tools`
6	Defense-in-depth safety	5 layers: pre-edit-guard → commit-msg-guard → pre-push-gate → stop-guard → sidecar fail-closed marker	`scripts/pre-push-gate.sh` + `scripts/commit-msg-guard.sh` + `hooks/stop-guard.sh`
7	Generator-evaluator split	Dual review: Codex (primary) + Claude (secondary) dispatched in parallel on every review cycle	`rules/codex-invocation.md` + `rules/auto-loop.md` (Dual Review Mode)
8	Incremental progress tracking	`iteration_history.current_round` + `max_rounds` + convergence plateau detection	`rules/auto-loop.md` (exit conditions + strategic reset)
9	Human-in-the-loop safety gates	`/dev/tty` confirmation + `AskUserQuestion` for destructive ops	`scripts/pre-push-gate.sh` + `skills/push-ci/SKILL.md`
10	Self-improvement loop	Correction → record lesson → promote to rule after 3+ recurrences	`rules/self-improvement.md`

Most harness projects cover 2–4 of these. sd0x-dev-flow covers all 10 — which makes the code useful as a study target, not just a tool.

Why sd0x-dev-flow?

Without guardrails	With sd0x-dev-flow
AI skips review when context is long	Hook-enforced: stop-guard blocks incomplete reviews
Single reviewer misses issues	Dual dispatch: Codex + secondary in parallel
"Fixed it" without re-verification	Auto-loop: fix → re-review → pass → continue
Review state lost after compact	State tracking: SessionStart hook re-injects

Quick Start

# Install plugin
/plugin marketplace add sd0xdev/sd0x-dev-flow
/plugin install sd0x-dev-flow@sd0xdev-marketplace

# Configure your project
/project-setup

One command auto-detects framework, package manager, database, entrypoints, and scripts. Installs a subset of rules and hooks; the full plugin bundles 14 rules + 9 hooks.

sd0x-dev-flow

Popularity

What's Inside

Confidence

README

sd0x-dev-flow

What This Harness Does

Why sd0x-dev-flow?

Quick Start

Similar Plugins

cwf

ai-workflow

agentic-dev-team

claudekit

harness-flow

claude-combine

sd0x-dev-flow

What This Harness Does

Why sd0x-dev-flow?

Quick Start

Popularity

Health & Quality

Similar Plugins

cwf

ai-workflow

agentic-dev-team

claudekit

harness-flow

claude-combine