By rlpatrao
GAN-inspired autonomous SDLC scaffold (BRD v3.0): initializer/coding-agent split, feature_list.json contract, mandatory browser-automation E2E gate, Ralph Loop exit interception, per-workflow LLM routing, Plan Mode subagent, Extended ReAct, adaptive context compaction, tree-structured sessions, spec-gap backprop, monotonic-improvement guards, instinct extraction, YAML recipes. Vendor-first reuse mandate per brd/v3.0.md §10.
Interactive stack interrogation, design artifact generation, decision verification, and learnings persistence. Runs after BRD approval, before spec decomposition.
Autonomous build loop with Karpathy ratcheting, GAN evaluator, browser console capture, UI standards review, 8-gate ratchet, session chaining, and cross-project learnings.
Label the current path in the session tree (BRD §4.5) so it can be retrieved later via /tree <label>.
Create a Business Requirements Document through Socratic five-dimension interview. First step of the SDLC pipeline, before spec/design/build.
Full 12-phase SDLC pipeline. BRD → Architect → Spec → Design → Observe → Comply → Initialize → Auto (11 gates) → Post-build. Human gates on phases 1-4.
Interactive technical design partner. Conducts stack interrogation informed by BRD context, challenges weak decisions, generates machine-readable design artifacts, verifies completeness, and persists decisions for cross-project reuse.
Collaborates with the human to create Business Requirements Documents through Socratic dialogue with 5-dimension exploration, alternatives analysis, and engineer self-audit.
Reviews code for quality, architecture compliance, test coverage, and story traceability.
Per-session feature worker. Runs every session after the Initializer has set up the project. Follows the fixed 8-step startup sequence enforced by hooks/session-start.js. Works exactly one feature_list.json entry per session.
Summarizes session transcripts for BRD §4.3 compaction stages 3-5. Uses Haiku for cost. Read+summarize only. Spawned by hooks/compaction-stage.js when budget thresholds are crossed.
UX patterns for agentic AI applications — intent preview, autonomy dial, confidence signals, audit trails, escalation, streaming, multi-agent dashboards, and error recovery.
Design the system architecture including layered dependencies, API contracts (endpoints, schemas, errors), data models, folder structure, and deployment topology. Output a detailed design document to `specs/design/` with all decisions justified.
Interactive stack interrogation, design artifact generation, decision verification, and learnings persistence. Runs after BRD approval, before spec decomposition.
Autonomous build loop with Karpathy ratcheting, GAN evaluator, browser console capture, UI standards review, 8-gate ratchet, session chaining, and cross-project learnings. Iterates story groups until all features pass or stopping criteria met.
Create a Business Requirements Document through Socratic five-dimension dialogue with the human. First step of the SDLC pipeline, before spec/design/build. Supports greenfield projects or single-feature additions.
Uses power tools
Uses Bash, Write, or Edit tools
Own this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimOwn this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimBased on adoption, maintenance, documentation, and repository signals. Not a security audit or endorsement.
A Claude Code plugin that builds software the way a well-run engineering team would -- from requirements to production, with independent verification at every step.
v3.0 (May 2026) is the current spec. It retrofits the v2.0 pipeline with: initializer/coding-agent split,
feature_list.jsonas the project-completion contract, mandatory browser-automation E2E gate, Ralph Loop exit interception, per-workflow LLM routing, Plan Mode subagent, Extended ReAct, 5-stage adaptive compaction, spec-gap backpropagation, monotonic-improvement guards, instinct extraction, tree-structured sessions, and YAML recipes. Full spec:brd/v3.0.md. Operational plan:brd/v3.0-implementation-plan.md. Live punch list:feature_list.json. 19 agents · 27 hooks · 47 skills · 23 commands.
You describe what you want to build. The forge runs 19 specialized agents through a 9-phase pipeline (v3.0 §7): gathering requirements through Socratic interview, challenging your architecture decisions, decomposing work into stories, generating code with parallel agent teams, and verifying everything by actually running the application. Not by reading the code and saying "looks good."
One command starts it. Human approval gates the creative decisions (BRD, architecture, design). Everything after that -- implementation, testing, verification, self-healing -- runs autonomously, bounded by the feature_list.json contract.
# Load as plugin and scaffold
claude --plugin-dir ~/claude-harness-forge
> /scaffold
# Run the full pipeline
> /build
The forge doesn't just generate code -- it runs your app, hits your API endpoints, drives a browser through Playwright, and checks for console errors. A 200 response with "Failed to connect" in the body is a failure. An empty list when data should exist is a failure. If something breaks, it diagnoses the issue, fixes it, and re-verifies -- up to 3 attempts per gate before escalating.
Tests pass. The app crashes. This is the most common failure mode in AI-generated code, and the forge addresses it structurally:
expect(true).toBe(true)), inflating coverage with dead code. Also cannot be disabled.The forge has multiple feedback loops that compound over time:
/change logs them with version tracking, runs impact analysis, and cascades updates through only the affected stories, design, and code.The architect analyzes your requirements and activates only what's relevant:
| Project Type | What Activates |
|---|---|
| CRUD | Standard architecture review, gates 1-8 |
| ML | + ML pipeline design, compliance gate, model cards, bias/fairness audits |
| Agentic | + Agentic architecture round, OWASP Agentic Top 10, agentic UX patterns |
| RAG | + RAG scaffolding, vector DB selection, chunking/embedding guidance |
Projects can match multiple types. The forge also supports 4 execution modes -- from Full (all 12 gates, production-grade) to Solo (3 gates, weekend projects) -- so you control cost and rigor.
For large story groups, the generator spawns parallel sub-agents that each own a slice of work. A dependency handshake identifies shared files before work begins, preventing merge conflicts. Each teammate gets the full context: learned rules, architecture constraints, and test requirements.
npx claudepluginhub rlpatrao/claude_harness_forge15-agent AI engineering team for Claude Code. Full sprint cycle: Think, Plan, Build, Review, Test, Ship, Reflect. HITL checkpoints, handoff contracts, eval system, and 3 sprint modes.
The only Claude Code plugin that verifies AI-generated code against its own design specs.
Describe your goal, approve the spec, then step away — Claude and Codex loop together until it's right.
Autonomous multi-agent development framework with spec-driven sprints and convergent iteration
Structured project planning and execution through brainstorm, spec, and build phases across three execution tiers: sequential, delegated sub-agents, and full agent teams
Production-ready Claude Code configuration with role-based workflows (PM→Lead→Designer→Dev→QA), safety hooks, 44 commands, 19 skills, 8 agents, 43 rules, 30 hook scripts across 19 events, auto-learning pipeline, hook profiles, and multi-language coding standards