By gl11tchy
Spec-driven development with 25 specialist agents and fail-closed phase gates. Your agent can't mark its own homework.
Researches alternatives and competition during WannaBuild discovery. Compares direct competitors, adjacent solutions, existing libraries/tools, manual workflows, and do-nothing options.
Generates API documentation for WannaBuild document phase. Documents endpoints, functions, types, and contracts from the actual code and design spec.
Designs system architecture for WannaBuild design phase. Creates data models, API contracts, and architectural decisions with rationale.
Reviews code for architectural quality in WannaBuild review phase. Validates separation of concerns, design pattern usage, and compliance with the design spec.
Writes changelog entries for WannaBuild document phase. Follows Keep a Changelog format with semantic versioning.
Use when starting any conversation that touches software work — establishes how to find and use WannaBuild skills, requiring skill invocation before ANY response including clarifying questions.
Automatic repo-native WannaBuild full-loop workflow for natural "I want to build/add/change..." prompts and any WannaBuild phase skill entrypoint.
WannaBuild implementation phase entrypoint for implementing a concrete plan or task with focused verification and adaptive delegation.
WannaBuild debugging implementation entrypoint for reproducing, diagnosing, fixing, and verifying bugs within the full loop.
WannaBuild discovery phase entrypoint that grills the user — one question at a time, each with a recommended answer — to clarify vision, audience, flows, constraints, scope, and success signals before continuing the full loop. Also triggers on "grill me" or "grill this idea".
Uses power tools
Uses Bash, Write, or Edit tools
Own this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimOwn this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimBased on adoption, maintenance, documentation, and repository signals. Not a security audit or endorsement.
Your agent can't mark its own homework.
Spec-driven development for coding agents — with ship gates that are programs, not prose.
Quickstart · Why · Trust Model · Workflow · Install · Usage · Artifacts
You ask an agent to build something. It writes code fast, tells you the tests pass, and hands you a confident summary. Then you open the diff: a half-implemented feature, tests that never ran, a "QA report" that is a paragraph of vibes. You got burned, so you tried an agent framework — and found that almost every one of them enforces its discipline with markdown files the model can rationalize past.
A markdown file cannot fail a build.
WannaBuild runs the loop a serious team would run — Discover → Plan → Implement → Validate → QA → Summary — and enforces the parts that decide shipping with a runtime, not a prompt:
wb-runtime record-test-evidence executes your integration test command itself and
signs the record with a key kept outside the project tree. The review and QA
gates verify the signature, the exit code, that the spec hasn't changed
since the run, and that the command matches your config. The model never
gets to "report" that tests passed.assert-plan-ready). No ship without a unanimous reviewer set and
verified QA evidence (assert-summary-ready). Gates are compiled checks
that exit non-zero — when they cannot run, that is a stop, not a pass.scripts/validate-wannabuild-dry-runs.sh and watch the forgery get
rejected.
Needs the
wb-runtimebinary (a Codex install puts it onPATH; otherwisecargo build --release). Binary-less Claude/Factory installs enforce the same gates through the Python hook mirror (record-test-evidence/verify-test-evidence); to reproduce theassert-*check below without the binary, runscripts/wannabuild-gate-check.sh. Run it from a project where.wannabuild/is initialized and you are at QA.
# Hand the gate a perfect-looking forged verdict: 100 passing tests, every criterion covered,
# plus a clean QA summary — exactly what a lazy agent would write instead of running tests.
cat > .wannabuild/review/wb-integration-tester-iter-1.json <<'EOF'
{"agent":"wb-integration-tester","status":"PASS","hard_gate":true,"summary":"all green","issues":[],
"test_execution":{"total":100,"passed":100,"failed":0,"errored":0,"duration_ms":5000},
"coverage_map":[{"criterion":"login works","status":"covered"}]}
EOF
printf '# QA\n\nstatus: PASS\nacceptance: covered\nintegration: covered\n' > .wannabuild/outputs/qa-summary.md
wb-runtime assert-qa-ready --project .
# QA gate failed: no runtime-recorded execution evidence for iteration 1
# (.wannabuild/review/wb-integration-tester-iter-1.evidence.json missing); run
# `wb-runtime record-test-evidence` — the runtime executes the integration test
# command itself and signs the result, so a hand-written verdict cannot pass this gate
The only path to a green QA gate is the runtime actually running your tests. Edit the recorded evidence afterward — one character — and the signature check kills it.
npx claudepluginhub gl11tchy/wannabuild --plugin wannabuildStructured project planning and execution through brainstorm, spec, and build phases across three execution tiers: sequential, delegated sub-agents, and full agent teams
Provider-agnostic skills for autonomous product development: spec, task, implement, test, review, and summarize changes.
GSD Core is a meta-prompting, context engineering, and spec-driven development system for AI coding agents.
A curated set of skills for each stage of development — propose, spec, design, plan, implement, ship.
Multi-agent software evolution. Autonomously study, strategize, build, review, and evaluate code changes using specialist AI agents.
Plan and autonomously build a software task end-to-end. Recons the codebase, applies preloaded memory, decomposes into the right number of phases, gets one confirmation, then prepares a single ready-to-paste /goal command — one paste between you and done — that drives execution to completion with built-in retry, fix-spec recovery, and per-phase memory writeback. Works on Claude Code and Codex.