By DMokong
Speculator — spec quality scoring and a 7-gate pipeline (4 required + 3 opt-in: eval intent, eval quality, comprehension) with LLM-as-judge evaluation, git worktree isolation, and beads issue tracking. Quality In, Quality Out.
Reviews implementation code against a spec and produces Gate 3 evidence — evaluates correctness, error handling, readability, security, performance, and spec alignment using a 6-point checklist. Invoked by /sdlc run during Gate 3 or directly by the gate-check skill.
Generates a per-AC comprehension artifact by reading the spec and the diff cold (no access to the implementing agent's reasoning), then scores the artifact against the comprehension rubric — 4 dimensions (AC Coverage, Accuracy, Spec Fidelity, Scope Containment) — and writes the Gate 2c evidence file. Invoked by /sdlc gate comprehension and by the sdlc-run pipeline Phase 3b.
Evaluates a set of authored evals (intent artifacts) against the eval-intent rubric — scores 4 dimensions (intent coverage, anti-pattern detection, journey completeness, implementation independence), checks SYSTEM-SPEC.md for conflicts, scans prior specs for regression signals, and writes the Gate 2a evidence artifact. Invoked by /sdlc eval and by the sdlc-run pipeline Phase 2a.
Evaluates a test suite's quality as an instrument for detecting spec violations — scores 7 dimensions (AC coverage, behavioral specificity, intent fidelity, sensitivity, scenario completeness, assertion density, test independence), emits blocking/advisory flags, and writes the Gate 2b evidence artifact. Invoked by /sdlc gate eval-quality and by the sdlc-run pipeline.
Folds a closing spec's behavioral contributions into the compacted system specification (SYSTEM-SPEC.md) — extracts behaviors, organizes by domain, maintains provenance trails, and handles amendments to existing behaviors. Invoked by /sdlc close and /spec compact.
Runs the eval authoring phase for a spec — presents each acceptance criterion and guides the author to write an intent-capturing eval, scores the eval set via the eval-intent-scorer agent, runs a SYSTEM-SPEC.md compatibility check and prior-spec regression check, and iterates until the configured quality threshold is met. Use when the user says "/sdlc eval", "author evals", "write evals for this spec", "eval phase", or when the sdlc-run pipeline reaches Phase 2a.
Checks or runs quality gates (1-4) for a specification — verifies whether gate evidence exists, or collects new evidence by running tests, code review, or building the evidence package. Use when the user says "/sdlc gate", "/spec gate", "check gate", "verify gate", "gate status", "are my tests passing?", "check code quality", "is this ready to merge?", "run the review", "collect evidence", or wants to verify or produce gate evidence.
Runs the Speculator delivery workflow (Gate 4 + close) for a completed spec — builds the evidence package, closes the beads epic and stories, releases the spec lock, delivers to main via merge or PR per close.strategy, and compacts the spec into SYSTEM-SPEC.md. Use when the user says "/sdlc close", "/spec close", "close the spec", "close out this spec", "finalize the feature and merge", "deliver this spec to main", "ship this spec", or when the sdlc-run pipeline reaches Phase 5. NOT for closing individual beads issues or generic "close this issue/PR" requests — those have nothing to do with the spec delivery pipeline.
Runs diagnostic health checks on the Speculator environment — verifies git state, project config, plugin wiring, spec directory health, lock status, and beads CLI availability. Use when the user says "/sdlc doctor", "/spec doctor", "sdlc check", "spec check", "sdlc health", "check sdlc setup", "is my setup working?", "diagnose speculator", or when invoked automatically by /sdlc start.
Runs the Speculator implementation phase — bridges from an approved, Gate-1-passed spec to actionable work by creating an implementation plan derived from the spec, generating linked beads user stories, and handing off execution. Use when the user says "/sdlc implement", "/spec implement", "implement the spec", "start the implementation phase", "plan the implementation from this spec", "turn the spec into tasks", or when the spec has passed Gate 1 and is ready for implementation. NOT for generic "implement this function/feature" requests outside the spec pipeline.
Executes bash commands
Hook triggers when Bash tool is used
Uses power tools
Uses Bash, Write, or Edit tools
Own this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimOwn this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimBased on adoption, maintenance, documentation, and repository signals. Not a security audit or endorsement.
In an agentic world, the craft that used to live in the developer's head needs to live in the specification.
Agents can reason, decompose, and build — but they can't fill gaps from decades of domain intuition. The spec is where intent transfers from human to agent. We built Speculator because a great spec means the agent barely needs to improvise.
📖 Read the full manifesto: Quality In, Quality Out →
📝 Read the series: The Tax Nobody's Measuring · The Specification Tax
A Claude Code plugin that enforces a 7-stage quality pipeline (4 required + 3 opt-in gates) on agentic development workflows with LLM-as-judge spec scoring, git worktree isolation, and in-repo evidence artifacts. Spec + Evaluator = Speculator.
Speculator is being built toward an explicit goal: an anti-dark-code pipeline — a workflow that won't ship code unless intent, behavior, and comprehension can all be evidenced. Today it covers spec quality (Gate 1), eval intent (Gate 2a), code quality (Gate 2), eval quality (Gate 2b), comprehension (Gate 2c, experimental), code review (Gate 3), and evidence packaging (Gate 4). Gate 2c closes the last gap between "tests pass" and "any human or agent could explain what shipped" — its calibration corpus is the remaining open work. See ROADMAP.md.
npm install -g beads-cli)# Add the marketplace (one-time)
claude plugin marketplace add DMokong/claude-plugins
# Install the plugin
claude plugin install speculator@dmokong-plugins --scope project
# Bootstrap your project
/spec doctor --init
/spec doctor --init will create a default .claude/sdlc.local.md config and verify prerequisites. The pre-commit gate hook ships with the plugin and registers automatically — no setup step needed.
/spec doctor # verify environment is healthy
/spec start # create spec + worktree + beads epic
Now brainstorm before you spec. Use Claude's brainstorming skill to explore intent, clarify requirements, and design your approach — before committing anything to the spec. The brainstorm is where ambiguity dies and intent crystallizes.
# brainstorm your feature (Superpowers skill)
# → explores intent, asks clarifying questions, proposes approaches with trade-offs
# → produces a design-approved plan ready to be formalized
/spec create # Speculator turns your brainstormed plan into a formal spec.md
/spec score # Gate 1: automated spec quality scoring
/spec implement # create plan + beads stories + execute
/spec review # Gate 3: code review
/spec close # Gate 4: evidence package + deliver to main (merge or PR)
Give /spec run a brainstormed plan and it handles everything — creates the spec, scores it, improves it, implements it, reviews it, and merges it:
/spec run # autonomous: create → score → improve → implement → review → merge
This is designed to scale across multiple agents. Each agent gets a brainstormed plan, runs /spec run in its own worktree, and delivers a quality-gated feature independently.
Both
/specand/sdlcprefixes work — they're equivalent.
Here's what it actually looks like to build a feature with Speculator.
You: /spec start
Speculator: What feature are you building?
You: Add a webhook notification system for order events
Speculator: Created:
✅ Spec: docs/specs/webhook-notifications/spec.md
✅ Worktree: .claude/worktrees/webhook-notifications/
✅ Epic: beads-a3f "Webhook notification system"
Switched to worktree. Brainstorm your design, then run /spec create.
Now brainstorm. This is where the real work happens — before a single line of spec is written:
You: I want to add webhooks for order events. Let's brainstorm.
Claude: [explores intent] What's the main use case — are these for internal
microservices, external integrations, or both?
You: External integrations. Third-party apps need to react to order changes.
Claude: [clarifying questions] A few things to nail down:
• Which order events? Created, updated, cancelled, fulfilled?
• Retry semantics — how many retries, what backoff?
• Security — do we need payload signing?
• Rate limiting per consumer?
npx claudepluginhub dmokong/claude-plugins --plugin speculatorComplete collection of battle-tested Claude Code configs from an Anthropic hackathon winner - agents, skills, hooks, and rules evolved over 10+ months of intensive daily use
Comprehensive SEO analysis plugin for Claude Code. 25 sub-skills (21 core + 1 orchestrator + 1 framework + 2 extension mirrors) and 18 sub-agents cover technical SEO, content quality, schema, sitemaps, Core Web Vitals, local SEO, backlinks, AI/GEO, ecommerce, hreflang, SXO, clustering, drift monitoring, and Google APIs. Includes optional MCP extensions, SPA-aware rendering, portability, and hardened SSRF/DNS-rebinding safe fetchers.
Modern R development skills for Claude Code - tidyverse patterns, rlang metaprogramming, Bayesian inference, performance optimization, and more
Unity Development Toolkit - Expert agents for scripting/refactoring/optimization, script templates, and Agent Skills for Unity C# development
Complete creative writing suite with 10 specialized agents covering the full writing process: research gathering, character development, story architecture, world-building, dialogue coaching, editing/review, outlining, content strategy, believability auditing, and prose style/voice analysis. Includes genre-specific guides, templates, and quality checklists.
Comprehensive .NET development skills for modern C#, ASP.NET, MAUI, Blazor, Aspire, EF Core, Native AOT, testing, security, performance optimization, CI/CD, and cloud-native applications