Execute a multi-stage planning and verification pipeline that turns any task description into an actionable plan, implements it with strict TDD cycles, and independently verifies results before approving completion. Reduces hallucination and regressions through automated codebase exploration, reality-checking of claims, and quality scoring.
TDD implementation agent for beast. Executes plan tasks with strict RED-GREEN-REFACTOR discipline.
Quick codebase explorer for beast. Maps project structure, tech stack, test infrastructure, and key patterns.
Combines all tool outputs and agent findings into unified health report. Generates architecture.md, calculates health score, classifies severity.
Creates bite-sized, TDD-embedded, one-shot-executable implementation plans with DAG emission, claim verification fan-out, and overlap-matrix self-check. Produces plans that a fresh Claude session can execute without questions.
Creates bite-sized, TDD-embedded, one-shot-executable implementation plans for beast-plan. Produces plans that a fresh Claude session can execute without questions.
Codebase health analysis: dead code, test quality, duplicates, complexity, security, architecture mapping. Tool-first, structured storage, forge integration.
Full documentation hygiene pass: memory, CLAUDE.md, lessons, references, guides. Audit freshness, delete stale, update outdated, compress index.
Ore in, steel out. Planning pipeline with independent verification, persistent memory, and compounding knowledge. Use for 3+ files or unclear scope.
Uses power tools
Uses Bash, Write, or Edit tools
Own this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimOwn this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimBased on adoption, maintenance, documentation, and repository signals. Not a security audit or endorsement.
Ore in, steel out.
A blacksmith doesn't blame the ore. It smelts, shapes, tempers, and quenches — until what comes out holds an edge. Forge does the same with code: takes any task, however raw, and pushes it through planning gates, independent review, and verified execution until the result is proven to work.
Claude Code plugin. Three skills: planning pipeline, docs hygiene, and code hygiene. Persistent memory that learns from every run.
"fix the auth bug" → research → plan → 2 independent reviews → TDD execute → independent verify → done
git clone https://github.com/malakhov-dmitrii/forge.git ~/.claude/plugins/forge
bun run ~/.claude/plugins/forge/scripts/install.mjs
The installer wires SessionStart/SessionEnd/PreCompact hooks into the plugin's own hooks/hooks.json and initializes the cross-project knowledge DB at ~/.forge/global.db. The project-scoped .omc/forge.db is created lazily on your first /forge run in each project — nothing to set up per project.
Safe to re-run. Pass --dry-run to preview changes, --no-db to skip DB init. Reverse with bun run ~/.claude/plugins/forge/scripts/uninstall.mjs (data is preserved).
Requires bun on PATH — the DB layer uses bun:sqlite.
/forge — planning + execution + verificationThe main pipeline. Takes a task from idea to verified implementation.
/forge "add rate limiting to the API" — plan → execute → verify
/forge --full "migrate to new auth system" — deeper research + spike
/forge --discuss "improve engagement" — clarify vague input first
/forge --plan-only — stop after final plan
Two machines work in sequence:
Plan Forge — refinement loop. Searches your project's gotchas, past plans, and architecture docs. Spikes risky assumptions. Gets a second opinion from a different AI model. Cycles until two independent reviewers find zero issues. Max 5 iterations.
Verification Chain — two agents who have never seen the executor's output independently verify every acceptance criterion. An auditor spot-checks 30-50% of the evidence. Gaps get fed back to execution.
Iron rules (hardcoded, system can't override):
/docs-refresh — documentation hygieneAudits all project docs for freshness. Deletes what's dead, updates what drifted, compresses what's bloated. Also runs as Forge's final stage.
/docs-refresh — full audit: memory, CLAUDE.md, lessons, references
/docs-refresh --scan-only — report only, no changes
/docs-refresh --memory-only — scope to memory files
/code-hygiene — codebase health analysisRuns static analysis tools, interprets findings, maps architecture, evaluates test quality. Stores structured results in .omc/hygiene/ for tracking over time.
/code-hygiene — full scan (adaptive: inline or agents based on project size)
/code-hygiene --module src/auth — scope to specific directory
/code-hygiene --deep — include mutation testing (slow)
Tool-first approach: runs tsc, scc, semgrep, knip, jscpd, dependency-cruiser — then interprets JSON output. Produces a health score (0-100), severity-classified findings (P0-P3), architecture overview with mermaid diagrams, and test quality analysis. P0/P1 findings can be turned into parked forge tasks for refactoring.
Every forge run writes to a SQLite database (.omc/forge.db). The system learns from its own history.
What it tracks:
What it enables:
--park / --resume — save work, come back in a new session--spawn "sub-task" — create child forges with dependency trackingCross-project knowledge lives in ~/.forge/global.db — verified facts about tools and libraries that apply everywhere. Spikes about external tools get promoted automatically.
Schema is versioned (PRAGMA user_version) — future updates migrate automatically.
npx claudepluginhub malakhov-dmitrii/forgeMulti-model consensus planner. Claude + Codex + DeepSeek independently draft a plan, cross-verify each other, and must reach consensus before any plan is emitted. Output is a plan; it never edits your code.
Verification-first engineering toolkit for Claude Code. 15 skills across a 5-phase spine (Investigate → Design → Implement → Verify → Ship), 8 specialist agents, an interactive setup wizard. Every skill has rationalizations + evidence requirements. Built for senior ICs and tech leads.
Adversarial multi-agent pipeline for Claude Code. GAN-style loops where generators produce artifacts, discriminators validate them, and feedback drives convergence.
Pre- and post-implementation validation with parallel subagents: /replan validates plans before execution, /recheck verifies implementations match the plan
AI-powered development workflow automation - Phase-based planning, implementation orchestration, preflight code quality checks with security scanning, ship-it workflow, and development principles generator for CLAUDE.md
Persona-driven AI development team: orchestrator, team agents, review agents, skills, slash commands, and advisory hooks for Claude Code
A Claude Code plugin for structured, context-efficient software development.