Search everything...

Stats

Actions

Available In

beast-forge

Name: beast-forge
Author: malakhov-dmitrii

By malakhov-dmitrii

Execute a multi-stage planning and verification pipeline that turns any task description into an actionable plan, implements it with strict TDD cycles, and independently verifies results before approving completion. Reduces hallucination and regressions through automated codebase exploration, reality-checking of claims, and quality scoring.

npx claudepluginhub malakhov-dmitrii/forge

Popularity

Stars

Top 25%

Med: 0·Avg: 285

Installs

Med: 0·Avg: 1

What's Inside

Slash Commands2

forge-setup

/forge-setup

One-time project setup for forge: creates docs/ vault, .semgrep/ rules, CLAUDE.md sections

forge

/forge

Ore in, steel out. Full pipeline: plan → execute → verify. Use: /forge 'task', /forge --full, --discuss, --plan-only, --execute, --park, --resume

Agents21

executor

/executor

TDD implementation agent for beast. Executes plan tasks with strict RED-GREEN-REFACTOR discipline.

explorer

/explorer

Quick codebase explorer for beast. Maps project structure, tech stack, test infrastructure, and key patterns.

hygiene-synthesizer

/hygiene-synthesizer

Combines all tool outputs and agent findings into unified health report. Generates architecture.md, calculates health score, classifies severity.

planner-v3

/planner-v3

Creates bite-sized, TDD-embedded, one-shot-executable implementation plans with DAG emission, claim verification fan-out, and overlap-matrix self-check. Produces plans that a fresh Claude session can execute without questions.

planner

/planner

Creates bite-sized, TDD-embedded, one-shot-executable implementation plans for beast-plan. Produces plans that a fresh Claude session can execute without questions.

Skills3

code-hygiene

/code-hygiene

Codebase health analysis: dead code, test quality, duplicates, complexity, security, architecture mapping. Tool-first, structured storage, forge integration.

docs-refresh

/docs-refresh

Full documentation hygiene pass: memory, CLAUDE.md, lessons, references, guides. Audit freshness, delete stale, update outdated, compress index.

forge

/forge

Ore in, steel out. Planning pipeline with independent verification, persistent memory, and compounding knowledge. Use for 3+ files or unclear scope.

Hooks1

Event Hooks

3 hooks across 3 events

Stats

Version3.0.0

LanguageJavaScript

Stars22

Forks4

MaintenanceExcellent

LicenseMIT

Last CommitApr 19, 2026

AddedMar 9, 2026

Actions

View on GitHub View README Plugin Marketplace JSON

Own this plugin?

Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).

Safety Signals

Caution

Uses power tools

Uses Bash, Write, or Edit tools

README

Forge

Ore in, steel out.

A blacksmith doesn't blame the ore. It smelts, shapes, tempers, and quenches — until what comes out holds an edge. Forge does the same with code: takes any task, however raw, and pushes it through planning gates, independent review, and verified execution until the result is proven to work.

Claude Code plugin. Three skills: planning pipeline, docs hygiene, and code hygiene. Persistent memory that learns from every run.

"fix the auth bug" → research → plan → 2 independent reviews → TDD execute → independent verify → done

Install

git clone https://github.com/malakhov-dmitrii/forge.git ~/.claude/plugins/forge
bun run ~/.claude/plugins/forge/scripts/install.mjs

The installer wires SessionStart/SessionEnd/PreCompact hooks into the plugin's own hooks/hooks.json and initializes the cross-project knowledge DB at ~/.forge/global.db. The project-scoped .omc/forge.db is created lazily on your first /forge run in each project — nothing to set up per project.

Safe to re-run. Pass --dry-run to preview changes, --no-db to skip DB init. Reverse with bun run ~/.claude/plugins/forge/scripts/uninstall.mjs (data is preserved).

Requires bun on PATH — the DB layer uses bun:sqlite.

Skills

`/forge` — planning + execution + verification

The main pipeline. Takes a task from idea to verified implementation.

/forge "add rate limiting to the API"      — plan → execute → verify
/forge --full "migrate to new auth system"  — deeper research + spike
/forge --discuss "improve engagement"       — clarify vague input first
/forge --plan-only                          — stop after final plan

Two machines work in sequence:

Plan Forge — refinement loop. Searches your project's gotchas, past plans, and architecture docs. Spikes risky assumptions. Gets a second opinion from a different AI model. Cycles until two independent reviewers find zero issues. Max 5 iterations.

Verification Chain — two agents who have never seen the executor's output independently verify every acceptance criterion. An auditor spot-checks 30-50% of the evidence. Gaps get fed back to execution.

Iron rules (hardcoded, system can't override):

Minimum 2 independent review gates on every plan
Planner ≠ reviewer. No shared context.
Pipeline config is project-scoped. Never auto-propagates between projects.

`/docs-refresh` — documentation hygiene

Audits all project docs for freshness. Deletes what's dead, updates what drifted, compresses what's bloated. Also runs as Forge's final stage.

/docs-refresh                — full audit: memory, CLAUDE.md, lessons, references
/docs-refresh --scan-only    — report only, no changes
/docs-refresh --memory-only  — scope to memory files

`/code-hygiene` — codebase health analysis

Runs static analysis tools, interprets findings, maps architecture, evaluates test quality. Stores structured results in .omc/hygiene/ for tracking over time.

/code-hygiene                    — full scan (adaptive: inline or agents based on project size)
/code-hygiene --module src/auth  — scope to specific directory
/code-hygiene --deep             — include mutation testing (slow)

Tool-first approach: runs tsc, scc, semgrep, knip, jscpd, dependency-cruiser — then interprets JSON output. Produces a health score (0-100), severity-classified findings (P0-P3), architecture overview with mermaid diagrams, and test quality analysis. P0/P1 findings can be turned into parked forge tasks for refactoring.

Forge Intelligence

Every forge run writes to a SQLite database (.omc/forge.db). The system learns from its own history.

What it tracks:

Gate results per iteration (which reviews failed, what they found)
Spike cache (confirmed/refuted assumptions with TTL)
Risk scores per system (auto-aggregated from past failures)
Co-failure patterns (which pairs of systems tend to break together)

What it enables:

--park / --resume — save work, come back in a new session
--spawn "sub-task" — create child forges with dependency tracking
Compaction survival — state preserved when context window resets
PRECEDENT phase queries past runs before planning ("last time this system needed 3 iterations")

Cross-project knowledge lives in ~/.forge/global.db — verified facts about tools and libraries that apply everywhere. Spikes about external tools get promoted automatically.

Schema is versioned (PRAGMA user_version) — future updates migrate automatically.

How It Works

Plan Forge

View full README on GitHub

beast-forge

Popularity

What's Inside

Confidence

README

Forge

Install

Skills

/forge — planning + execution + verification

/docs-refresh — documentation hygiene

/code-hygiene — codebase health analysis

Forge Intelligence

How It Works

Plan Forge

Similar Plugins

claudekit

forge

replan

ai-workflow

More by malakhov-dmitrii

fusion

Forge

Install

Skills

/forge — planning + execution + verification

/docs-refresh — documentation hygiene

/code-hygiene — codebase health analysis

Forge Intelligence

How It Works

Plan Forge

Popularity

Health & Quality

More by malakhov-dmitrii

fusion

Similar Plugins

claudekit

forge

replan

ai-workflow

agentic-dev-team

accelerator

`/forge` — planning + execution + verification

`/docs-refresh` — documentation hygiene

`/code-hygiene` — codebase health analysis

`/forge` — planning + execution + verification

`/docs-refresh` — documentation hygiene

`/code-hygiene` — codebase health analysis