anvil

Name: anvil
Author: fakoli

Local-first, runtime-neutral project state for humans and AI coding agents — turn PRDs into lockable, evidence-backed work packets.

npx claudepluginhub fakoli/anvil

Popularity

Stars

Med: 0·Avg: 285

Installs

Med: 0·Avg: 1

What's Inside

Agents (5)

critic

/critic

Code-review an anvil task's submitted changes against its acceptance criteria; return a structured PASS / SHOULD FIX / MUST FIX verdict. Read-only. Triggers: "review the changes for <task>", "review against acceptance criteria", "is this ready to accept".

docs-scribe

/docs-scribe

Maintain anvil's inward-facing documentation: the `docs/` folder (specs, runbooks, design notes, plan archives), `CHANGELOG.md`, and the `description` field of `.claude-plugin/plugin.json`. Audits cross-references — broken `[[wikilinks]]`, mismatched anchors, dangling `see also` pointers, moved/archived file references — and reconciles prose against the actual code (schema, CLI, agents). Triggers: "update anvil docs", "fix broken links", "write the changelog", "doc cross-reference audit", "after-phase docs sweep".

planner

/planner

Turn a parsed anvil PRD into a coherent task graph — group Requirements into Features, draft Tasks with acceptance criteria and verification commands, flag high-complexity tasks for expansion. Proposes; never writes state. Triggers: "generate features and tasks", "plan from the PRD", "extend the task graph for new requirements", "break down / expand this task".

sentinel

/sentinel

Validate that submitted evidence on an anvil task actually proves the acceptance criteria — re-run verification commands, inspect outputs, return a binary PASS / FAIL scorecard. Read-only. Triggers: "verify the evidence for <task>", "does the evidence prove the criteria", "re-run verification". Unlike critic (code quality), sentinel checks evidence completeness.

state-keeper

/state-keeper

Run sync reconciliation across anvil's three sources of truth inside an initialized project — SQLite (.anvil/state.db), the filesystem (packets/, .evidence-buffer/, worktrees), and git (branches, commits, claims). Surfaces drift as a structured report (orphan branches, orphan packets, stale claims, missing sync_mappings); reports only, never remediates — that is the user's choice via `anvil sync --fix --yes`. Triggers: "reconcile state", "sync drift", "check for orphans", "audit anvil", "is my project state stale".

Skills (8)

claim

/claim

Acquire an exclusive lease on an anvil task — pick from the ready queue, check for file conflicts, claim the task, and get a working git branch to commit into. Use this skill when ready to start work on an approved task.

execute

/execute

Run the agentic execution loop on a claimed anvil task — fetch the work packet, do the work, submit completion evidence. Use this skill when an agent has just claimed a task and needs to execute it end-to-end without juggling individual CLI commands.

finish

/finish

Decide what to do with an anvil task that has submitted evidence and is awaiting human review — accept and ship, reject and reopen, or hold for further investigation. Use this skill when one or more tasks are in needs_review and need a final disposition.

plan

/plan

Turn a reviewed PRD into a ready-to-execute task graph — generate features and tasks, score each on six dimensions, surface dependencies and conflict groups, promote drafted tasks to ready. Use this skill once the PRD is approved and before any agent claims work.

prd

/prd

Author, parse, and review a project PRD in anvil — capture the requirements that everything downstream (features, tasks, claims, evidence) gets generated from. Use this skill when starting a new project or revising requirements before any planning work happens.

Hooks (1)

Event Hooks

Bash

File writes

4 hooks across 3 events

MCP Servers (1)

anvil

admin

Stats

Version: 0.0.8

Language: Python

Stars: 0

Maintenance: Excellent

License: MIT

Last Commit: Jun 18, 2026

Added: Jun 18, 2026


Safety Signals

Critical

Admin access level

Server config contains admin-level keywords

Caution

Executes bash commands

Hook triggers when Bash tool is used

README

Anvil

The system of record for agent teams.

Durable, evidence-gated, lease-coordinated state for multi-agent software work.

Beta — v0.0.8. The core loop works today; command surfaces and APIs may change before 1.0.

Why Anvil

Anvil is a local-first, backend-neutral project-state layer for humans and AI coding agents. It records requirements, tasks, claims, evidence, and reviews in SQLite under .anvil/, then exposes that state through a CLI (anvil) and an MCP server.

It is for developers running Claude Code, Codex, Cursor, OpenHands, or Copilot who need multiple agents, and multiple humans, to coordinate against the same plan without overwriting each other. Solo builders can use it to keep PRDs and task state across sessions; project leads can use it to audit what work was claimed, reviewed, and completed.

When an AI agent claims a task, that claim is an enforced database row with a lease and heartbeat. Completion is evidence-gated: Anvil does not record completed work without attached proof.
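
To make the lease-and-heartbeat idea concrete, here is a minimal sketch using Python's standard `sqlite3` module. The table names, columns, `LEASE_SECONDS` value, and function names are all hypothetical; this README does not show Anvil's actual schema, so treat this as an illustration of the pattern rather than Anvil's implementation.

```python
import sqlite3

# Hypothetical schema for illustration only; not Anvil's real tables.
SCHEMA = """
CREATE TABLE claims (
    task_id       TEXT PRIMARY KEY,  -- at most one live claim per task
    agent         TEXT NOT NULL,
    lease_expires REAL NOT NULL      -- Unix timestamp
);
CREATE TABLE evidence (task_id TEXT NOT NULL, note TEXT NOT NULL);
CREATE TABLE completed (task_id TEXT PRIMARY KEY);
"""

LEASE_SECONDS = 60.0  # assumed lease length


def open_db() -> sqlite3.Connection:
    conn = sqlite3.connect(":memory:")
    conn.executescript(SCHEMA)
    return conn


def claim(conn: sqlite3.Connection, task_id: str, agent: str, now: float) -> bool:
    """Take the lease if the task is unclaimed or the previous lease expired."""
    with conn:
        # Drop an expired lease first so the insert below can succeed.
        conn.execute(
            "DELETE FROM claims WHERE task_id = ? AND lease_expires <= ?",
            (task_id, now),
        )
        try:
            conn.execute(
                "INSERT INTO claims (task_id, agent, lease_expires) VALUES (?, ?, ?)",
                (task_id, agent, now + LEASE_SECONDS),
            )
        except sqlite3.IntegrityError:
            return False  # another agent holds a live lease
    return True


def heartbeat(conn: sqlite3.Connection, task_id: str, agent: str, now: float) -> bool:
    """Extend the lease; only the current, unexpired holder may do so."""
    with conn:
        cur = conn.execute(
            "UPDATE claims SET lease_expires = ? "
            "WHERE task_id = ? AND agent = ? AND lease_expires > ?",
            (now + LEASE_SECONDS, task_id, agent, now),
        )
    return cur.rowcount == 1


def complete(conn: sqlite3.Connection, task_id: str, agent: str, now: float) -> bool:
    """Record completion only for the lease holder, and only with evidence."""
    holder = conn.execute(
        "SELECT 1 FROM claims WHERE task_id = ? AND agent = ? AND lease_expires > ?",
        (task_id, agent, now),
    ).fetchone()
    proof = conn.execute(
        "SELECT 1 FROM evidence WHERE task_id = ?", (task_id,)
    ).fetchone()
    if holder is None or proof is None:
        return False
    with conn:
        conn.execute("INSERT INTO completed (task_id) VALUES (?)", (task_id,))
        conn.execute("DELETE FROM claims WHERE task_id = ?", (task_id,))
    return True
```

The primary key on `task_id` is what makes the claim "enforced": a second agent's insert fails at the database level instead of relying on agents to check first.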

What ships today

| Surface | Count | Notes |
| --- | --- | --- |
| CLI command entries | 35 | Top-level commands plus `prd`, `review`, `hook`, `sync`, and `migrate` sub-app entries. `--use-llm` augmentation picks Anthropic API / Bedrock / OpenAI-compatible endpoints via the same multi-provider resolver as the LLM-planner backstop. |
| MCP tools | 24 | FastMCP stdio; works in any MCP-compatible client. `plan_tasks` honors the project's `llm_provider` / `llm_tier` / Bedrock+custom knobs. |
| Skills | 8 | start-prd, prd, plan, claim, execute, finish, state-ops, resolve-decisions |
| Agents | 5 | planner (opus), critic (opus), docs-scribe (sonnet), sentinel (haiku), state-keeper (haiku); tier-mapped per docs/model-strategy.md |
| Hooks | 4 | detect-state, check-claim, record-file-change, capture-evidence |
| LLM providers | 3 | Anthropic API (default) · Amazon Bedrock (`[bedrock]` extra) · OpenAI-compatible custom endpoints (`[custom]` extra). See docs/llm-providers.md. |
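
The per-agent tier mapping in the Agents row can be pictured as a small lookup with an override. The sketch below copies the tier assignments from that row; the names `AGENT_TIERS`, `resolve_tier`, and the `overrides` argument are hypothetical and are not taken from Anvil's code.

```python
from typing import Mapping, Optional

# Tier vocabulary and per-agent defaults as listed in the table.
MODEL_TIERS = ("opus", "sonnet", "haiku")
AGENT_TIERS = {
    "planner": "opus",
    "critic": "opus",
    "docs-scribe": "sonnet",
    "sentinel": "haiku",
    "state-keeper": "haiku",
}


def resolve_tier(agent: str, overrides: Optional[Mapping[str, str]] = None) -> str:
    """Return the tier for an agent; an explicit override takes priority."""
    tier = (overrides or {}).get(agent, AGENT_TIERS.get(agent))
    if tier not in MODEL_TIERS:
        raise ValueError(f"no valid tier for agent {agent!r}: {tier!r}")
    return tier
```

For example, `resolve_tier("sentinel")` gives `"haiku"`, while `resolve_tier("sentinel", {"sentinel": "opus"})` gives `"opus"`.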

Highlights:

- Multi-provider LLM access. BedrockProvider (boto3 chain) and CustomEndpointProvider (vLLM / OpenRouter / LiteLLM-proxy / Together / Groq / Azure-OpenAI / self-hosted) ship alongside the existing AnthropicProvider. Precedence: explicit config > env auto-detect > fail loudly. Optional extras keep the default install lean.
- Tier-aware model defaults. New MODEL_TIERS vocabulary (opus / sonnet / haiku) with per-agent tier mapping that drops typical session cost ~60% versus the prior "everything routes to Opus" pattern. Override always wins.
- CI covers the full pytest suite and benchmark smoke test; SQLite schema remains at version 5.
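
The provider precedence stated above (explicit config, then environment auto-detection, then a loud failure) might look roughly like the following. This is a sketch under assumptions: the environment variable names in `ENV_HINTS`, the provider labels, and `resolve_provider` itself are illustrative guesses, and Anvil's actual detection rules (see docs/llm-providers.md) may differ.

```python
from typing import Mapping, Optional

KNOWN_PROVIDERS = {"anthropic", "bedrock", "custom"}

# Assumed env-var hints, checked in order. These are NOT confirmed to be
# the variables Anvil inspects.
ENV_HINTS = (
    ("ANTHROPIC_API_KEY", "anthropic"),
    ("AWS_PROFILE", "bedrock"),
    ("OPENAI_BASE_URL", "custom"),
)


class NoProviderError(RuntimeError):
    """Raised when no provider can be resolved (the 'fail loudly' case)."""


def resolve_provider(configured: Optional[str], env: Mapping[str, str]) -> str:
    # 1. Explicit config wins over anything found in the environment.
    if configured is not None:
        if configured not in KNOWN_PROVIDERS:
            raise NoProviderError(f"unknown provider in config: {configured!r}")
        return configured
    # 2. Otherwise fall back to environment auto-detection.
    for var, provider in ENV_HINTS:
        if env.get(var):
            return provider
    # 3. Nothing matched: raise instead of silently picking a default.
    raise NoProviderError("no LLM provider configured or detected")
```

Passing `env` as a mapping (for instance `os.environ`) keeps the function easy to test without touching the real environment.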

Full release notes in CHANGELOG.md.

Quick Start

Zero-to-next in one command

anvil is a standalone CLI. To see the whole loop end-to-end against a seeded sample project:

anvil init --with-sample
# → scaffolds .anvil/, writes a valid sample prd.md, and runs
#   parse → review → approve → plan → score → review tasks offline (no API key)
anvil next
# → returns a ready task immediately — nothing else to author or run

--with-sample is purely additive: plain anvil init is unchanged and seeds nothing. Use the sample to learn the flow, then delete .anvil/ and run init for real on your own PRD as shown below.

The full loop on your own PRD

# 1. Scaffold per-project state
anvil init --name "My Project"
# → creates .anvil/{config.yaml,state.db,events.jsonl,packets/}
# → next step: author your PRD at .anvil/prd.md

# 2. Author the PRD against the template (see docs/prd-template.md)
$EDITOR .anvil/prd.md

# 3. Parse, review, approve — the state machine requires draft → reviewed → approved
anvil prd parse
# → Parsed PRD: 4 requirements, 12 tasks staged for review
anvil prd review             # draft → reviewed
anvil prd review --approve   # reviewed → approved

# 4. Generate features and tasks; score across six dimensions
anvil plan
anvil score
# → tabular output: TaskID / Complexity / Parallel / CtxLoad / Blast / Review / Agent (1–5)
anvil review tasks

# 5. Pick the next ready task and claim it
anvil next
# → T001 — "Wire orchestrator retry to DLQ" (ready, no conflicts)
anvil claim T001
# → Claim C001 active; branch agent/t001-<slug> created
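
Step 3 above notes that the PRD must move from draft to reviewed to approved, with no skipping. A minimal sketch of that kind of transition check follows; the `ALLOWED` table and the `advance` function are hypothetical and only illustrate the rule, not how Anvil enforces it.

```python
# Allowed PRD status transitions, per the comment in step 3.
ALLOWED = {
    "draft": {"reviewed"},
    "reviewed": {"approved"},
    "approved": set(),
}


def advance(current: str, target: str) -> str:
    """Return the new status, or raise if the transition is not allowed."""
    if target not in ALLOWED.get(current, set()):
        raise ValueError(f"illegal PRD transition: {current} -> {target}")
    return target


status = "draft"
status = advance(status, "reviewed")   # mirrors `anvil prd review`
status = advance(status, "approved")   # mirrors `anvil prd review --approve`
```

Under this rule, jumping straight from `draft` to `approved` raises an error, which is why the walkthrough runs the review command twice.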

View full README on GitHub
