By fakoli
Local-first, runtime-neutral project state for humans and AI coding agents — turn PRDs into lockable, evidence-backed work packets.
Code-review an anvil task's submitted changes against its acceptance criteria; return a structured PASS / SHOULD FIX / MUST FIX verdict. Read-only. Triggers: "review the changes for <task>", "review against acceptance criteria", "is this ready to accept".
Maintain anvil's inward-facing documentation: the `docs/` folder (specs, runbooks, design notes, plan archives), `CHANGELOG.md`, and the `description` field of `.claude-plugin/plugin.json`. Audits cross-references — broken `[[wikilinks]]`, mismatched anchors, dangling `see also` pointers, moved/archived file references — and reconciles prose against the actual code (schema, CLI, agents). Triggers: "update anvil docs", "fix broken links", "write the changelog", "doc cross-reference audit", "after-phase docs sweep".
Turn a parsed anvil PRD into a coherent task graph — group Requirements into Features, draft Tasks with acceptance criteria and verification commands, flag high-complexity tasks for expansion. Proposes; never writes state. Triggers: "generate features and tasks", "plan from the PRD", "extend the task graph for new requirements", "break down / expand this task".
Validate that submitted evidence on an anvil task actually proves the acceptance criteria — re-run verification commands, inspect outputs, return a binary PASS / FAIL scorecard. Read-only. Triggers: "verify the evidence for <task>", "does the evidence prove the criteria", "re-run verification". Unlike critic (code quality), sentinel checks evidence completeness.
Run sync reconciliation across anvil's three sources of truth inside an initialized project — SQLite (.anvil/state.db), the filesystem (packets/, .evidence-buffer/, worktrees), and git (branches, commits, claims). Surfaces drift as a structured report (orphan branches, orphan packets, stale claims, missing sync_mappings); reports only, never remediates — that is the user's choice via `anvil sync --fix --yes`. Triggers: "reconcile state", "sync drift", "check for orphans", "audit anvil", "is my project state stale".
Acquire an exclusive lease on an anvil task — pick from the ready queue, check for file conflicts, claim the task, and get a working git branch to commit into. Use this skill when ready to start work on an approved task.
Run the agentic execution loop on a claimed anvil task — fetch the work packet, do the work, submit completion evidence. Use this skill when an agent has just claimed a task and needs to execute it end-to-end without juggling individual CLI commands.
Decide what to do with an anvil task that has submitted evidence and is awaiting human review — accept and ship, reject and reopen, or hold for further investigation. Use this skill when one or more tasks are in needs_review and need a final disposition.
Turn a reviewed PRD into a ready-to-execute task graph — generate features and tasks, score each on six dimensions, surface dependencies and conflict groups, promote drafted tasks to ready. Use this skill once the PRD is approved and before any agent claims work.
Author, parse, and review a project PRD in anvil — capture the requirements that everything downstream (features, tasks, claims, evidence) gets generated from. Use this skill when starting a new project or revising requirements before any planning work happens.
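The reconciliation described in the state-keeper entry above (SQLite, filesystem, and git compared against each other) can be pictured as a handful of set differences. The following is a minimal sketch under stated assumptions, not anvil's implementation: `find_drift`, `DriftReport`, and all inputs are hypothetical names invented for illustration, and the three sources are stood in for by plain Python collections.

```python
from dataclasses import dataclass, field


@dataclass
class DriftReport:
    """Hypothetical report shape; field names are illustrative only."""
    orphan_branches: list[str] = field(default_factory=list)
    orphan_packets: list[str] = field(default_factory=list)
    stale_claims: list[str] = field(default_factory=list)


def find_drift(db_tasks, packet_tasks, branch_tasks, claims, now, lease_seconds):
    """Compare three views of project state and report mismatches.

    db_tasks:     task IDs recorded in the database
    packet_tasks: task IDs that have a packet on disk
    branch_tasks: task IDs that have a git branch
    claims:       mapping of task ID -> last heartbeat (epoch seconds)

    Reports only; nothing here modifies any of the inputs.
    """
    report = DriftReport()
    # Anything on disk or in git that the database does not know about.
    report.orphan_branches = sorted(set(branch_tasks) - set(db_tasks))
    report.orphan_packets = sorted(set(packet_tasks) - set(db_tasks))
    # Claims whose last heartbeat is older than the lease window.
    report.stale_claims = sorted(
        task for task, beat in claims.items() if now - beat > lease_seconds
    )
    return report


report = find_drift(
    db_tasks={"T001", "T002"},
    packet_tasks={"T001", "T009"},
    branch_tasks={"T002", "T007"},
    claims={"T001": 100.0, "T002": 990.0},
    now=1000.0,
    lease_seconds=300,
)
print(report.orphan_branches)  # ['T007']
print(report.orphan_packets)   # ['T009']
print(report.stale_claims)     # ['T001']
```

Keeping the function free of side effects mirrors the "reports only, never remediates" behaviour the description states.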
Admin access level
Server config contains admin-level keywords
Executes bash commands
Hook triggers when Bash tool is used
Modifies files
Hook triggers on file write and edit operations
Uses power tools
Uses Bash, Write, or Edit tools

The system of record for agent teams.
Durable, evidence-gated, lease-coordinated state for multi-agent software work.
Beta — v0.0.8. The core loop works today; command surfaces and APIs may change before 1.0.
Anvil is a local-first, backend-neutral project-state layer for humans and AI coding agents. It records requirements, tasks, claims, evidence, and reviews in SQLite under .anvil/, then exposes that state through a CLI (anvil) and an MCP server.
It is for developers running Claude Code, Codex, Cursor, OpenHands, or Copilot who need multiple agents, and multiple humans, to coordinate against the same plan without overwriting each other. Solo builders can use it to keep PRDs and task state across sessions; project leads can use it to audit what work was claimed, reviewed, and completed.
When an AI agent claims a task, that claim is an enforced database row with a lease and heartbeat. Completion is evidence-gated: Anvil does not record completed work without attached proof.
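To make the lease-and-heartbeat idea concrete, here is a minimal sketch using Python's standard `sqlite3` module. It is an illustration under assumptions, not anvil's schema: the table names, column names, `LEASE_SECONDS`, and the `claim` / `heartbeat` / `complete` helpers are all hypothetical. It shows a claim as a row guarded by a primary key, a lease that lapses when heartbeats stop, and a completion step that refuses to record anything without an evidence row.

```python
import sqlite3

LEASE_SECONDS = 300  # hypothetical lease length, not anvil's actual value

conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    CREATE TABLE claims (
        task_id   TEXT PRIMARY KEY,  -- at most one live claim per task
        agent     TEXT NOT NULL,
        heartbeat REAL NOT NULL      -- epoch seconds of the last heartbeat
    );
    CREATE TABLE evidence (task_id TEXT NOT NULL, note TEXT NOT NULL);
    CREATE TABLE completed (task_id TEXT PRIMARY KEY);
    """
)


def claim(task_id, agent, now):
    """Take the lease if it is free or the holder's heartbeat has lapsed."""
    with conn:  # expire-then-insert runs in one transaction
        conn.execute(
            "DELETE FROM claims WHERE task_id = ? AND heartbeat < ?",
            (task_id, now - LEASE_SECONDS),
        )
        try:
            conn.execute(
                "INSERT INTO claims (task_id, agent, heartbeat) VALUES (?, ?, ?)",
                (task_id, agent, now),
            )
        except sqlite3.IntegrityError:
            return False  # another agent still holds a live lease
    return True


def heartbeat(task_id, agent, now):
    """Renew the lease; returns False if this agent no longer holds it."""
    with conn:
        cur = conn.execute(
            "UPDATE claims SET heartbeat = ? WHERE task_id = ? AND agent = ?",
            (now, task_id, agent),
        )
    return cur.rowcount == 1


def complete(task_id):
    """Record completion only when at least one evidence row exists."""
    (count,) = conn.execute(
        "SELECT COUNT(*) FROM evidence WHERE task_id = ?", (task_id,)
    ).fetchone()
    if count == 0:
        return False  # evidence gate: no proof, no completion
    with conn:
        conn.execute(
            "INSERT OR IGNORE INTO completed (task_id) VALUES (?)", (task_id,)
        )
    return True


print(claim("T001", "agent-a", now=0))     # True
print(claim("T001", "agent-b", now=10))    # False: lease is still live
print(claim("T001", "agent-b", now=1000))  # True: agent-a stopped heartbeating
print(complete("T001"))                    # False: no evidence attached yet
```

Letting the primary key reject the second claim means the database, rather than agent cooperation, enforces exclusivity.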
| Surface | Count | Notes |
|---|---|---|
| CLI command entries | 35 | Top-level commands plus prd, review, hook, sync, and migrate sub-app entries. --use-llm augmentation picks Anthropic API / Bedrock / OpenAI-compatible endpoints via the same multi-provider resolver as the LLM-planner backstop. |
| MCP tools | 24 | FastMCP stdio; works in any MCP-compatible client. plan_tasks honors the project's llm_provider / llm_tier / Bedrock+custom knobs. |
| Skills | 8 | start-prd, prd, plan, claim, execute, finish, state-ops, resolve-decisions |
| Agents | 5 | planner (opus), critic (opus), docs-scribe (sonnet), sentinel (haiku), state-keeper (haiku); tier-mapped per docs/model-strategy.md |
| Hooks | 4 | detect-state, check-claim, record-file-change, capture-evidence |
| LLM providers | 3 | Anthropic API (default) · Amazon Bedrock ([bedrock] extra) · OpenAI-compatible custom endpoints ([custom] extra). See docs/llm-providers.md. |
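The Agents row pairs each agent with a model tier. A minimal sketch of such a lookup follows; the tier names and agent pairings are copied from the table, while `DEFAULT_AGENT_TIERS`, `resolve_tier`, and the `override` parameter are hypothetical names chosen for illustration and may not match anvil's code.

```python
# Tier vocabulary and agent pairings are taken from the table above.
# The function and dictionary names are illustrative assumptions.
MODEL_TIERS = ("opus", "sonnet", "haiku")

DEFAULT_AGENT_TIERS = {
    "planner": "opus",
    "critic": "opus",
    "docs-scribe": "sonnet",
    "sentinel": "haiku",
    "state-keeper": "haiku",
}


def resolve_tier(agent, override=None):
    """Return the tier for an agent; an explicit override takes priority."""
    tier = override if override is not None else DEFAULT_AGENT_TIERS[agent]
    if tier not in MODEL_TIERS:
        raise ValueError(f"unknown tier {tier!r}; expected one of {MODEL_TIERS}")
    return tier


print(resolve_tier("sentinel"))                   # haiku
print(resolve_tier("sentinel", override="opus"))  # opus
```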
Highlights:
- BedrockProvider (boto3 chain) and CustomEndpointProvider (vLLM / OpenRouter / LiteLLM-proxy / Together / Groq / Azure-OpenAI / self-hosted) ship alongside the existing AnthropicProvider. Precedence: explicit config > env auto-detect > fail loudly. Optional extras keep the default install lean.
- MODEL_TIERS vocabulary (opus / sonnet / haiku) with per-agent tier mapping that drops typical session cost ~60% versus the prior "everything routes to Opus" pattern. Override always wins.

Full release notes in CHANGELOG.md.
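The provider precedence stated in the highlights (explicit config, then environment auto-detect, then a loud failure) could look roughly like the sketch below. Only the `llm_provider` setting name comes from this document; the `ENV_HINTS` table, the choice of environment variables, the short provider labels, and `ProviderResolutionError` are assumptions made for illustration, and anvil's real detection rules may differ.

```python
import os


class ProviderResolutionError(RuntimeError):
    """Raised when no provider can be determined (the 'fail loudly' case)."""


# Hypothetical detection table: which environment variable hints at which
# provider. These pairings are assumptions, not anvil's documented rules.
ENV_HINTS = (
    ("ANTHROPIC_API_KEY", "anthropic"),
    ("AWS_PROFILE", "bedrock"),
    ("OPENAI_BASE_URL", "custom"),
)


def resolve_provider(config, env=None):
    """Pick a provider: explicit config first, then environment hints."""
    env = os.environ if env is None else env

    # 1. Explicit configuration wins outright.
    explicit = config.get("llm_provider")
    if explicit:
        return explicit

    # 2. Otherwise, fall back to the first matching environment hint.
    for variable, provider in ENV_HINTS:
        if env.get(variable):
            return provider

    # 3. Nothing matched: raise instead of silently guessing a default.
    raise ProviderResolutionError(
        "no LLM provider configured and none detected from the environment"
    )


print(resolve_provider({"llm_provider": "bedrock"}, env={"ANTHROPIC_API_KEY": "x"}))
# bedrock
print(resolve_provider({}, env={"OPENAI_BASE_URL": "http://localhost:8000/v1"}))
# custom
```

Passing `env` as a parameter keeps the function testable without touching the real process environment.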
anvil is a standalone CLI. To see the whole loop end-to-end against a seeded sample project:
```bash
anvil init --with-sample
# → scaffolds .anvil/, writes a valid sample prd.md, and runs
# parse → review → approve → plan → score → review tasks offline (no API key)
anvil next
# → returns a ready task immediately — nothing else to author or run
```
--with-sample is purely additive: plain anvil init is unchanged and
seeds nothing. Use the sample to learn the flow, then delete .anvil/
and run init for real on your own PRD as shown below.
```bash
# 1. Scaffold per-project state
anvil init --name "My Project"
# → creates .anvil/{config.yaml,state.db,events.jsonl,packets/}
# → next step: author your PRD at .anvil/prd.md

# 2. Author the PRD against the template (see docs/prd-template.md)
$EDITOR .anvil/prd.md

# 3. Parse, review, approve — the state machine requires draft → reviewed → approved
anvil prd parse
# → Parsed PRD: 4 requirements, 12 tasks staged for review
anvil prd review # draft → reviewed
anvil prd review --approve # reviewed → approved

# 4. Generate features and tasks; score across six dimensions
anvil plan
anvil score
# → tabular output: TaskID / Complexity / Parallel / CtxLoad / Blast / Review / Agent (1–5)
anvil review tasks

# 5. Pick the next ready task and claim it
anvil next
# → T001 — "Wire orchestrator retry to DLQ" (ready, no conflicts)
anvil claim T001
# → Claim C001 active; branch agent/t001-<slug> created
```
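The draft → reviewed → approved progression noted in step 3 is a small state machine. The sketch below is one way to model it, assuming only the forward transitions named in the quickstart; `PrdState`, `ALLOWED`, and `advance` are hypothetical names, and any additional transitions anvil may support (for example, sending a PRD back to draft) are not represented.

```python
from enum import Enum


class PrdState(Enum):
    DRAFT = "draft"
    REVIEWED = "reviewed"
    APPROVED = "approved"


# Only the forward transitions named in the quickstart are modelled here.
ALLOWED = {
    PrdState.DRAFT: {PrdState.REVIEWED},
    PrdState.REVIEWED: {PrdState.APPROVED},
    PrdState.APPROVED: set(),
}


def advance(current, target):
    """Return the new state, or raise if the transition skips a step."""
    if target not in ALLOWED[current]:
        raise ValueError(
            f"illegal PRD transition: {current.value} -> {target.value}"
        )
    return target


state = PrdState.DRAFT
state = advance(state, PrdState.REVIEWED)
state = advance(state, PrdState.APPROVED)
print(state.value)  # approved
```

Under this model, approving a PRD that is still in draft raises an error, which is why the quickstart runs the review step before the approve step.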
npx claudepluginhub fakoli/anvil