Autonomous multi-agent software development workflow: Intake → Planning → Scheduler → Executor × N → Review. Takes a one-shot human brief and delivers a software project end-to-end as a batch job. Do NOT use for interactive pair-programming.
AI-Robin Committer Agent. Executes a git commit using the exact message provided by the trigger signal. Invoked only by the AI-Robin kernel after review_merged or degradation_spec_written.
AI-Robin Degrader Agent. Writes the context-degraded-*.yaml spec with narrative and updates escalation-notice.md. Invoked only by the AI-Robin kernel when a scope is degraded.
AI-Robin Executor Agent. Writes actual application code for one milestone. May modify source files and run tests. Does NOT git commit (kernel does that). Invoked only by the AI-Robin kernel.
AI-Robin Finalizer Agent. Generates the end-of-run delivery bundle (DELIVERY.md). Invoked only by the AI-Robin kernel on all_complete.
AI-Robin Intake Agent. Intake stage — the only sub-agent that interacts with the user. Do not invoke for general intake tasks; only invoke as part of an AI-Robin dispatch loop.
AI-Robin Committer. Executes a git commit on behalf of the kernel using the verbatim message provided by Merger or Degrader. Never composes commit messages itself.
AI-Robin Degrader. Writes the context-degraded-*.yaml spec narrative and updates escalation-notice when a scope is degraded. Reads original specs + ledger history to compose the narrative (domain work the kernel cannot do itself).
AI-Robin Executor stage. Given a single task (typically one milestone), loads relevant context, writes/modifies code and specs, and returns a structured artifacts summary. Does NOT git commit (kernel delegates that to Committer).
AI-Robin Finalizer. Generates the end-of-run delivery bundle (.ai-robin/DELIVERY.md) summarizing what was built vs degraded, by scanning ledger + reading intent specs.
AI-Robin Intake stage. Conducts the one-time user Q&A at the start of a Robin run — surfaces decisions, fills gaps, produces planning-ready Feature Room specs. Only human-facing stage of an AI-Robin run.
Drop a brief in. Walk away. Verify the diff.
Robin is an autonomous multi-agent workflow that takes a one-shot human brief and delivers a software project end to end. It runs as a Claude Code plugin: you spend 15–45 minutes on intake, then walk away for a multi-hour unattended run, then verify the diff.
The bet: generation is expensive, verification is cheap. If intake is good enough, the hours of execution in between don't need you.
Robin is a batch job, not a copilot. Do not use it for interactive pair-programming.
Inside a Claude Code session:
/plugin marketplace add waynewangyuxuan/Robin
/plugin install robin@robin
/robin-start, /robin-resume, and /robin-status become available in every session. Claude Code auto-pulls updates from this repo's default branch on restart.
If you're editing Robin itself and want your changes reflected live (no copy, no pin):
git clone https://github.com/waynewangyuxuan/Robin.git
cd Robin
source ./dev-install.sh # adds `claude-robin` alias, active immediately
claude-robin # start Claude Code with Robin loaded from your source
After editing skills, agents, or hooks mid-session, run /reload-plugins to refresh without restarting. Uninstall the alias with ./dev-install.sh remove.
| Command | When |
|---|---|
| `/robin-start <brief>` | Begin a new run. Intake stage starts immediately. |
| `/robin-resume` | Continue a run interrupted mid-stage (auto-detects `.ai-robin/stage-state.json`). |
| `/robin-status` | Read-only inspection of current stage and ledger. |
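As a rough illustration of what "auto-detects" could look like, the sketch below checks a project root for `.ai-robin/stage-state.json` and reads a stage name from it. The function name `find_resumable_stage` and the `stage` key are assumptions made for this example; the README does not document the file's schema.

```python
import json
import tempfile
from pathlib import Path
from typing import Optional


def find_resumable_stage(project_root: Path) -> Optional[str]:
    """Return the recorded stage name, or None if there is nothing to resume.

    Assumes (hypothetically) that stage-state.json holds a JSON object
    with a "stage" key; the real schema may differ.
    """
    state_file = project_root / ".ai-robin" / "stage-state.json"
    if not state_file.is_file():
        return None
    try:
        state = json.loads(state_file.read_text(encoding="utf-8"))
    except json.JSONDecodeError:
        return None  # a corrupt file is treated as "no resumable run"
    if not isinstance(state, dict):
        return None
    stage = state.get("stage")
    return stage if isinstance(stage, str) else None


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        root = Path(tmp)
        print(find_resumable_stage(root))  # no state file yet
        (root / ".ai-robin").mkdir()
        (root / ".ai-robin" / "stage-state.json").write_text('{"stage": "executor"}')
        print(find_resumable_stage(root))  # state file now present
```

Treating a missing or unreadable file as "nothing to resume" keeps the check read-only and safe to call from a status command as well.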
| Good fit | Poor fit |
|---|---|
| Greenfield, medium-complexity (web app, CLI, API, agent app) | Highly fuzzy requirements that need exploratory iteration |
| Requirements expressible in ≤15 Q&A rounds | Strong stylistic preferences hard to articulate |
| You accept "some scope may be degraded" over "must be 100%" | Massive existing codebases needing deep context |
| Acceptance criteria are concrete (gate criteria) | Life-critical, financial, or legal production code |
flowchart LR
U([Brief]) --> I[Intake]
I --> P[Planning]
P --> S[Scheduler]
S --> E1[Executor]
S --> E2[Executor]
S --> En[Executor ×N]
E1 & E2 & En --> R[Review<br/>plan→fan-out→merge]
R -- pass --> S
R -- fail --> P
R -- done --> F[Finalize]
F --> V([Verify diff])
classDef human fill:#fef3c7,stroke:#d97706,color:#000
classDef stage fill:#dbeafe,stroke:#2563eb,color:#000
classDef review fill:#fce7f3,stroke:#db2487,color:#000
class U,V human
class I,P,S,E1,E2,En,F stage
class R review
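The edges of the flowchart can be read as a small transition table. The sketch below is one way to encode them, not Robin's actual implementation: the `pass`, `fail`, and `done` outcomes come from the diagram, while the `ok` outcome and the names `TRANSITIONS`, `next_stage`, and `walk` are hypothetical.

```python
from typing import Dict, List, Tuple

# (current stage, outcome) -> next stage, following the flowchart edges.
# "ok" is a hypothetical label for the unlabeled forward edges.
TRANSITIONS: Dict[Tuple[str, str], str] = {
    ("intake", "ok"): "planning",
    ("planning", "ok"): "scheduler",
    ("scheduler", "ok"): "executor",
    ("executor", "ok"): "review",
    ("review", "pass"): "scheduler",  # more milestones remain
    ("review", "fail"): "planning",   # replan after a failed review
    ("review", "done"): "finalize",   # nothing left to schedule
}


def next_stage(stage: str, outcome: str) -> str:
    """Look up the next stage; reject edges the diagram does not contain."""
    try:
        return TRANSITIONS[(stage, outcome)]
    except KeyError:
        raise ValueError(f"no transition from {stage!r} on {outcome!r}") from None


def walk(outcomes: List[str], start: str = "intake") -> List[str]:
    """Follow a sequence of outcomes and return the stages visited."""
    path = [start]
    for outcome in outcomes:
        path.append(next_stage(path[-1], outcome))
    return path


if __name__ == "__main__":
    # One passing review loop, then a final review that ends the run.
    print(walk(["ok", "ok", "ok", "ok", "pass", "ok", "ok", "done"]))
```

Keeping the transitions in a plain table makes the two review loops (back to Scheduler on pass, back to Planning on fail) easy to see at a glance.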
| Stage | Role |
|---|---|
| Intake | The only human-facing stage. Surfaces decisions, fills gaps, freezes spec. ≤15 Q&A budget. |
| Planning | Turns spec into milestones, module boundaries, API contracts. May spawn research. |
| Scheduler | Reads plan + progress; decides next batch's parallel/serial scope. Stateless. |
| Executor ×N | Parallel workers per Scheduler. Write code + spec updates. No inter-agent visibility. |
| Review | Review-Planner picks domain playbooks → N reviewers fan out → Merger consolidates. Always commits. |
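The plan, fan-out, merge shape of the Review stage can be sketched in a few lines. Everything concrete here is a stand-in chosen for illustration: the `naming` and `testing` playbooks, the toy rules they apply, and the `Finding` record are hypothetical and do not describe Robin's real reviewers.

```python
from concurrent.futures import ThreadPoolExecutor
from dataclasses import dataclass
from typing import Dict, List


@dataclass(frozen=True)
class Finding:
    playbook: str
    passed: bool
    note: str


def plan_review(changed_files: List[str]) -> List[str]:
    """Plan step: pick playbooks based on what changed (toy rule)."""
    playbooks = ["naming"]
    if any(path.endswith(".py") for path in changed_files):
        playbooks.append("testing")
    return playbooks


def run_reviewer(playbook: str, changed_files: List[str]) -> Finding:
    """One reviewer: applies a single playbook, sees no other reviewer's output."""
    if playbook == "naming":
        ok = all(path == path.lower() for path in changed_files)
        note = "file names are lowercase" if ok else "mixed-case file name found"
        return Finding(playbook, ok, note)
    if playbook == "testing":
        ok = any(path.startswith("tests/") for path in changed_files)
        note = "tests changed alongside code" if ok else "no test files in change set"
        return Finding(playbook, ok, note)
    raise ValueError(f"unknown playbook: {playbook}")


def merge_findings(findings: List[Finding]) -> Dict[str, object]:
    """Merge step: a single failing playbook fails the whole review."""
    failures = [f for f in findings if not f.passed]
    return {
        "verdict": "fail" if failures else "pass",
        "notes": [f"{f.playbook}: {f.note}" for f in failures],
    }


def review(changed_files: List[str]) -> Dict[str, object]:
    playbooks = plan_review(changed_files)
    # Fan out: one worker per playbook; map() keeps results in playbook order.
    with ThreadPoolExecutor(max_workers=len(playbooks)) as pool:
        findings = list(pool.map(lambda p: run_reviewer(p, changed_files), playbooks))
    return merge_findings(findings)


if __name__ == "__main__":
    print(review(["app/main.py", "tests/test_main.py"]))
    print(review(["app/Main.py"]))
```

Each reviewer receives only its playbook and the change set, which mirrors the isolation the table describes for parallel workers.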
Runtime state lives in .ai-robin/: ledger.jsonl (append-only audit), dispatch/inbox/ (signal files between agents), stage-state.json (current stage), META/ (Feature Room on disk).
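An append-only JSONL ledger is simple to model: each event is one JSON object on its own line, and writers only ever open the file in append mode. The sketch below assumes that layout; the helper names and the `ts` and `event` fields are hypothetical, since the README does not specify the entry schema.

```python
import json
import tempfile
import time
from pathlib import Path
from typing import Any, Dict, List


def append_ledger_entry(robin_dir: Path, entry: Dict[str, Any]) -> None:
    """Append one entry as a single JSON line; earlier lines are never rewritten."""
    robin_dir.mkdir(parents=True, exist_ok=True)
    # "ts" is a hypothetical timestamp field added for this sketch.
    line = json.dumps({"ts": time.time(), **entry}, sort_keys=True)
    with (robin_dir / "ledger.jsonl").open("a", encoding="utf-8") as handle:
        handle.write(line + "\n")


def read_ledger(robin_dir: Path) -> List[Dict[str, Any]]:
    """Return all entries in the order they were written."""
    path = robin_dir / "ledger.jsonl"
    if not path.exists():
        return []
    lines = path.read_text(encoding="utf-8").splitlines()
    return [json.loads(line) for line in lines if line.strip()]


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        robin_dir = Path(tmp) / ".ai-robin"
        append_ledger_entry(robin_dir, {"event": "intake_complete"})
        append_ledger_entry(robin_dir, {"event": "review_merged", "milestone": "m1"})
        for entry in read_ledger(robin_dir):
            print(entry["event"])
```

Because entries are only appended, a status command can read the ledger at any time without coordinating with the agents that write to it.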
| Cluster | Skills | Role |
|---|---|---|
| Kernel | robin-kernel | Main dispatch loop. Routes signals. Never reads domain content. |
| Stages | robin-intake, robin-planner, robin-scheduler, robin-executor | Pipeline stages. |
| Support | robin-researcher | Answers Planning's factual questions. |
| Review | robin-review-planner, robin-reviewer, robin-merger | Plan-fan-out-merge for domain-specific checks. |
| Relief | robin-committer, robin-degrader, robin-finalizer | Git ops, degradation narratives, delivery summaries — domain work the kernel can't do itself. |
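One way to picture "routes signals, never reads domain content" is a lookup keyed on the signal type alone. In the sketch below, `review_merged`, `degradation_spec_written`, and `all_complete` are the signal names mentioned in the agent descriptions; `scope_degraded`, the `payload_path` field, and the `route` function are hypothetical and only serve the example.

```python
from typing import Dict, Tuple

# Signal type -> skill that handles it. Only the type is inspected.
ROUTES: Dict[str, str] = {
    "review_merged": "robin-committer",
    "degradation_spec_written": "robin-committer",
    "all_complete": "robin-finalizer",
    "scope_degraded": "robin-degrader",  # hypothetical signal name
}


def route(signal: Dict[str, str]) -> Tuple[str, str]:
    """Pick the next skill from the signal type.

    The payload is passed along as an opaque path; the kernel in this
    sketch never opens it, so it never sees domain content.
    """
    signal_type = signal["type"]
    if signal_type not in ROUTES:
        raise ValueError(f"unroutable signal type: {signal_type!r}")
    return ROUTES[signal_type], signal["payload_path"]


if __name__ == "__main__":
    signal = {
        "type": "review_merged",
        "payload_path": ".ai-robin/dispatch/inbox/example-signal.json",  # hypothetical file
    }
    print(route(signal))
```

Handing the payload on by path rather than by value is what lets the dispatcher stay ignorant of specs, code, and commit messages while the relief skills do the domain work.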
npx claudepluginhub waynewangyuxuan/robin --plugin robin