By iwo-szapar
Autonomous coding pipeline for Claude Code with guided onboarding dashboard: minimum-diff -> onboard -> track -> plan -> work -> test -> review -> compound -> ship. Multi-agent planning, proof contracts, and binary review gates.
Code writer specialist for Yalla Coding Team. Writes implementation code following approved plans. Does NOT write tests or review code.
Orchestrates the Yalla Coding Team across all phases — planning, build/test separation, review, and shipping. Coordinates specialist teammates through phase-specific teams with binary quality gates.
Adversarial code reviewer for Yalla Coding Team. Reviews code in fresh context with no anchoring bias. Uses binary pass/fail per check.
Test writer and runner for Yalla Coding Team. Writes tests, runs test suite, validates behavior. Does NOT write implementation code.
Guided Yalla project onboarding. Use this skill whenever the user says /onboard, "set up yalla", "onboard this repo to yalla", "configure yalla", "show onboarding status", "what is missing for yalla", or wants a visual dashboard of Yalla setup. It checks .claude/YALLA.md, project commands, task labels, issue template, autopilot readiness, and generates an HTML dashboard showing done vs missing steps.
Product-intent gate for non-trivial product, GTM, pricing, onboarding, user/admin journey, access, delivery, generated artifact, or workflow changes. Use before /yalla planning or inside strategy-to-build handoffs when the work needs outcome framing, load-bearing assumptions, cheapest validation, MVP scope, kill criteria, or intended-vs-implemented review. Do NOT use for tiny bugfixes, isolated tests, dependency bumps, mechanical refactors, or docs edits that do not define future product behavior.
Post-mortem diagnostic on a completed /yalla or /yalla-team run. Checks issue-based artifacts, PR evidence, review quality, and shipping discipline. Use after a run produced mediocre results, after a production incident traced to a yalla-shipped change, or periodically to improve the pipeline. Do NOT use during an active run.
4-agent adversarial planning with red-team. Produces a grounded plan with concrete approach, files affected, acceptance criteria, and addressed risks. Use standalone for planning without full pipeline, or invoked by /yalla and /yalla-team. Do NOT use when the task is trivial (plan directly in /yalla).
Adversarial code review with binary pass/fail checks. Each reviewer answers one specific question about the code. Any Fail blocks shipping. Use standalone for reviewing changes, or invoked by /yalla and /yalla-team. Do NOT use when changes haven't been tested yet (run tests first).
Uses power tools
Uses Bash, Write, or Edit tools
Give Claude Code a one-line task. Get back a tested, reviewed pull request.
Yalla is an autonomous coding pipeline for Claude Code. It turns a description into a planned, built, tested, reviewed, and shipped PR — using specialized agents and adaptive ceremony, grounded in a project knowledge base (your gotchas, risk checks, and architecture) and held to a Proof Contract that a built-in eval harness grades. A run only ships when evidence artifacts say it's proven. GitHub Issues are the task store; no database or external service required.
/yalla add rate limiting to the public API
                  ┌──────────────────── KNOWLEDGE BASE ────────────────────┐
                  │ gotchas · minimum diff · risk gates · review checks ·  │  grounds
                  │ architecture · test seams · task classification        │  every phase
                  └────────────────────────────┬───────────────────────────┘
                                               ▼
    minimum-diff ─▶ classify ─▶ track ─▶ plan ─▶ work ─▶ test ─▶ review ─▶ compound ─▶ ship
                       │                  │       │       │        │          │         │
                    pick            adversarial build    write & binary   learnings  PR only
                    ceremony        + diagnosis vertical run     pass/    fed back   if verdict
                    + gates         plan        slices   tests   fail     into the   is PROVEN
                                                                 gates    knowledge
                                               ▲
                  ┌────────────────────────────┴───────────────────────────┐
                  │ PROOF CONTRACT → evidence artifacts (.pipeline/*)      │  grades
                  │ verdict: PROVEN / NOT_PROVEN / INCONCLUSIVE            │  every run
                  │ graded by the eval harness (npm run eval:yalla:smoke)  │
                  └────────────────────────────────────────────────────────┘
Two things wrap the linear pipeline and make it more than a prompt: the knowledge base feeds project-specific constraints into every phase, and compound routes each run's learnings back into it — so the pipeline gets sharper the more you run it. The Proof Contract + eval harness sit underneath, turning "looks done" into a graded, artifact-backed verdict.
| Phase | What happens |
|---|---|
| 0 · Minimum Diff + Classify | Runs the minimum-diff ladder, then picks a task_type, risk tier, ceremony mode, evidence mode, and gates. A no-build/docs/config answer, one-line fix, and payment-flow change get different ceremony. |
| 1 · Track | Creates (or resumes) a GitHub issue and a worktree branch. |
| 2 · Plan | Researches the codebase, designs an approach, and adversarially challenges it. Bugs run a diagnosis gate first. You approve before any code is written. |
| 3 · Work | Builds in vertical slices — each one a thin, demoable end-to-end behavior — writing a failing behavior test at the highest correct seam before the implementation that passes it. |
| 4 · Test | Runs the suite until green, verifies every acceptance criterion maps to evidence, and records falsifiable verification (VERIFIED / NOT VERIFIED / INCONCLUSIVE). |
| 5 · Review | Independent reviewers each answer one binary question (security? complexity? correctness?). Any Fail blocks the ship. The author never reviews their own code. |
| 6 · Compound | Captures actionable learnings to their smallest lasting home so the same mistake isn't repeated. |
| 7 · Ship | Writes .pipeline/outcome-evaluation.json, commits specific files, and opens a PR — PR-only by default; never auto-merges unless you asked in this run. |
The core idea: keep the universal pipeline small, and activate risk-specific gates only when the diff touches that subsystem. A docs typo doesn't get dragged through payment, migration, and auth review. A change to your billing code does.
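The gate-activation idea can be sketched in a few lines of Python. This is only an illustration: the `RISK_GATES` table, its glob patterns, and the `active_gates` helper are hypothetical names invented here, not Yalla's actual configuration or API.

```python
from fnmatch import fnmatch

# Hypothetical gate table: gate names and glob patterns are illustrative only.
RISK_GATES = {
    "payment": ["src/billing/*", "src/payments/*"],
    "migration": ["db/migrations/*"],
    "auth": ["src/auth/*"],
}


def active_gates(changed_files):
    """Return the sorted names of gates whose patterns match any changed file."""
    return sorted(
        gate
        for gate, patterns in RISK_GATES.items()
        if any(fnmatch(path, pat) for path in changed_files for pat in patterns)
    )


# A docs-only diff activates nothing; a billing change activates the payment gate.
print(active_gates(["docs/README.md"]))
print(active_gates(["src/billing/invoice.py", "docs/README.md"]))
```

Under these assumed patterns, the first call prints an empty list and the second prints only the payment gate, which mirrors the docs-typo versus billing-change contrast above.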
A run is "done" only when its verdict is PROVEN — and PROVEN is backed by artifacts, not prose. Before shipping, Yalla writes .pipeline/outcome-evaluation.json with a verdict of exactly one of:
- PROVEN: every acceptance criterion is covered by valid evidence (a passing test, a static check, a browser/API probe, a smoke run), all required review checks pass, and no remaining delta exists. Only PROVEN may be called done, complete, or ready to merge.
- NOT_PROVEN: evidence or review shows the promise isn't satisfied. An honest outcome, not a failure to hide.
- INCONCLUSIVE: local proof is blocked or external evidence is unavailable. Can still open a PR, but the PR clearly says human review or external evidence is needed.

Missing evidence never becomes PROVEN. Deterministic proof is preferred: Yalla won't lean on a model judge when a concrete test or check can verify the behavior. This is what stops "looks done" from masquerading as "is done."
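As a rough sketch of how the three verdicts relate, the rules above could be expressed as a small function. Everything here is an assumption made for illustration: the `Criterion` type, the `"pass"` / `"fail"` / `"missing"` status strings, and the `derive_verdict` helper are hypothetical and do not describe the real schema of `.pipeline/outcome-evaluation.json`. Treating missing evidence as INCONCLUSIVE is also an assumption; the source only states that it never becomes PROVEN.

```python
from dataclasses import dataclass


@dataclass
class Criterion:
    name: str
    evidence: str  # hypothetical status values: "pass", "fail", or "missing"


def derive_verdict(criteria, review_checks, proof_blocked=False):
    """Map evidence and review results to one verdict string.

    review_checks maps a check name to True (Pass) or False (Fail).
    """
    # Failing evidence or any failed review check means the promise is not met.
    if any(c.evidence == "fail" for c in criteria) or not all(review_checks.values()):
        return "NOT_PROVEN"
    # Assumption: blocked proof, missing evidence, or no criteria at all
    # can never be upgraded to PROVEN.
    if proof_blocked or not criteria or any(c.evidence == "missing" for c in criteria):
        return "INCONCLUSIVE"
    return "PROVEN"


criteria = [Criterion("returns 429 when over the limit", "pass")]
print(derive_verdict(criteria, {"security": True, "correctness": True}))
```

The ordering is deliberate: a known failure takes precedence over missing evidence, so a run with both is reported as NOT_PROVEN rather than INCONCLUSIVE.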
npx claudepluginhub iwo-szapar/yalla --plugin yalla