By WatchAndyTW
Delegates token-heavy, self-contained tasks to local CLI backends chosen by task size and type, with credit-exhaustion fallback through the backend chain to native Claude and a glanceable statusline HUD. Routing and shipped backend behavior are configured in config/roster.json; adding a new backend kind requires invoker and health support in code. /team runs a decompose → dispatch → verify → fix → synthesize pipeline; an optional config-gated UserPromptSubmit hook can proactively nudge delegation.
Uses power tools
Uses Bash, Write, or Edit tools
Own this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimOwn this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimBased on adoption, maintenance, documentation, and repository signals. Not a security audit or endorsement.
Run a question through the multi-model Fusion pipeline — fan it out to a configurable panel of models in parallel (default panel is the user-configurable set from `reasoning.panel` in roster.json, defaulting to Opus+Sonnet+Gemini; override per-invocation with a spec like "2:gemini,opus,codex"), have a judge compare their answers into structured analysis (consensus / contradictions / unique insights / blind spots), then synthesize one unified answer that is better than any single model's.
Dry-run the multi-model-team router on a task and print the decision JSON + score breakdown. No model call — pure routing logic, for tuning.
Run a task through the multi-model team pipeline — decompose into backend-assigned subtasks (commodity → parallel agy/Gemini, judgment/hard-line → native Claude), dispatch dependency-aware, verify each result, fix failures in a bounded loop, then synthesize. Optional caps like "5:gemini,2:claude".
Set up multi-model-team — create your personal roster at ~/.claude/mmt-roster.json (seeded from the shipped default) so your tuning survives plugin updates, and print the next steps (roster precedence + statusline HUD).
Backend-agnostic dispatcher for STANDARD, VERIFIABLE coding and Gemini's edges where the result is compact or easy to verify: new React/UI components, CSS, SVG/animation, scaffolding, CRUD, REST endpoints, scripts, CLI tools, glue code, SQL, regex, configs, Dockerfiles, unit tests, fixtures, data transforms — plus web-search/doc summarization and bulk text ingestion. Explicitly NOT for reverse-engineering, IL2CPP/protobuf-RE, disassembly, FFI/unsafe, binary, injection/hooking, concurrency, protocol/KCP design, proc-macros, or anything systems-hard (those are Opus-only and never offloaded).
Delegate code review, test-writing, and verification to the OpenAI Codex CLI (codex exec): review a diff or file for correctness, bugs, edge cases, and regressions; write or extend a unit/integration test suite; or verify that an implementation meets its spec and the tests actually pass. Best for focused, self-contained review/test/verify units where the output is checkable. Use when you specifically want Codex on review/test/verify work rather than the Gemini backend or native Claude. Not for reverse-engineering, IL2CPP/protobuf-RE, injection, or other systems-hard work (those stay native Opus).
Let Claude Code delegate the grunt work to Gemini & Codex — and keep the hard thinking for itself.
Multi-model orchestration for Claude Code. Route by task, fan out in parallel, fall back gracefully.
A Claude Code plugin that offloads token-heavy, self-contained tasks to local pre-authed CLI
backends — agy (Gemini) and codex (OpenAI Codex CLI) — picking the backend and model by
task size and type, with credit-exhaustion fallback through the chain to native Claude, and a
glanceable statusline HUD.
The core idea:
Offload commodity work (UI/components, scaffolding, CRUD, scripts, SQL, configs, unit tests, web research, bulk summarization) to a fast/cheap CLI — keep judgment-heavy and systems-hard work (reverse-engineering, FFI/unsafe, injection, concurrency, protocol design) on Claude. Every routing decision is config-driven; tune it without touching code.
agy, codex, and native Claude are equal, configurable tools. /team decomposes a task and
assigns each subtask to its best-fit backend; /reasoning fans one question across a panel of all
three and fuses the answers.
1 · Install the backends (one-time, pre-auth each)
npm install -g node-pty # the one native dep — gives agy a pseudo-terminal (see note below)
npm install -g @openai/codex # then: codex login
# Windows Powershell
irm https://antigravity.google/cli/install.ps1 | iex # then: agy login
# macOS / Linux
curl -fsSL https://antigravity.google/cli/install.sh | bash # then: agy login
2 · Add the plugin. This repo is the plugin — point Claude Code at it as a local plugin (local
marketplace or --plugin-dir). On enable, Claude Code auto-discovers commands/, agents/, and
hooks/hooks.json. Nothing else to wire up.
3 · (Optional) Turn on the HUD. Add a statusLine to your own ~/.claude/settings.json
(the plugin can't register one for you) — see Statusline HUD.
4 · Use it.
/reasoning 2:gemini,opus,codex What's the best caching strategy for a read-heavy API?
/team 3:gemini,1:codex Build a REST CRUD service with tests
/route-test Write a SQL query to list users by signup date ← dry-run, no call
…or just work normally and let Claude reach for the agy / codex agents on its own.
| Command | What it does |
|---|---|
/reasoning [panel] <question> | Fusion pipeline. Fan one question across a panel of models in parallel → a judge compares them (consensus / contradictions / unique insights / blind spots) → synthesize one unified answer better than any single model's. |
/team [N:gemini,M:claude,X:codex] <task> | Team pipeline. Decompose → dispatch each subtask to its best-fit backend (dependency-aware waves) → verify each result → bounded fix loop → synthesize. |
/route-test <task> | Dry-run the router: prints {backend, model, tier}, detected types, matched rule. No backend call — a tuning tool. |
Both /team and /reasoning have two engines: an Ultracode deterministic Workflow path
(preferred, when the Workflow tool is available) and a parallel Task-agent fallback. Either way the
work runs across parallel agents — never one inline session.
| Agent | Use for | Backend |
|---|---|---|
agy | Standard, verifiable coding + Gemini's edges (compact/checkable results) | agy |
codex | Code review, test-writing, verification | codex |
There is intentionally no RE/injection agent — that work stays native by default. An explicit agent spawn is honored as-is (forces that backend; the router's hard line won't bounce it).
src/bin/route.mjs scores the task (char count + keyword types from config/tags.txt), then matches
routes rules in the roster (first match wins; order encodes priority). src/bin/run.mjs runs the
chosen backend with a fallback chain, writes HUD state, and cleans output.
npx claudepluginhub watchandytw/multi-model-team --plugin multi-model-teamMulti-model consensus engine integrating OpenAI Codex CLI, Gemini CLI, and Claude CLI for collaborative code review and problem-solving.
Ultra-compressed communication mode. Cuts ~75% of tokens while keeping full technical accuracy by speaking like a caveman.
Comprehensive UI/UX design plugin for mobile (iOS, Android, React Native) and web applications with design systems, accessibility, and modern patterns
Unified status line for Claude Code with multi-CLI (Claude, Codex, Gemini, z.ai) usage monitoring, context, rate limits, and cost tracking
Curate auto-memory, promote learnings to CLAUDE.md and rules, extract proven patterns into reusable skills.