Multi-model task delegation with quota-aware routing across AI CLIs (Claude, Codex, Gemini, Cursor Agent)
A collection of Claude Code plugins for working with AI coding agents.
# Add the marketplace
claude plugin marketplace add PocketConservatory/plugins
# Install a plugin (e.g. subtask)
claude plugin install subtask@tonic --scope project
Restart Claude Code after installing.
Route coding tasks across Claude Code, OpenAI Codex, Google Gemini CLI, and Cursor Agent based on task type and remaining quota. Never hit a quota wall on a single provider again.
Every AI coding CLI has usage limits. When you hit one, you're stuck waiting for it to reset. subtask automatically routes tasks to whichever provider has the most remaining quota, based on what the task actually needs:
| Task type | Routed to | Why |
|---|---|---|
| Code review | Codex | Separate review quota pool |
| Implementation | Codex or Claude | Whichever has more quota |
| Research / analysis | Gemini | Large context window, separate quota |
| MCP-dependent (Linear, Notion, etc.) | Claude (host) | Only Claude has MCP servers |
When your preferred provider runs out, subtask automatically falls back to the next best option — including Cursor Agent as a last resort (separate quota pool entirely).
npm i -g @openai/codexnpm i -g @google/gemini-clinpx cclimits)claude plugin marketplace add PocketConservatory/plugins
claude plugin install subtask@tonic --scope project
Restart Claude Code. The routing strategy and quota hooks activate automatically.
subtask uses two hooks to manage quota without you doing anything:
/subtask:delegate instead, listing available providersUse /subtask:delegate to send a task to the optimal provider:
/subtask:delegate Research best practices for caching in Firebase
subtask will automatically pick Gemini (read-only research task, large context window, preserves Claude quota).
You can also force a specific provider:
/subtask:delegate Review the auth changes using codex
/subtask:delegate Use gemini to analyze the performance impact
/subtask:delegate Implement retry logic with codex
Priority: Claude → Codex → Gemini. Claude is the primary model. External CLIs are overflow buckets when Claude is under pressure.
The PreToolUse hook computes pressure levels from remaining quota and reset times. When Claude is HIGH or CRITICAL and alternative providers are available, it blocks Task calls with a directive to delegate. Code reviews always go to Codex (separate quota pool).
| Skill | Description |
|---|---|
/subtask:delegate <task> | Route a task to the optimal AI CLI based on type and quota |
MIT
Own this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimOwn this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimBased on adoption, maintenance, documentation, and repository signals. Not a security audit or endorsement.
npx claudepluginhub pocketconservatory/plugins --plugin subtaskDelegate plan execution to Codex CLI via ASP. Part of cc-multi-cli-plugin. Requires the `multi` plugin.
Sub-agent runner — runs agent definitions on Codex, Claude Code, Cursor CLI, or Gemini CLI from any host AI tool.
Intelligent delegation framework for routing tasks to external LLM services while retaining strategic oversight
Use Qwen Code from Claude Code to delegate investigation, implementation, and review tasks.
Intelligent model routing for Claude Code - routes queries to optimal Claude model (Haiku/Sonnet/Opus) based on complexity, with persistent knowledge system, context forking, and multi-turn awareness
Intelligent orchestration platform for AI coding tools — routes tasks to the best model, learns from outcomes, and enforces quality through multi-model consensus. 46 MCP tools for agent management, research, memory, consensus voting, codebase intelligence, and a full dev pipeline.