Skill

model-routing

Classifies tasks by cognitive tier (sm0l/ch0nky/frontier) and routes each to the best available model from the b00t config, preferring local or cheap models for mechanical work and frontier models for reasoning. Checks RAM/GPU resources before returning a model.

Anthropic

ai-ml

developer-tools

Invocation

How this skill is triggered — by the user, by Claude, or both

Slash command

/next-task:model-routing

User invocable

Model invocable

Inline context

Default effort

Context Preview

The summary Claude sees in its skill listing — used to decide when to auto-load this skill

NEVER use frontier model for mechanical work. Route by cognitive tier.

SKILL.md

78 lines · ~751 tokens

Stats

Language: Rust

Parent stars: 12

Maintenance: Fair

Last Commit: Mar 15, 2026

Model Routing

NEVER use a frontier model for mechanical work. Route by cognitive tier. The routing table is read from _b00t_/model-routing.tomllm, with a fallback to hardcoded tiers.

Cognitive Tiers

| Tier | Tasks | b00t Models (in priority order) |
|---|---|---|
| `small` (sm0l) | grep, lint, classify, route, test pass/fail | haiku, local sm0l |
| `chunky` (ch0nky) | implement, refactor, debug, code review | qwen3-coder-local (RTX 3090), sonnet |
| `frontier` | architecture, security, novel design, planning | opus, sonnet |

Steps

1. Classify the task type. # output: task_type
2. Map task_type → cognitive tier. # output: tier (sm0l|ch0nky|frontier)
3. Load the routing config with b00t learn model-routing, via MCP or CLI. NEVER read .tomllm directly. # output: available_models[]
4. Select the best available model for the tier (prefer local, fall back to frontier). # output: selected_model
5. Check the resource gate with b00t hive status to ensure RAM/GPU is available. # output: resource_ok
6. If the resource gate fails, escalate one tier up or queue. # output: model_or_queue
7. Return {model, tier, rationale} for the caller to invoke.
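
The selection and escalation steps above can be sketched in Python as follows. This is a minimal illustration, not the skill's actual implementation: the `Route` dataclass, the `select_model` function, and the `FALLBACK_TIERS` dictionary are hypothetical names, the tier contents are copied from the Cognitive Tiers table, and `available_models` and `resource_ok` are assumed to have been obtained already from `b00t learn model-routing` and `b00t hive status`.

```python
from dataclasses import dataclass
from typing import Optional

# Hypothetical hardcoded fallback, copied from the Cognitive Tiers table
# (models listed in priority order).
FALLBACK_TIERS = {
    "sm0l": ["haiku", "local sm0l"],
    "ch0nky": ["qwen3-coder-local", "sonnet"],
    "frontier": ["opus", "sonnet"],
}
TIER_ORDER = ["sm0l", "ch0nky", "frontier"]


@dataclass
class Route:
    model: Optional[str]  # None means "queue the task"
    tier: str
    rationale: str


def select_model(tier, available_models, resource_ok, table=FALLBACK_TIERS):
    """Pick the first available model for the tier; escalate or queue if the gate fails."""
    reason = "resource gate ok"
    if not resource_ok:
        idx = TIER_ORDER.index(tier)
        if idx + 1 >= len(TIER_ORDER):
            # Nothing above the top tier: the only remaining option is to queue.
            return Route(None, tier, "resource gate failed at top tier, queued")
        tier = TIER_ORDER[idx + 1]
        reason = "resource gate failed, escalated one tier"
    for model in table[tier]:
        if model in available_models:
            return Route(model, tier, reason)
    return Route(None, tier, "no model available for tier, queued")
```

For example, `select_model("ch0nky", {"sonnet", "opus"}, True)` skips the unavailable local model and returns a route to `sonnet`, while the same call with `resource_ok=False` escalates to the frontier tier.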

Task → Tier Mapping

small (sm0l), Haiku / local 3B:

- Running tests, checking lint output
- Classifying/routing messages
- Extracting structured data from well-defined input
- File diffing, counting, summarizing short text
- Executing known shell commands

chunky (ch0nky), qwen3-coder-local → Sonnet fallback:

- Writing or refactoring code
- Debugging with stack traces
- Multi-file code review
- Translating between languages/formats
- Implementing skills from SKILL.md spec

frontier, Opus → Sonnet fallback:

- System architecture decisions
- Security threat modeling
- Novel algorithm design
- Planning complex multi-step workflows
- Evaluating ambiguous requirements
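
The mapping above could be expressed as a simple lookup. This is a sketch under stated assumptions: the short task-type labels and the `tier_for` function are hypothetical, and sending unknown task types to `frontier` is an illustrative choice that the skill does not specify.

```python
# Hypothetical task-type labels, one per bullet in the lists above.
TASK_TIERS = {
    "sm0l": {"test", "lint", "classify", "route", "extract",
             "diff", "count", "summarize", "shell"},
    "ch0nky": {"implement", "refactor", "debug", "review", "translate"},
    "frontier": {"architecture", "security", "algorithm",
                 "planning", "requirements"},
}


def tier_for(task_type: str) -> str:
    """Return the cognitive tier for a task type."""
    for tier, tasks in TASK_TIERS.items():
        if task_type in tasks:
            return tier
    # Assumption: an unrecognized task is treated as ambiguous, so it
    # goes to the most capable tier rather than being guessed at.
    return "frontier"
```

A real classifier would likely need more than exact label matching, but a table like this keeps the mapping auditable and easy to override.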

Output Contract to Executive Context

Executive context is costly. Sub-agents MUST return compressed summaries:

| Tier | Max output to executive |
|---|---|
| `sm0l` | `PASS` or `FAIL: <name> <5 lines>` |
| `ch0nky` | diff + test result (no full file dumps) |
| `frontier` | structured decision with rationale |
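
As an illustration of the `sm0l` row, a sub-agent could compress its result before returning it. This is a hedged sketch: `compress_sm0l` is a hypothetical helper, `<5 lines>` is read here as "at most five lines of detail", and keeping the first non-blank lines (rather than the last) is an assumption.

```python
def compress_sm0l(passed: bool, name: str = "", output: str = "",
                  max_lines: int = 5) -> str:
    """Reduce a sm0l-tier result to PASS or a short FAIL summary."""
    if passed:
        return "PASS"
    # Keep only the first few non-blank lines of the raw output
    # (assumed reading of "<5 lines>" in the contract table).
    lines = [ln for ln in output.splitlines() if ln.strip()][:max_lines]
    return "\n".join([f"FAIL: {name}"] + lines)
```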

Resource Awareness

Before invoking a ch0nky or frontier model, check the hive:

b00t hive status  # output: RAM free, GPU VRAM free, active profile

Anti-pattern: running vLLM (qwen3-coder, 20GB VRAM) and a HuggingFace download simultaneously on a 24GB GPU.
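
The resource gate amounts to a budget check. The sketch below assumes the free-VRAM figure has already been read from `b00t hive status` (its output format is not shown here, so no parsing is attempted); `fits_in_vram` and the `headroom_gb` margin are hypothetical, and the 8GB footprint of the second job is a made-up number for illustration.

```python
def fits_in_vram(required_gb: float, free_gb: float,
                 headroom_gb: float = 1.0) -> bool:
    """True if a job fits in free VRAM with a safety margin left over."""
    return required_gb + headroom_gb <= free_gb


# On a 24GB card, the 20GB vLLM model fits when the GPU is idle...
idle_ok = fits_in_vram(required_gb=20, free_gb=24)

# ...but once it is loaded only 4GB remain, so a second job that needs
# 8GB (illustrative figure) should be queued instead of started.
second_job_ok = fits_in_vram(required_gb=8, free_gb=24 - 20)
```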

Integration

Used by /next-task at each phase to select a model, and by b00t-mcp agent delegation. Load via: b00t learn model-routing (MCP preferred, CLI fallback).

🤓 NEVER read b00t/*.tomllm directly — always use b00t learn/MCP which applies guardrails, guru enrichment & tribal knowledge
