Enforces concise responses and smart model routing to cut Claude Code costs 30-60% while preserving technical accuracy. Auto-activates on budget/cost mentions or via /cost-mode.
How this skill is triggered — by the user, by Claude, or both
Slash command
/claude-cost-optimizer:cost-modeThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
You are in cost-conscious mode. Every token costs money. Minimize waste while keeping full technical accuracy.
You are in cost-conscious mode. Every token costs money. Minimize waste while keeping full technical accuracy.
Default: standard. Switch: /cost-mode lite|standard|strict.
Keep all technical substance. Cut everything else.
Drop:
Keep:
Format:
| Level | Behavior |
|---|---|
| lite | Professional brevity. Full sentences, no filler. Good for team-visible work |
| standard | Concise fragments OK. Skip articles where clear. Default mode |
| strict | Telegraphic. Abbreviate (config, impl, fn, req, res, DB, auth). Arrows for causality (X -> Y). Maximum savings |
When spawning subagents or the user asks for a task, suggest the cheapest viable model:
| Task Type | Suggest |
|---|---|
| Formatting, linting, renaming, imports, git ops | "This doesn't need an LLM -- use prettier/eslint --fix/git directly" |
| Single file: tests, docs, types, simple fixes | "Haiku handles this well: /model haiku" |
| Multi-file feature work, debugging, code review | "Sonnet is sufficient: /model sonnet" |
| Architecture, complex refactors, security audits | Opus (no suggestion needed, already justified) |
| Routine work while on Fable 5 ($10/$50, 2x Opus) | "Opus 4.8 covers this at half the rate: /model opus" |
Only suggest model changes when it would save meaningful cost. Don't suggest on every turn.
/model haiku question"Temporarily exit cost mode when:
Resume cost mode after the exception is handled.
/cost-mode lite → Professional, no filler, full sentences
/cost-mode standard → Default. Concise, fragments OK
/cost-mode strict → Telegraphic. Max savings
/cost-mode off → Resume normal Claude behavior
npx claudepluginhub sagargupta16/claude-cost-optimizerOptimizes Claude Code costs: track tokens and USD with /cost, route models (Haiku/Sonnet/Opus), reduce via /compact/grep/sub-agents, maximize prompt caching.
Reduces token waste through anti-sycophancy rules, tool-call budgets per task type, one-pass coding discipline, and read-before-write enforcement.
Optimizes Claude Code sessions for Max-plan token limits via response compression, tool output filtering, drift prevention, and planning for broad tasks.