From pro-workflow
Reduces token waste through anti-sycophancy rules, tool-call budgets per task type, one-pass coding discipline, and read-before-write enforcement.
Slash command: /pro-workflow:token-efficiency
Reduce output token waste and prevent iteration cycles that consume context.
Use when:
These patterns waste 30-60% of output tokens:
| Pattern | Example | Fix |
|---|---|---|
| Sycophantic opener | "Sure! Great question!" | Delete. Lead with answer. |
| Prompt restatement | "You're asking about X..." | Delete. Answer directly. |
| Closing fluff | "Let me know if you need anything!" | Delete. Stop after the answer. |
| Unsolicited suggestions | "You might also want to..." | Delete unless asked. |
| AI disclaimers | "As an AI model..." | Delete entirely. |
| Verbose preambles | "I'll help you with that..." | Delete. Start with the action. |
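One way to check a draft response against the patterns in the table is a small line-based lint. The sketch below is illustrative only: the regular expressions are assumptions derived from the example phrases above, and `FILLER_PATTERNS` and `find_filler` are hypothetical names, not part of pro-workflow.

```python
import re

# Hypothetical patterns, loosely based on the example phrases in the table.
# A real phrase list would need tuning against actual output.
FILLER_PATTERNS = {
    "sycophantic opener": re.compile(r"^(sure|great question|absolutely|of course)\b", re.IGNORECASE),
    "prompt restatement": re.compile(r"^you'?re asking (about|for)\b", re.IGNORECASE),
    "closing fluff": re.compile(r"\blet me know if you need\b", re.IGNORECASE),
    "unsolicited suggestion": re.compile(r"^you might also want to\b", re.IGNORECASE),
    "AI disclaimer": re.compile(r"^as an ai\b", re.IGNORECASE),
    "verbose preamble": re.compile(r"^i'?ll help you with\b", re.IGNORECASE),
}


def find_filler(text):
    """Return (line_number, label) pairs for lines that match a filler pattern."""
    hits = []
    for number, line in enumerate(text.splitlines(), start=1):
        stripped = line.strip()
        for label, pattern in FILLER_PATTERNS.items():
            if pattern.search(stripped):
                hits.append((number, label))
    return hits


draft = "Sure! Great question!\nUse a dict.\nLet me know if you need anything!"
for number, label in find_filler(draft):
    print(f"line {number}: {label}")
```

A lint like this only flags candidates; whether a flagged line should be deleted still depends on context, for example a suggestion the user explicitly asked for.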
Set explicit budgets by task complexity:
| Task Type | Tool-Call Budget | Wrap-Up At (calls) |
|---|---|---|
| Quick fix / lookup | 20 calls | 15 |
| Bug fix | 30 calls | 25 |
| Feature (small) | 50 calls | 40 |
| Feature (large) | 80 calls | 65 |
| Refactor | 50 calls | 40 |
| Exploration / research | 30 calls | 25 |
At the wrap-up threshold: commit progress, assess remaining work, decide whether to continue or start fresh.
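The budget table can be sketched as a small counter that reports when a session crosses the wrap-up threshold or runs out of budget. The numbers are copied from the table; the `ToolCallBudget` class, the task-type keys, and the status strings are hypothetical names chosen for illustration, not an API the skill provides.

```python
from dataclasses import dataclass

# (budget, wrap_up_at) pairs copied from the table above.
# The dictionary keys are hypothetical identifiers for each task type.
BUDGETS = {
    "quick_fix": (20, 15),
    "bug_fix": (30, 25),
    "feature_small": (50, 40),
    "feature_large": (80, 65),
    "refactor": (50, 40),
    "exploration": (30, 25),
}


@dataclass
class ToolCallBudget:
    task_type: str
    calls: int = 0

    def record_call(self):
        """Count one tool call and return the resulting status."""
        self.calls += 1
        return self.status()

    def status(self):
        """Return 'ok', 'wrap_up', or 'exhausted' for the current call count."""
        budget, wrap_up_at = BUDGETS[self.task_type]
        if self.calls >= budget:
            return "exhausted"
        if self.calls >= wrap_up_at:
            return "wrap_up"
        return "ok"


tracker = ToolCallBudget("bug_fix")
for _ in range(25):
    state = tracker.record_call()
print(tracker.calls, state)
```

On the first `wrap_up` status, the caller would commit progress and decide whether to continue or start a fresh session, as described above.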
For simple-to-medium tasks:
Switch profiles based on what you're doing:
Hard rules:
Use ASCII characters only in all output:
- `--` not `—` (em dash)
- `"` not `“ ”` (smart quotes)
- `'` not `‘ ’` (curly apostrophes)

This ensures clean copy-paste for code and compatibility with downstream systems.
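The ASCII rule can be applied mechanically as a post-processing step. This is a minimal sketch covering only the three substitutions listed above; `to_ascii_punctuation` and `non_ascii_chars` are hypothetical helper names, and any other non-ASCII character is reported rather than rewritten.

```python
# Typographic characters from the list above, mapped to ASCII replacements.
ASCII_REPLACEMENTS = {
    "\u2014": "--",  # em dash
    "\u201c": '"',   # left smart quote
    "\u201d": '"',   # right smart quote
    "\u2018": "'",   # left curly apostrophe
    "\u2019": "'",   # right curly apostrophe
}


def to_ascii_punctuation(text):
    """Replace em dashes, smart quotes, and curly apostrophes with ASCII."""
    for fancy, plain in ASCII_REPLACEMENTS.items():
        text = text.replace(fancy, plain)
    return text


def non_ascii_chars(text):
    """Return the sorted set of characters that are still outside ASCII."""
    return sorted({ch for ch in text if ord(ch) > 127})


sample = "\u201cit\u2019s done\u201d \u2014 ship it"
cleaned = to_ascii_punctuation(sample)
print(cleaned)
print(non_ascii_chars(cleaned))
```

Running `non_ascii_chars` after the replacement gives a quick check that nothing outside the three listed cases slipped through.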
Track these metrics to measure token savings:
Token efficiency patterns adapted from drona23/claude-token-efficient (MIT).
npx claudepluginhub rohitg00/pro-workflow --plugin pro-workflow