Skill

cost-aware-llm-pipeline

Optimizes LLM API costs for Claude/GPT calls via task-complexity model routing, immutable budget tracking, narrow transient-error retries, and prompt caching. For batch tasks with budget limits.

Python

Anthropic

OpenAI

ai-ml

Popularity

Stars

551

Forks

105

Invocation

How this skill is triggered — by the user, by Claude, or both

Slash command

/everything-claude-code:cost-aware-llm-pipeline

User invocable

Model invocable

Inline context

Default effort

Context Preview

The summary Claude sees in its skill listing — used to decide when to auto-load this skill

在保持质量的同时控制 LLM API 成本的模式。将模型路由 (Model Routing)、预算跟踪 (Budget Tracking)、重试逻辑 (Retry Logic) 和提示词缓存 (Prompt Caching) 组合成一个可复用的流水线。

SKILL.md

184 lines · ~1.1k tokens

Stats

LanguageJavaScript

Stars551

Forks105

MaintenanceExcellent

Last CommitMar 5, 2026

Actions

View Source View Plugin View on GitHub View README

模型	输入 ($/1M tokens)	输出 ($/1M tokens)	相对成本
Haiku 4.5	$0.80	$4.00	1x
Sonnet 4.6	$3.00	$15.00	~4x
Opus 4.5	$15.00	$75.00	~19x

模型	输入 ($/1M tokens)	输出 ($/1M tokens)	相对成本
Haiku 4.5	$0.80	$4.00	1x
Sonnet 4.6	$3.00	$15.00	~4x
Opus 4.5	$15.00	$75.00	~19x

cost-aware-llm-pipeline

Popularity

Invocation

Context Preview

SKILL.md

cost-aware-llm-pipeline

Popularity

Invocation

Context Preview

SKILL.md

成本感知型 LLM 流水线 (Cost-Aware LLM Pipeline)

何时启用

核心概念

1. 基于任务复杂度的模型路由 (Model Routing)

2. 不可变成本跟踪 (Immutable Cost Tracking)

3. 精细化重试逻辑 (Narrow Retry Logic)

4. 提示词缓存 (Prompt Caching)

组合使用

价格参考 (2025-2026)

最佳实践

应避免的反模式 (Anti-Patterns)

使用场景

Similar Skills

成本感知型 LLM 流水线 (Cost-Aware LLM Pipeline)

何时启用

核心概念

1. 基于任务复杂度的模型路由 (Model Routing)

2. 不可变成本跟踪 (Immutable Cost Tracking)

3. 精细化重试逻辑 (Narrow Retry Logic)

4. 提示词缓存 (Prompt Caching)

组合使用

价格参考 (2025-2026)

最佳实践

应避免的反模式 (Anti-Patterns)

使用场景

Similar Skills