From vanguard-frontier-agentic
Fetches live per-token, per-image, and per-GPU-hour prices for foundation models across Anthropic, OpenAI, Google, AWS, Azure, OCI, and Vertex AI. Supports single-model lookup and comparative tables.
How this skill is triggered — by the user, by Claude, or both
Slash command
/vanguard-frontier-agentic:fetch-foundation-model-pricingThis skill is limited to the following tools:
The summary Claude sees in its skill listing — used to decide when to auto-load this skill
Retrieve current public pricing for foundation models across the major AI/cloud providers and return structured, provenance-labeled output. Supports two modes:
Retrieve current public pricing for foundation models across the major AI/cloud providers and return structured, provenance-labeled output. Supports two modes:
Use this skill when:
live-price — fetched from a provider's public pricing page or API within this session; include source URL and ISO 8601 timestamp.documentation-based — sourced from official documentation when a live fetch was not possible; note the documentation URL and its visible publication date.assumed — derived from an analogous model or tier when no direct published price exists; state the assumption explicitly.excluded — pricing that exists but was intentionally omitted from the output; state why.2026-05-13T14:32Z).BilledCost (what the provider charges), EffectiveCost (after credits or discounts), ServiceCategory (AI and Machine Learning), ChargeCategory (Usage), SkuId (model ID + deployment tier), SkuPriceId (price dimension: input-token / output-token / cached-token / image / gpu-hour).| Dimension | Unit | Applies to |
|---|---|---|
| Input tokens | per 1M tokens | All text models |
| Output tokens | per 1M tokens | All text models |
| Cached input tokens | per 1M tokens | Models with context caching (Anthropic, Gemini, Bedrock) |
| Batch input tokens | per 1M tokens | Models with async batch mode (Anthropic, OpenAI, Bedrock) |
| Batch output tokens | per 1M tokens | Models with async batch mode |
| Images (input) | per image or per 1K images | Multimodal models |
| GPU-hour | per hour per GPU type | Self-hosted / dedicated endpoints (Vertex AI, Bedrock provisioned throughput, Azure PTU) |
| Provider | Deployment target | Reference |
|---|---|---|
| Anthropic | Direct API | references/providers.md |
| OpenAI | Direct API | references/providers.md |
| Vertex AI | references/providers.md | |
| AWS | Bedrock | references/providers.md |
| Azure | Azure OpenAI Service | references/providers.md |
| OCI | Generative AI Service | references/providers.md |
Return, at minimum:
Load these only when needed:
npx claudepluginhub raishin/vanguard-frontier-agentic --plugin vanguard-frontier-agenticQueries OpenRouter API to list, search, compare, and resolve 300+ AI models by pricing, context lengths, capabilities, throughput; checks provider latency, uptime, performance.
Cost estimation scripts and tools for calculating GPU hours, training costs, and inference pricing across Modal, Lambda Labs, and RunPod platforms. Use when estimating ML training costs, comparing platform pricing, calculating GPU hours, budgeting for ML projects, or when user mentions cost estimation, pricing comparison, GPU budgeting, training cost analysis, or inference cost optimization.
Queries OpenRouter's 400+ LLM catalog API to list, filter by pricing/context/tools, and compare models/providers. Use for model selection in AI apps.