By ypollak2
Intelligently route AI tasks across 20+ LLM providers (OpenAI, Anthropic, Gemini, Ollama, etc.) using complexity-first model selection to minimize costs while preserving output quality. Automatically classify prompts and dispatch to the cheapest capable model, track cross-session savings, and receive weekly cost digests via Slack/Discord.
Automates the full release pipeline for llm-router. Run this skill whenever
Route a task to the best LLM based on task type and complexity
Route tasks to the cheapest capable model automatically using llm-router MCP tools.
Track and report how much you've saved by routing tasks to cheaper models.
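The complexity-first selection and savings tracking described above can be sketched roughly as follows. This is a minimal illustration, not llm-router's actual implementation: the model names, prices, keyword heuristic, and the functions `classify`, `route`, and `savings` are all hypothetical and chosen only to show the idea of picking the cheapest model that is still capable of handling a task.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Model:
    name: str                 # hypothetical label, not a real model ID
    max_complexity: int       # highest complexity tier this model is trusted with
    usd_per_1k_tokens: float  # made-up price, for illustration only


# Hypothetical catalog; a real router would load providers and prices from config.
CATALOG = [
    Model("small-local", 1, 0.0),
    Model("mid-hosted", 2, 0.002),
    Model("frontier", 3, 0.03),
]


def classify(prompt: str) -> int:
    """Toy complexity heuristic: 1 = simple, 2 = moderate, 3 = complex."""
    text = prompt.lower()
    if any(keyword in text for keyword in ("prove", "architect", "refactor")):
        return 3
    if len(prompt.split()) > 40 or "explain" in text:
        return 2
    return 1


def route(prompt: str) -> Model:
    """Return the cheapest model whose capability tier covers the prompt."""
    tier = classify(prompt)
    capable = [m for m in CATALOG if m.max_complexity >= tier]
    return min(capable, key=lambda m: m.usd_per_1k_tokens)


def savings(prompt: str, tokens: int) -> float:
    """Estimated USD saved versus always using the most expensive model."""
    baseline = max(CATALOG, key=lambda m: m.usd_per_1k_tokens)
    chosen = route(prompt)
    return (baseline.usd_per_1k_tokens - chosen.usd_per_1k_tokens) * tokens / 1000


if __name__ == "__main__":
    for prompt in ("What is 2+2?", "Explain how DNS works", "Refactor this module"):
        print(prompt, "->", route(prompt).name, f"saves ${savings(prompt, 1000):.4f}")
```

Classifying first and filtering by capability before comparing prices keeps the cost optimization from ever selecting a model below the required tier, which is the sense in which quality is preserved while cost is minimized.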
npx claudepluginhub ypollak2/llm-router --plugin llm-router
A real-time directory of AI models that allows your AI agent to advise and pick the ideal LLM for the user's task.
OpenRouter SDK plugin - unified interface for 500+ LLM models with intelligent routing, cost optimization, and framework integrations (Vercel AI SDK, LangChain, OpenAI SDK, PydanticAI)
Flagship+ skill pack for OpenRouter - 30 skills for multi-model routing, fallbacks, and LLM gateway mastery
Teach Claude to detect, wire, and troubleshoot FreeRide — a local OpenAI-compatible gateway routing across free-tier providers (OpenRouter, Groq, NVIDIA NIM, Cloudflare Workers AI, HuggingFace).
Spawn any third-party LLM provider with an Anthropic-compatible API (e.g. DeepSeek, GLM, Kimi, Qwen, MiniMax) as real Claude Code agent-team teammates or one-shot subagents — driven exactly like native teammates. Your main session's own auth is untouched (OAuth subscription or API key, either works); provider workers bill the provider API key via apiKeyHelper (the key never enters env/argv/history). Requires the `cc-fleet` binary on PATH, installed separately.
When calling LLM APIs from Python code. When connecting to llamafile or local LLM servers. When switching between OpenAI/Anthropic/local providers. When implementing retry/fallback logic for LLM calls. When code imports litellm or uses completion() patterns.