llm-router routes AI coding prompts across free, budget, and premium model tiers.

llm-router

Make Claude Code, Codex, and Gemini CLI use the cheapest model that can still do the job well.
Save 35-80% on routine prompts, protect premium quota, and fall back automatically when providers fail.

Install in 30 seconds

pip install llm-routing

_{Works with Claude Code, Codex, and Gemini CLI · No API keys required on Claude Pro/Max}

Local-first. No hosted proxy. No account required.

📑 Table of Contents

Why People Install This
What You Get
Ranked #8 on RouterArena
Need Enterprise-Grade Routing? Meet Chuzom
Quick Start
Example Routing
Works With
How It Works
What You Can Do
Providers
Routing Policies
MCP Tools (60)
Savings: How It Works
Trust, Privacy, and Local-First Design
Configuration
Documentation
Contributing
Package Names
Star History
Activity

Why People Install This

AI coding tools send too many prompts to premium models by default.

That means:

You waste paid tokens on simple questions
You burn through Claude, Gemini, or OpenAI quota faster than necessary
You stop working when one provider is rate-limited or down

llm-router sits between your coding tool and your model providers. It classifies each prompt, tries the cheapest capable model first, and falls back automatically when needed.

You keep the same workflow. The router changes the model choice underneath.

llm-router

Popularity

What's Inside

README

llm-router

Why People Install This

Confidence

Similar Plugins

which-ai

openrouter

openrouter-pack

freeride

cc-fleet

litellm

Popularity

Health & Quality

Similar Plugins

which-ai

openrouter

openrouter-pack

freeride

cc-fleet

litellm