Run an Ollama review that challenges the implementation approach and design choices
Cancel an active background Ollama job in this repository
Autonomously execute an approved Ollama plan, with implement → verify → retry per step
Render an existing Ollama plan as markdown
Generate an evidence-grounded plan for a multi-step task using an Ollama planner model
Internal helper contract for calling the ollama-companion runtime from Claude Code
Guidance on selecting and prompting open-weight Ollama models for review and rescue tasks
Internal guidance for presenting Ollama helper output back to the user
Uses power tools
Uses Bash, Write, or Edit tools
No model invocation
Own this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimOwn this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimBased on adoption, maintenance, documentation, and repository signals. Not a security audit or endorsement.
Executes directly as bash, bypassing the AI model
Executes directly as bash, bypassing the AI model
Runs pre-commands
Contains inline bash commands via ! syntax
Runs pre-commands
Contains inline bash commands via ! syntax
Use a local Ollama model from Claude Code to review code or delegate tasks.
This plugin lets you run code reviews and background rescue tasks against an
Ollama server — local by default, with optional access to
hosted frontier models via Ollama Cloud (:cloud suffix). No OpenAI account, no
API key plumbing, and your code stays on your hardware unless you explicitly
choose a cloud-hosted model.
Install Ollama — download the desktop app or follow the CLI instructions at ollama.com.
Pull a model:
ollama pull llama3.1:8b
Install the plugin (placeholder — update once published to a marketplace):
/plugin install ollama@darrylmorley/ollama-plugin-cc
Run setup:
/ollama:setup
Setup checks that Ollama is installed, running, and has at least one model. It also lets you set a default model and optionally enable the stop-time review gate.
Try a review:
/ollama:review
| Command | What it does |
|---|---|
/ollama:review | Read-only review of current uncommitted changes or a branch diff |
/ollama:adversarial-review | Steerable review that challenges design decisions and tradeoffs |
/ollama:rescue | Delegates a task to Ollama; runs an agentic tool-calling loop by default (--emit-patch for one-shot diff) |
/ollama:status | Shows running and recent Ollama jobs for the current repo |
/ollama:result | Shows the stored output for a finished job |
/ollama:cancel | Cancels an active background job |
/ollama:setup | Checks Ollama readiness, pulls models, sets defaults, toggles review gate |
See the ollama-model-prompting skill for full guidance. Short version:
Empirically battle-tested against a SQL-injection fixture. See
docs/MODELS.md for the full results table and reproducer.
| Model | Review | Adv. review | Rescue | Best for |
|---|---|---|---|---|
gpt-oss:20b | ✓ 26s | ✓ 24s | ✓ 4 iter / 20s | All-rounder, balanced size/quality |
gemma4:26b | ✓ 69s | ✓ 110s | ✓ 5 iter / 29s | Rescue when patches reject; reliable structured output |
qwen3.5:9b | ✓ 79s | ✓ 74s | ✓ 3 iter / 44s | VRAM-constrained rigs (6.6 GB) |
qwen3.6:27b-coding-nvfp4 | ✗ schema | flaky | ✓ 4 iter / 120s | Rescue only — review path unstable on Apple Silicon |
batiai/qwen3.6-27b:q6 | ✗ schema | ✗ schema | ✓ 1 iter / 300s | Rescue only — review path drifts off schema |
Tool-calling (used by agentic rescue) is reliable on Llama 3.1+, Qwen 2.5+/3+, DeepSeek-Coder-V2+, GPT-OSS, Gemma 3+, GLM 4+, Kimi K2+, and Granite 3. Smaller models (3B, 1B), thinking-token models (DeepSeek-R1 distills), and pre-3 Gemma fall back to patch-emit automatically.
If your hardware can't run a strong local model — or you want frontier-quality
output for a tough adversarial review — Ollama Cloud exposes hosted models
behind the same API via a :cloud suffix. The plugin treats them identically
to local models; nothing in the plugin needs to change.
| Model | Review | Adv. review | Rescue | Notes |
|---|---|---|---|---|
qwen3-coder-next:cloud | ✓ 6s | ✓ 6s | ✓ 3 iter / 9s | Fastest across the board — 80B FP8 |
glm-5.1:cloud | ✓ 63s | ✓ 47s | ✓ 6 iter / 29s | Reliable structured output; strong rescue |
kimi-k2.6:cloud | flaky | ✓ 113s | ✓ 3 iter / 13s | 1T params; review schema drift, adversarial fine |
Cloud models send your diff context to Ollama's hosted endpoint — opt in by
passing one explicitly via --model or /ollama:setup --default-model.
Everything else stays local.
Override the model on any command with --model <name>.
| Variable | Description |
|---|---|
OLLAMA_HOST | Ollama server URL (default: http://127.0.0.1:11434) |
OLLAMA_PLUGIN_DEFAULT_MODEL | Fallback model when --model is not passed and no per-workspace config is set |
OLLAMA_PLUGIN_RESCUE_ALLOW_COMMANDS | Comma-separated list of extra commands for agentic rescue's run_command tool; use * to allow all |
Per-workspace config (set via /ollama:setup --default-model) is stored in the plugin state
directory and takes precedence over OLLAMA_PLUGIN_DEFAULT_MODEL.
npx claudepluginhub darrylmorley/ollama-plugin-cc --plugin ollamaMulti-model consensus engine integrating OpenAI Codex CLI, Gemini CLI, and Claude CLI for collaborative code review and problem-solving.
Ultra-compressed communication mode. Cuts ~75% of tokens while keeping full technical accuracy by speaking like a caveman.
Comprehensive UI/UX design plugin for mobile (iOS, Android, React Native) and web applications with design systems, accessibility, and modern patterns
Curate auto-memory, promote learnings to CLAUDE.md and rules, extract proven patterns into reusable skills.
Memory compression system for Claude Code - persist context across sessions
Standalone image generation plugin using Nano Banana MCP server. Generates and edits images, icons, diagrams, patterns, and visual assets via Gemini image models. No Gemini CLI dependency required.