Stats

Actions

Available In

Tags

3-Surgeons

Three-model code review consensus — disagreement is the value, not consensus.

Three independent AI models cross-examine your code. They challenge each other's blind spots, hunt for what the others missed, and surface every disagreement instead of burying it. Built on five Constitutional Physics invariants and a four-phase operating protocol. Provider-agnostic across OpenAI, DeepSeek, Anthropic, Ollama, LM Studio, vLLM, and MLX.

Surgeon

Role

Default Model

🔪

Atlas (Head Surgeon / Judge)

Synthesizes findings, weighs evidence, decides, implements — never overrides without grounds

Claude (your IDE session)

🩺

Cardiologist (External Skeptic)

External-model perspective from a different training distribution. Surfaces what Atlas can't see

DeepSeek-chat (drop-in OpenAI also supported)

🧠

Neurologist (Local Devil's Advocate)

Runs locally for privacy and corrigibility. Forces counter-position before consensus locks in

DeepSeek-chat via local proxy (Qwen3-4B legacy)

Scale

What it unlocks

1 model

One opinion. Fast. Possibly wrong, but you wouldn't know.

2 models

A check. Often agree. Disagreement = stop and look.

3 models

Triangulation. Truth becomes recoverable. The blind spot of any one model is exposed by the other two.

5+ models

A specialty board. Each surgeon brings a different training distribution. Convergence under independent attack is evidence, not opinion.

Continuous review

Every diff cross-examined. Every claim audited. Every blind spot named. Calibration compounds across the codebase.

3-Surgeons

Three-model code review consensus — disagreement is the value, not consensus.

Quick Start · Constitutional Physics · Install · IDE Compatibility · Pairs With

Would you wing a complicated surgery with one surgeon?

Then why are you shipping code reviewed by one AI?

The Problem

Every AI coding tool has the same flaw: one model, one perspective, one set of blind spots. Claude confabulates confidently where GPT hedges. GPT over-engineers where a local model stays lean. A single AI reviewer is a single point of failure — and you'd never accept that in a real operating room.

The Solution

3-Surgeons puts three independent AI models on the same operating table. They don't just review — they cross-examine, challenge assumptions, and hunt for what the others missed. Your code ships only when all three agree it's ready.

	Surgeon	Role	Default Model
🔪	Atlas (Head Surgeon / Judge)	Synthesizes findings, weighs evidence, decides, implements — never overrides without grounds	Claude (your IDE session)
🩺	Cardiologist (External Skeptic)	External-model perspective from a different training distribution. Surfaces what Atlas can't see	DeepSeek-chat (drop-in OpenAI also supported)
🧠	Neurologist (Local Devil's Advocate)	Runs locally for privacy and corrigibility. Forces counter-position before consensus locks in	DeepSeek-chat via local proxy (Qwen3-4B legacy)

The Vision: Calibrated Correctness at Any Scale

A single AI reviewer is a single point of failure. Three reviewers, hunting independently, force the truth into the open.

3-Surgeons is built on one belief: the bottleneck in AI-assisted coding is no longer speed — it's calibration. A confidently wrong answer ships faster than a careful right one. Three independent surgeons make confidence earnable — every claim survives cross-examination or it dies on the table.

Scale	What it unlocks
1 model	One opinion. Fast. Possibly wrong, but you wouldn't know.
2 models	A check. Often agree. Disagreement = stop and look.
3 models	Triangulation. Truth becomes recoverable. The blind spot of any one model is exposed by the other two.
5+ models	A specialty board. Each surgeon brings a different training distribution. Convergence under independent attack is evidence, not opinion.
Continuous review	Every diff cross-examined. Every claim audited. Every blind spot named. Calibration compounds across the codebase.

The protocol scales linearly with the number of surgeons. The architecture is provider-agnostic. The only ceiling is your tolerance for groupthink.

Constitutional Physics

Five principles that govern every surgical operation. These are invariants — no tool call, no config flag, no shortcut overrides them.

3-surgeons

Popularity

What's Inside

Confidence

README

3-Surgeons

The Problem

The Solution

The Vision: Calibrated Correctness at Any Scale

Constitutional Physics

Similar Plugins

deliberation

consensus

octo

llm-council-plugin

multi-ai-review

owlex

More by supportersimulator

multi-fleet

3-Surgeons

The Problem

The Solution

The Vision: Calibrated Correctness at Any Scale

Constitutional Physics

Popularity

Health & Quality

More by supportersimulator

multi-fleet

Similar Plugins

deliberation

consensus

octo

llm-council-plugin

multi-ai-review

owlex