🧪 Research Reviewer

An iterative, multi-agent academic peer-review pipeline for Claude Code. It respawns a brutally honest "fresh pair of eyes" Reviewer Agent every round, scores your paper out of 100, refactors the weak spots, and loops until the manuscript hits a flawless 100/100.

Why?

Self-review is biased. After staring at your own thesis for weeks you stop seeing the missing error bars, the undefined symbol in Eq. 7, the overlapping legend in Figure 4, and the claim of "universal superiority" that three datasets don't support.

Research Reviewer fixes that by treating every review as a blind, first-time review. Each iteration spawns a brand-new Reviewer Agent with zero memory of previous drafts or scores, so it can't get complacent, can't overfit to the last version, and keeps hunting for reasons to reject. Claude applies the fixes, a new reviewer re-reviews from scratch, and the score climbs.

How it works

            ┌──────────────────────────────────────────────┐
            │   Manuscript (.tex / .md / .pdf + figures)    │
            └──────────────────────┬───────────────────────┘
                                   │
                                   ▼
              ╭───────── fresh, blind subagent ─────────╮
              │   🧐  Reviewer Agent  (no prior memory)  │
              │   • scores X / 100                       │
              │   • itemized deductions → 4 pillars      │
              │   • refactored, polished text            │
              │   • "Required Experiments" checklist     │
              ╰─────────────────────┬───────────────────╯
                                    │ apply fixes
                                    ▼
                         score == 100/100 ?  ──No──┐
                                    │ Yes          │  (respawn fresh eyes)
                                    ▼              └──────────┘
                              ✅  Done

A new agent every loop = no anchoring on earlier drafts, no "I already approved this," no overfitting to the last round.
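
The loop in the diagram can be sketched in a few lines of Python. This is an illustration only, not the skill's implementation: `Review`, `run_review_loop`, `spawn_reviewer`, and `toy_spawn` are hypothetical names, and the `max_rounds` cap is an added assumption so the sketch always terminates.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple


@dataclass
class Review:
    """One reviewer verdict (hypothetical shape, for illustration only)."""
    score: int                        # 0..100
    revised_text: str                 # refactored manuscript text
    required_experiments: List[str]   # items only the author can run


def run_review_loop(
    manuscript: str,
    spawn_reviewer: Callable[[], Callable[[str], Review]],
    max_rounds: int = 10,             # assumption: safety cap, not part of the skill
) -> Tuple[str, List[int]]:
    """Review, apply fixes, and respawn a fresh reviewer until 100/100."""
    scores: List[int] = []
    for _ in range(max_rounds):
        reviewer = spawn_reviewer()       # new reviewer each round, no shared state
        review = reviewer(manuscript)     # it sees only the current draft
        scores.append(review.score)
        if review.score == 100:
            break
        manuscript = review.revised_text  # apply fixes before the next round
    return manuscript, scores


# Toy reviewer: deducts 10 points per "TODO" marker and removes one per round.
def toy_spawn() -> Callable[[str], Review]:
    def review(text: str) -> Review:
        todos = text.count("TODO")
        return Review(100 - 10 * todos, text.replace("TODO", "", 1), [])
    return review


final_text, history = run_review_loop("TODO intro TODO results", toy_spawn)
print(history)  # [80, 90, 100]
```

Because the reviewer is created inside the loop and receives only the current draft, nothing from an earlier round can leak into a later score.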

The four pillars it grades on

Pillar	What it hunts for
1. Methodological Soundness & Mathematical Precision	Unstated assumptions, undefined variables, inconsistent notation/dimensions, arbitrary/unjustified design choices.
2. Empirical Rigor & Validity of Claims	Missing error bars / CIs / p-values, unfair or under-tuned baselines, ablations that change >1 variable at once.
3. Scope of Validity & Narrative Honesty	Overclaiming beyond the data, no honest limitations / failure modes / scalability discussion.
4. Visual Layout & Editorial Integrity	Fails the 30-second graphic test, overlapping labels/legends, orphan headings, hallucinated or placeholder citations.
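
One way to picture "itemized deductions → 4 pillars" is a tally that starts at 100 and subtracts each deduction, grouped by pillar. The sketch below assumes that simple additive scheme; how the skill actually weights deductions is not stated here, and `Deduction`, `PILLARS`, `tally`, and the point values are all hypothetical.

```python
from dataclasses import dataclass
from typing import Dict, List, Tuple

PILLARS = (
    "Methodological Soundness & Mathematical Precision",
    "Empirical Rigor & Validity of Claims",
    "Scope of Validity & Narrative Honesty",
    "Visual Layout & Editorial Integrity",
)


@dataclass
class Deduction:
    pillar: int   # 1..4, matching the table above
    points: int   # points removed (invented values, for illustration)
    reason: str


def tally(deductions: List[Deduction]) -> Tuple[int, Dict[str, int]]:
    """Return (score out of 100, points lost per pillar)."""
    per_pillar = {name: 0 for name in PILLARS}
    for d in deductions:
        per_pillar[PILLARS[d.pillar - 1]] += d.points
    score = max(0, 100 - sum(per_pillar.values()))  # never below zero
    return score, per_pillar


score, breakdown = tally([
    Deduction(1, 5, "undefined symbol in Eq. 7"),
    Deduction(2, 8, "missing error bars"),
    Deduction(4, 3, "overlapping legend in Figure 4"),
])
print(score)  # 84
```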

Install

Option A — Plugin marketplace (recommended, one line)

This repo is a Claude Code plugin marketplace. Inside Claude Code:

/plugin marketplace add roeimichael/Research-Reviewer
/plugin install research-reviewer@research-reviewer

That's it — the research-reviewer skill is now available in every session. Update later with /plugin marketplace update research-reviewer, remove with /plugin uninstall research-reviewer@research-reviewer.

Option B — Manual copy

Drop the skill straight into a skills/ folder Claude Code already reads:

git clone https://github.com/roeimichael/Research-Reviewer.git
mkdir -p ~/.claude/skills   # create the folder if it doesn't exist yet
cp -r Research-Reviewer/skills/research-reviewer ~/.claude/skills/   # global, all projects
# or:  mkdir -p .claude/skills && cp -r Research-Reviewer/skills/research-reviewer .claude/skills/   # this project only

On Windows (PowerShell):

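git clone https://github.com/roeimichael/Research-Reviewer.git
New-Item -ItemType Directory -Force ~\.claude\skills | Out-Null   # create the folder if it doesn't exist yet
Copy-Item -Recurse Research-Reviewer\skills\research-reviewer ~\.claude\skills\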
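
After a manual copy, a quick check that the folder landed in one of the two locations named above can save a confusing first session. The Python sketch below only tests that the directory exists; it does not validate the skill's contents, and `find_skill_installs` is a hypothetical helper, not something shipped with this repo.

```python
from pathlib import Path
from typing import List

SKILL_NAME = "research-reviewer"


def find_skill_installs(home: Path, project: Path) -> List[Path]:
    """Return the skill directories that exist, global location first."""
    candidates = [
        home / ".claude" / "skills" / SKILL_NAME,     # global, all projects
        project / ".claude" / "skills" / SKILL_NAME,  # this project only
    ]
    return [p for p in candidates if p.is_dir()]


if __name__ == "__main__":
    found = find_skill_installs(Path.home(), Path.cwd())
    for path in found:
        print(f"found: {path}")
    if not found:
        print("skill not found in either location")
```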

Usage

In Claude Code, just ask — the skill auto-triggers on review intent:

> review my thesis in ./paper/main.tex and get it to 100/100

or invoke it explicitly:

> /research-reviewer

Claude will:

1. Read your manuscript (and figures/tables/datasets if you point to them).
2. Spawn a fresh Reviewer Agent that scores it X/100 with itemized deductions.
3. Apply the refactored text and notation fixes back into your paper.
4. Hand you a Required Experiments checklist for the things only you can run.
5. Respawn a new blind reviewer and repeat until 100/100.
