By brainfo
Plan / execute / review harness for trustworthy computational analysis. Five gated subagents (planner, executor, reviewer-numeric, reviewer-literature, overseer) and the /autoreview-run command drive a deterministic, file-based ledger so no conclusion rests on an agent's say-so. Requires the autoreview CLI (pip install autoreview, or uv tool install).
Executes the plan. For each plan step it writes the analysis code, runs it, captures the results, and logs claims to the ledger - each claim carrying the structured numbers it produced, the input/output files (with hashes), and a first set of numeric checks. Produces results and conclusions; does not judge them.
The overarching watcher. Guards file integrity (declared inputs exist and are byte-for-byte intact before a step, declared outputs appear after it) and checks that every other agent actually did its job - the planner left a plan, the executor logged the claims the plan promised, the reviewers cleared the pending queue, no input was mutated mid-run. Returns a go / no-go at each checkpoint.
Designs the analysis plan. Turns an analysis question plus the available input files into an ordered, explicit plan of steps - each declaring its inputs, the approach, its expected outputs, and the claims it will produce - written to plan.json. Does not run any analysis code.
Reviews each claim's interpretation against published literature. For every claim that carries a domain assertion and has no literature verdict, it searches the literature, weighs supporting and contradicting evidence skeptically, and logs a verdict with citations, a confidence level, and an explicit caveat. It judges the interpretation, not the arithmetic.
Reviews the numbers by logic. For each claim it reasons about the invariants the numbers must obey - proportions summing to 1, count == fraction * total, consistency of a quantity across analyses, expected orderings, order-of-magnitude and dimensional sanity - and authors those as declarative check specs that the deterministic engine evaluates. It never eyeballs arithmetic; every judgement is a runnable check.
Uses power tools
Uses Bash, Write, or Edit tools
Own this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimOwn this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimBased on adoption, maintenance, documentation, and repository signals. Not a security audit or endorsement.
A plan / execute / review harness for trustworthy computational analysis.
It separates the work of an analysis into distinct roles and it keeps a deterministic, append-only ledger of every claim and every verdict so nothing rests on an agent's say-so.
A full worked run is saved in examples/placenta_showcase/, which was from one prompt "/autoreview-run what you notice with @first_trimester_final.h5ad
@term_final.h5ad".
The following two are required
uv tool install git+https://github.com/brainfo/autoreview.git
The five subagents, the /autoreview-run command, and the Stop hook ship as a
plugin, so they are available in every project without copying .claude/ around.
This repo is its own marketplace.
/plugin marketplace add brainfo/autoreview
/plugin install autoreview@autoreview-marketplace
Without the plugin you can still copy .claude/agents/ and
.claude/commands/autoreview-run.md into a project's .claude/, or into
~/.claude/ for all projects.
In Claude Code:
/autoreview-run <your analysis question> path/to/data ...
Every role records and checks its work through the autoreview CLI, so the
state of the review lands in .autoreview/ as it happens:
| role | drives | writes |
|---|---|---|
conductor (the /autoreview-run command) | sets AUTOREVIEW_DIR; launches each role and the overseer; autoreview report | orchestration; the final REPORT.md and summary |
overseer (at every checkpoint) | autoreview guard register, autoreview guard verify, autoreview pending, autoreview status | the integrity manifest; a GO / NO-GO gate between stages |
planner | writes plan.json directly | the ordered plan: inputs, steps, and the claims each step will produce |
executor | autoreview claim add, autoreview guard register | each claim with its numbers, seed checks, and input/output hashes |
reviewer-numeric | autoreview check run | the task-specific invariants and the numeric verdicts |
reviewer-literature | autoreview verdict add | cited literature verdicts, each with a confidence and a caveat |
Because the record is in files and the checks have no LLM in them, you point the
same CLI at .autoreview/ to inspect the run and independently reproduce its
verdicts - without re-running the agents or taking their word for anything:
autoreview status # every claim, both review tracks, one line each
autoreview pending # anything still unreviewed
autoreview guard verify # re-hash inputs/outputs, confirm nothing drifted mid-run
autoreview check run # re-run the numeric checks - deterministic, same verdicts
autoreview report # regenerate REPORT.md
The agent run and your audit use identical commands against identical files. That sameness is reproducibility, not redundancy: it is what lets you re-earn trust in a result without re-trusting the agents that produced it.
.claude/hooks/autoreview_stop.sh re-runs the deterministic checks and refreshes
the report after each turn, surfacing what still needs an agent. It is a safe
no-op in any project without an .autoreview ledger.
To enable it without the plugin, create a
.claude/settings.json yourself with:
{
"permissions": { "allow": ["Bash(autoreview:*)"] },
"hooks": {
"Stop": [
{ "matcher": "",
"hooks": [
{ "type": "command",
"command": "bash \"$CLAUDE_PROJECT_DIR/.claude/hooks/autoreview_stop.sh\"" }
] }
]
}
}
MIT - see LICENSE.
npx claudepluginhub brainfo/autoreview --plugin autoreviewComprehensive PR review agents specializing in comments, tests, error handling, type design, code quality, and code simplification
Upstash Context7 MCP server for up-to-date documentation lookup. Pull version-specific documentation and code examples directly from source repositories into your LLM context.
Comprehensive feature development workflow with specialized agents for codebase exploration, architecture design, and quality review
Comprehensive startup business analysis with market sizing (TAM/SAM/SOM), financial modeling, team planning, and strategic research
v9.44.1 — Patch release for Gemini environment/version detection and qwen auth gating. Run /octo:setup.
Complete creative writing suite with 10 specialized agents covering the full writing process: research gathering, character development, story architecture, world-building, dialogue coaching, editing/review, outlining, content strategy, believability auditing, and prose style/voice analysis. Includes genre-specific guides, templates, and quality checklists.