Research Loop — Agent OS for scientific researchers. Auto-triggering skills for deep research, literature review, autonomous experimentation, paper writing, and more.
Use when user mentions autonomous iteration, metric-driven optimization, $research-loop plan, $research-loop debug, $research-loop fix, $research-loop security, $research-loop ship, $research-loop scenario, $research-loop predict, $research-loop learn, $research-loop reason, $research-loop probe, or mentions "research-loop" with a goal/metric. Autonomous Goal-directed Iteration — apply Karpathy's autoresearch principles: modify, verify, keep/discard, repeat. Supports bounded mode via Iterations: N inline config.
> Adapted from [karpathy/autoresearch](https://github.com/karpathy/autoresearch) program.md.
Mandatory activation layer — loads on any conversation start. Establishes skill-loading protocol, Red Flags, priority rules, and HARD-GATE enforcement for all research-loop skills.
Run a thorough, multi-phase deep research investigation on a topic with subagent dispatch, provenance tracking, and integrity verification.
Use when user wants to test multiple angles on an idea, run parallel hypothesis lanes, or explore a gap from different entry points.
Own this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimOwn this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimBased on adoption, maintenance, documentation, and repository signals. Not a security audit or endorsement.
Research Loop is a zero-dependency Claude Code plugin that gives your coding agent a complete scientific research workflow. 21 composable skills, 4 specialist subagents, zero dependencies. Clone and go.
It starts from the moment you open your coding agent and mention anything research-related. As soon as it sees that you're exploring a topic, it doesn't just start searching and dumping information. Instead, it steps back and asks you what you're really trying to figure out.
Once it's teased the research framing out of the conversation, it explores in parallel — papers, repos, debates, open problems — and shows you the landscape in a synthesis short enough to actually read and act on.
After you've picked a direction, your agent finds the gaps, runs them through a Carlini gate (one question at a time, waiting for your answers), and surfaces the ideas worth pursuing. Then it spins up parallel hypothesis lanes, applies gates between them, and kills the weak ones early.
Next up, once you say "go", it launches a subagent-driven experiment loop — proposing code mutations, running benchmarks, annotating results causally, and building a living knowledge graph that remembers what was tried and why it failed.
When you want to understand something deeply, just say "explain X." The learn skill walks you through how experts actually think about a topic — not a summary, but the underlying reasoning structures: core mental models one at a time, the places where the field genuinely disagrees, questions that expose whether you understand or just recognize, and finally a reverse test where you explain it back. Every gap gets logged to the lab notebook.
Need a paper? Say "write a paper on X" and the paper-pipeline skill drives a 14-phase pipeline from topic framing all the way to conference-formatted export.
Need to optimize something? Say "optimize this metric" and the autonomous-iteration skill runs a Karpathy-style Modify → Verify → Keep/Discard loop with automatic rollback.
There's a bunch more to it, but that's the core of the system. And because the skills trigger automatically, you don't need to do anything special. Your agent just has Research Loop.
If Research Loop has helped you do work that matters and you are so inclined, consider sponsoring the project.
Thanks!
— Alexander
Research Loop is a zero-dependency Claude Code plugin — it works by dropping skill files into your project. No binaries, no builds, no dependencies.
git clone https://github.com/moralespanitz/research-loop
cd research-loop
That's it. Open a new Claude Code session in this directory and the skills auto-activate.
git clone https://github.com/moralespanitz/research-loop /tmp/research-loop
cp -r /tmp/research-loop/.claude /tmp/research-loop/CLAUDE.md /your/project/
/plugin install research-loop
A Go binary is available for users who want CLI tools (research-loop init, research-loop start <arxiv-url>, etc.):
go install github.com/moralespanitz/research-loop/cmd/research-loop@latest
research-loop init
Open a new Claude Code session in the workspace and say anything research-related — "I want to explore transformer memory systems" or "explain policy compression." The bootstrap skill loads automatically, then routes to the right sub-skill. You should see the agent announce which skill it's using.
Research Loop ships 21 skills organized by workflow layer. Skills auto-trigger — you just talk naturally and the right skill activates.
npx claudepluginhub moralespanitz/research-loop --plugin research-loopDecision-making framework based on Nicholas Carlini's research methodology. Evaluate and rank ideas, projects, and priorities across 10 dimensions: impact, uniqueness, comparative advantage, knowledge, de-risking, collaboration, timing, focus, communication, and kill/persist.
Autonomous, personalized research loops for Claude Code. Set a topic, walk away, come back to a quality-gated report adapted to your projects.
Oh My Paper research harness: memory system, Codex delegation, and pipeline commands for academic research projects.
Autonomous research orchestration: agents for hypothesis-driven investigation, experiment running, fresh-eyes review, and batch evaluation.
Synapse research orchestration plugin for Claude Code. Connects AI agents to Synapse for experiment execution, literature search, progress reporting, and autonomous research loops.
Scientific research agent extension - turns research goals into reproducible Jupyter notebooks with Python REPL, data analysis, and ML workflows
Autonomous research loops with 10 commands. Generalizes Karpathy's autoresearch loop to any domain with mechanical evaluation, overnight persistence, and zero dependencies.