Search everything...

Stats

Actions

Available In

autoresearch

Name: autoresearch
Author: dev-jahn

By dev-jahn

Karpathy-style autonomous LLM research loop adapted for generic ML codebases. Ships two skills — setup and run — plus an `ar` helper CLI that owns launch orchestration, metric extraction, atomic checkpoint commit, and chain-mode transitions so the main Claude session stays context-light across hundreds or thousands of iterations. v0.3.0 drops the wandb/accelerate monoculture: metric backend is pluggable (wandb / tensorboard / log-scan / custom with auto-detection), distributed-framework resolution is auto-detected (accelerate/deepspeed/fsdp/ddp/lightning/none), a new `hydra` entry pattern renders a Hydra-override-style train wrapper, and `--checkpoint-glob` gives priority-0 control over checkpoint discovery for Lightning / plain-torch / custom layouts.

npx claudepluginhub dev-jahn/jahns-cc-marketplace --plugin autoresearch

Popularity

Stars

Med: 0·Avg: 285

Installs

Med: 0·Avg: 1

What's Inside

Skills2

autoresearch:run

/run

This skill should be used when the user asks to "start the autoresearch loop", "kick off overnight iteration", "begin autonomous experiment runs", "run /autoresearch:run", "run the autoresearch expr <slug>", "continue the autoresearch loop", "resume autoresearch", "chain through follow-up experiments", or otherwise hand off an ML experiment to the autonomous runner. Drives the self-propelling train.py iteration loop on a configured `.autoresearch/{expr}/` experiment — one-line edit, `ar run`, read `result.json`, decide next edit, repeat — for hours or days until a termination condition fires. Context-minimized so thousands of iterations fit in a single session. Invoke immediately without asking clarifying questions beyond the structured interview; the skill itself is self-driving and must never stop mid-loop to ask the user "continue?" — Ctrl+C is the only authorized interrupt.

autoresearch:setup

/setup

Scaffolds a new autonomous-research experiment directory (`.autoresearch/{YYMMDD}-{slug}/`) inside a deep-learning project so Claude can run a long train.py-mutation loop without blowing context. This skill should be used when the user asks to "start an autoresearch experiment", "set up autonomous research loop on this project", "create a new .autoresearch run", "scaffold autoresearch", "initialize autoresearch for this repo", "kick off an autonomous training loop", "set up Karpathy-style autoresearch here", or otherwise indicates they want Claude to begin autonomous iteration on their ML research code. The skill performs a venv preflight, analyzes the project's editable-install Python packages, surfaces primary-metric candidates from whichever tracker the host uses (wandb / tensorboard / plain stdout logs), introspects the host's training entrypoint (argparse-CLI script vs importable main() function vs hydra app), infers the distributed framework (accelerate / torchrun / FSDP / DDP / pytorch-lightning / none), detects checkpoint conventions (HF Trainer / Lightning / plain torch.save), runs a short interview, and then materializes the expr by calling `ar init` which renders the train.py / prepare.py / program.md templates.

README

autoresearch-plugin

Published artifact for the autoresearch Claude Code plugin.

This repo is the public mirror of the plugin/ directory of Dev-Jahn/autoresearch (source is private; this repo is what end users install).

Install

/plugin marketplace add Dev-Jahn/jahns-cc-marketplace
/plugin install autoresearch@jahns-cc-marketplace
/reload-plugins

See PLUGIN.md for the plugin's own README (installation, capabilities, known limitations).

Releases

Tags v0.x.0 are cut by the source repo's publish workflow every minor bump. Patch versions (0.x.y, y>0) are not published — iterate in private first.

Similar Plugins

creative-writing

28·58·

Complete creative writing suite with 10 specialized agents covering the full writing process: research gathering, character development, story architecture, world-building, dialogue coaching, editing/review, outlining, content strategy, believability auditing, and prose style/voice analysis. Includes genre-specific guides, templates, and quality checklists.

4mo

v1.7.0

greyhaven-ai

caveman

71.9k·6.6K·

Ultra-compressed communication mode. Cuts ~75% of tokens while keeping full technical accuracy by speaking like a caveman.

v1.9.0

JuliusBrussee

frontend-design

30.2k·663·

Frontend design skill for UI/UX implementation

v1.0.0

anthropics

fullstack-dev-skills

10.0k·455·

Comprehensive skill pack with 66 specialized skills for full-stack developers: 12 language experts (Python, TypeScript, Go, Rust, C++, Swift, Kotlin, C#, PHP, Java, SQL, JavaScript), 10 backend frameworks, 6 frontend/mobile, plus infrastructure, DevOps, security, and testing. Features progressive disclosure architecture for 50% faster loading.

v0.4.15

Jeffallan

ui-design

35.7k·318·

Comprehensive UI/UX design plugin for mobile (iOS, Android, React Native) and web applications with design systems, accessibility, and modern patterns

3mo

v1.0.3

wshobson

claude-mem

83.0k·304·

Memory compression system for Claude Code - persist context across sessions

v13.6.2

thedotmack

More by dev-jahn

md-linker

0·

Markdown link graph and staleness detection for Claude Code. Automatically tracks cross-references between markdown documents and detects when linked content becomes stale.

3mo

v0.1.0

dev-jahn

jahns-workflow

0·

SSOT-anchored agentic development workflow harness: one-click project setup, task registry with a global naming convention, zero-token roadmap rendering, round closeout, external review ingestion, and scoped SSOT audits.

v0.1.2

Dev-Jahn

autoresearch

Popularity

Confidence

What's Inside

README

autoresearch-plugin

Install

Releases

Similar Plugins

creative-writing

caveman

frontend-design

fullstack-dev-skills

ui-design

claude-mem

More by dev-jahn

md-linker

jahns-workflow

autoresearch-plugin

Install

Releases

Popularity

Health & Quality

More by dev-jahn

md-linker

jahns-workflow

Similar Plugins

creative-writing

caveman

frontend-design

fullstack-dev-skills

ui-design

claude-mem