By dev-jahn
Karpathy-style autonomous LLM research loop adapted for generic ML codebases. Ships two skills — setup and run — plus an `ar` helper CLI that owns launch orchestration, metric extraction, atomic checkpoint commit, and chain-mode transitions so the main Claude session stays context-light across hundreds or thousands of iterations. v0.3.0 drops the wandb/accelerate monoculture: metric backend is pluggable (wandb / tensorboard / log-scan / custom with auto-detection), distributed-framework resolution is auto-detected (accelerate/deepspeed/fsdp/ddp/lightning/none), a new `hydra` entry pattern renders a Hydra-override-style train wrapper, and `--checkpoint-glob` gives priority-0 control over checkpoint discovery for Lightning / plain-torch / custom layouts.
Based on adoption, maintenance, documentation, and repository signals. Not a security audit or endorsement.
This skill should be used when the user asks to "start the autoresearch loop", "kick off overnight iteration", "begin autonomous experiment runs", "run /autoresearch:run", "run the autoresearch expr <slug>", "continue the autoresearch loop", "resume autoresearch", "chain through follow-up experiments", or otherwise hand off an ML experiment to the autonomous runner. Drives the self-propelling train.py iteration loop on a configured `.autoresearch/{expr}/` experiment — one-line edit, `ar run`, read `result.json`, decide next edit, repeat — for hours or days until a termination condition fires. Context-minimized so thousands of iterations fit in a single session. Invoke immediately without asking clarifying questions beyond the structured interview; the skill itself is self-driving and must never stop mid-loop to ask the user "continue?" — Ctrl+C is the only authorized interrupt.
Scaffolds a new autonomous-research experiment directory (`.autoresearch/{YYMMDD}-{slug}/`) inside a deep-learning project so Claude can run a long train.py-mutation loop without blowing context. This skill should be used when the user asks to "start an autoresearch experiment", "set up autonomous research loop on this project", "create a new .autoresearch run", "scaffold autoresearch", "initialize autoresearch for this repo", "kick off an autonomous training loop", "set up Karpathy-style autoresearch here", or otherwise indicates they want Claude to begin autonomous iteration on their ML research code. The skill performs a venv preflight, analyzes the project's editable-install Python packages, surfaces primary-metric candidates from whichever tracker the host uses (wandb / tensorboard / plain stdout logs), introspects the host's training entrypoint (argparse-CLI script vs importable main() function vs hydra app), infers the distributed framework (accelerate / torchrun / FSDP / DDP / pytorch-lightning / none), detects checkpoint conventions (HF Trainer / Lightning / plain torch.save), runs a short interview, and then materializes the expr by calling `ar init` which renders the train.py / prepare.py / program.md templates.
Published artifact for the autoresearch Claude Code plugin.
This repo is the public mirror of the plugin/ directory of
Dev-Jahn/autoresearch (source is
private; this repo is what end users install).
/plugin marketplace add Dev-Jahn/jahns-cc-marketplace
/plugin install autoresearch@jahns-cc-marketplace
/reload-plugins
See PLUGIN.md for the plugin's own README (installation,
capabilities, known limitations).
Tags v0.x.0 are cut by the source repo's publish workflow every minor bump.
Patch versions (0.x.y, y>0) are not published — iterate in private first.
Own this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimOwn this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimnpx claudepluginhub dev-jahn/jahns-cc-marketplace --plugin autoresearchMarkdown link graph and staleness detection for Claude Code. Automatically tracks cross-references between markdown documents and detects when linked content becomes stale.
SSOT-anchored agentic development workflow harness: one-click project setup, task registry with a global naming convention, zero-token roadmap rendering, round closeout, external review ingestion, and scoped SSOT audits.
Complete creative writing suite with 10 specialized agents covering the full writing process: research gathering, character development, story architecture, world-building, dialogue coaching, editing/review, outlining, content strategy, believability auditing, and prose style/voice analysis. Includes genre-specific guides, templates, and quality checklists.
Ultra-compressed communication mode. Cuts ~75% of tokens while keeping full technical accuracy by speaking like a caveman.
Frontend design skill for UI/UX implementation
Comprehensive skill pack with 66 specialized skills for full-stack developers: 12 language experts (Python, TypeScript, Go, Rust, C++, Swift, Kotlin, C#, PHP, Java, SQL, JavaScript), 10 backend frameworks, 6 frontend/mobile, plus infrastructure, DevOps, security, and testing. Features progressive disclosure architecture for 50% faster loading.
Comprehensive UI/UX design plugin for mobile (iOS, Android, React Native) and web applications with design systems, accessibility, and modern patterns
Memory compression system for Claude Code - persist context across sessions