By Gaaaavin
Claude Code skills for ML/AI researchers. Commands for experiments, debugging, visualization, and SLURM jobs.
Package a local skill or command and open a PR to claude-eureka
Author new Claude Code skills or commands for ML/AI research workflows
Systematic root-cause debugging for ML/AI research code. Four phases — investigate first, fix second.
Create, launch, track, and log ML experiments with hypothesis-driven workflow
Scan a research project and generate a tailored CLAUDE.md with detected stack, research context, and active work state.
Auto-save context when the agent learns something — triggers on: remember, convention, always do, note for future, learned that, from now on, never do, experiment completion, bug resolution
Debugging guidance for research code — triggers on: error, exception, failed, crash, NaN, OOM, CUDA, traceback, bug, broken
Own this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimOwn this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimBased on adoption, maintenance, documentation, and repository signals. Not a security audit or endorsement.
Claude Code skills for ML/AI researchers.
Stop re-teaching Claude your project every session. Eureka auto-detects your stack, manages experiments, debugs training, and learns as you work.
/plugin marketplace add Gaaaavin/claude-eureka
/plugin install claude-eureka@claude-eureka
Then inside Claude Code: /init-eureka
Claude Code is Anthropic's AI coding agent. Skills extend it with slash commands and passive triggers. claude-eureka is a curated skill pack built specifically for ML/AI research workflows.
Out of the box you get commands for experiments, debugging, code review, SLURM job submission, and publication-quality plots — all assuming you know PyTorch and care about research velocity, not boilerplate.
Before eureka: After /init-eureka:
"Here's my project Claude already knows:
structure..." ✓ PyTorch + Lightning + Hydra
"I use Hydra for ✓ experiment layout in runs/
config..." ✓ SLURM cluster + GPU types
"My runs go in..." ✓ active branches and open TODOs
"Oh, and I'm ✓ NaN bug you fixed last week
working on..."
1. Install (30 seconds)
Inside Claude Code:
/plugin marketplace add Gaaaavin/claude-eureka
/plugin install claude-eureka@claude-eureka
This registers the eureka marketplace and installs the plugin. No npm, no pip — it's markdown files and one shell script.
2. Initialize your project
Open Claude Code in your project directory and run:
/init-eureka
This scans your repo, detects your stack (PyTorch, Lightning, Hydra, W&B, SLURM, etc.), and writes a tailored CLAUDE.md. Claude reads it on every prompt. Your context is set.
3. Get to work
/experiment baseline --lr 1e-4 --batch 64
/debug ← training loss exploded
/viz runs/ ← generate paper-quality figures
/submit-job train.py ← SLURM submission
/slash invocations)| Command | What it does |
|---|---|
/init-eureka | Scan project → generate tailored CLAUDE.md |
/refresh-context | Re-detect stack, update auto-sections, keep your edits |
/experiment | Create, launch, track, and log experiments |
/debug | Root-cause debugging — investigate first, patch second |
/review | Code review with YAGNI/KISS + ML anti-pattern detection |
/scaffold | Boilerplate: model, dataset, trainer, config, SLURM script |
/viz | Publication-quality figures from experiment outputs |
/notebook | Structured Jupyter analysis notebooks |
/submit-job | SLURM submission, status monitoring, log tailing |
/create-skill | Author new skills or commands (guided) |
/contribute-skill | Package a skill and open a PR to this repo |
/update-eureka | Pull latest commands and skills from GitHub |
| Skill | Activates on |
|---|---|
research-debugging | Errors, exceptions, NaN, OOM, CUDA errors, tracebacks |
auto-memory | "remember", conventions, experiment completions, bug resolutions |
Eureka uses a tiered context architecture so Claude gets exactly what it needs without wasting tokens:
CLAUDE.md (~50 lines, loaded every prompt)
│ Project identity, stack, key paths, active work state.
│
└── .claude/context/*.md (loaded on demand)
Experiments, architecture decisions, resolved bugs,
team conventions. Claude loads relevant files per query.
|
└── auto-memory (agent-maintained)
Results and learnings written back automatically.
Your second session is smarter than your first.
/init-eureka populates tiers 1 and 2. The auto-memory skill fills tier 3 over time. /refresh-context re-runs detection to keep auto-generated sections current while preserving your manual edits.
Plugin install — recommended, native Claude Code integration:
/plugin marketplace add Gaaaavin/claude-eureka
/plugin install claude-eureka@claude-eureka
Update anytime with:
/plugin update claude-eureka@claude-eureka
Alternatively (older Claude Code versions):
curl -fsSL https://raw.githubusercontent.com/Gaaaavin/claude-eureka/main/install.sh | bash
Choose user-level (~/.claude/, recommended) or project-level (./.claude/) when prompted.
npx claudepluginhub gaaaavin/claude-eureka --plugin claude-eurekaML/perf investigation skills: topic, plan, judge, run, sweep
Set up ML experiment tracking
ML engineering plugin: Give your AI coding agent ML engineering superpowers.
ML experiment tracking with metrics logging and run comparison
Guardrails your research workflow — checks hypotheses, catches known bugs, flags sloppy methodology.
Multi-model consensus engine integrating OpenAI Codex CLI, Gemini CLI, and Claude CLI for collaborative code review and problem-solving.