Skill

ML Experiment Management

From ml-research

Systematic experiment tracking, comparison, and analysis for machine learning research.

Invocation

How this skill is triggered — by the user, by Claude, or both

Slash command

/ml-research:ml-experiment

User invocable

Model invocable

Inline context

Default effort

Context Preview

The summary Claude sees in its skill listing — used to decide when to auto-load this skill

Systematic experiment tracking, comparison, and analysis for machine learning research.

Supporting Files

examples/experiment-analysis.mdexamples/wandb-integration.mdscripts/compare_experiments.pyscripts/experiment_registry.pytemplates/experiment-templates.yaml

SKILL.md

544 lines · ~2.9k tokens

Stats

LanguagePython

Stars0

MaintenanceExcellent

Last CommitApr 6, 2026

Actions

View Source View Plugin View on GitHub View README

Stats

Actions

ML Experiment Management

Invocation

Context Preview

Supporting Files

SKILL.md

ML Experiment Management

Invocation

Context Preview

Supporting Files

SKILL.md

ML Experiment Management

Quick Start

1. Create Experiment Config

2. Track Experiment Results

Automatic Tracking with Callbacks

Experiment Registry Format

3. Compare Experiments

4. Experiment Templates

Baseline Experiment

Ablation Study

Hyperparameter Optimization

5. Experiment Reproduction

Save Full Environment

Reproduce Experiment

6. Experiment Analysis

Analyze Single Experiment

Multi-Experiment Analysis

7. W&B Integration

Query W&B Runs

Compare Runs in W&B

W&B Sweeps

8. Experiment Best Practices

Naming Conventions

Documentation

Version Control

Organization

9. Common Experiment Types

A. Baseline Experiment

B. Ablation Study

C. Hyperparameter Tuning

D. Transfer Learning

E. Architecture Search

10. Experiment Commands

Troubleshooting

Success Criteria

Similar Skills

ML Experiment Management

Quick Start

1. Create Experiment Config

2. Track Experiment Results

Automatic Tracking with Callbacks

Experiment Registry Format

3. Compare Experiments

4. Experiment Templates

Baseline Experiment

Ablation Study

Hyperparameter Optimization

5. Experiment Reproduction

Save Full Environment

Reproduce Experiment

6. Experiment Analysis

Analyze Single Experiment

Multi-Experiment Analysis

7. W&B Integration

Query W&B Runs

Compare Runs in W&B

W&B Sweeps

8. Experiment Best Practices

Naming Conventions

Documentation

Version Control

Organization

9. Common Experiment Types

A. Baseline Experiment

B. Ablation Study

C. Hyperparameter Tuning

D. Transfer Learning

E. Architecture Search

10. Experiment Commands

Troubleshooting