From nerd
Reference for designing nerd experiments — competing theories, sweep harnesses, ground truth strategies, metric selection, and feasibility checks. Use when creating or reviewing experiment plans.
How this skill is triggered — by the user, by Claude, or both
Slash command
/nerd:experiment-planningThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
1. **Competing theories** (3+) — not just "is X optimal?" but "what's really going on?"
combos = product(range_sizes)
time = combos * data_size * cost_per_eval
if time > 1 hour: reduce ranges or use random search
Not all parameters can be swept. Parameters in non-executable files (markdown agent prompts, documentation, config comments) or those requiring human judgment are analytical — they can be reasoned about with competing theories but cannot be iterated in a nerd-loop.
For analytical parameters:
experiment_type: "analytical" in the planFor experimentable parameters:
npx claudepluginhub shawnroos/shrimpshack --plugin nerdDesigns ML experiments: ablation studies, baseline comparisons, experiment matrices; estimates GPU/API costs; generates config stubs, execution scripts, and analysis plans.
Designs controlled experiments (A/B, multivariate, quasi) with hypothesis, success metrics, sample size, and statistical power. For validating features via /design-experiment or phrases like 'design experiment'.
Designs A/B experiment plans with hypothesis, primary/secondary/guardrail metrics, audience allocation, holdout strategy, duration estimates, and risks. Use for feature test planning.