From ab-testing
Analyze A/B test results with proper statistical methods. Accepts data as CSV, summary stats, or a file path. Reports significance, confidence intervals, effect sizes, and flags common issues.
How this skill is triggered — by the user, by Claude, or both
Slash command
/ab-testing:analyze-resultsThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
Analyze A/B test results provided by the user. Accept data in any reasonable format:
Analyze A/B test results provided by the user. Accept data in any reasonable format:
Parse $ARGUMENTS for any inline data or file paths.
Always do this first. Run a chi-squared goodness-of-fit test comparing observed sample sizes to expected allocation (default 50/50 or as stated by user).
Choose the appropriate test based on metric type:
Proportions (conversion rate):
Continuous metrics (mean values):
Revenue / skewed metrics:
Multiple variants (>2):
If the result is NOT statistically significant:
Provide a clear summary, e.g.:
"Variant B increased conversion rate by 3.2% (relative), from 12.0% to 12.4%. The 95% confidence interval for the absolute difference is [0.1%, 0.7%]. This result is statistically significant (p = 0.008) with a small effect size (Cohen's h = 0.12)."
Check for and flag:
Generate a complete Python script using scipy.stats and/or statsmodels that reproduces the full analysis. The code should:
npx claudepluginhub weisberg/agile_agentic_analytics --plugin ab-testingAnalyzes A/B test results for statistical significance, sample size validation, confidence intervals, lift, guardrails, and ship/extend/stop recommendations. Handles CSV/Excel data via Python scripts.
Analyzes A/B tests and experiments with statistical rigor: assesses power, significance, validity, segments; recommends ship/kill/extend.