Streamline end-to-end data science and ML workflows: frame business problems into ML tasks, preprocess and validate data with quality checks, perform EDA on diverse formats, design and execute experiments with hyperparameter tuning via Optuna and interpretability via SHAP, audit reproducibility and leakage, evaluate model performance and readiness for deployment, generate model cards, and extract structured learnings into docs.
Extract and categorize learnings from completed experiments into docs/ds/learnings/ for future retrieval
Profile a dataset for structure, quality, distributions, and anomalies, then output an EDA report
Design an ML experiment with hypothesis, split strategy, leakage check, and evaluation plan
Frame a data science problem and plan the approach, surfacing relevant past learnings
Clean, validate, and transform raw data using automated preprocessing pipelines
Translate business questions into DS problems with target variables, metrics, and constraints. Use when starting a project or when the objective needs sharpening.
Define hypothesis, variables, split strategy, baselines, and comparison protocol. Use before running an experiment to lock down methodology.
Compute metrics, slice by subgroups, check calibration, and flag fairness gaps. Use after training to decide ship/iterate/abandon.
Evaluate whether an ML model is ready for production deployment by checking infrastructure, monitoring, rollback, and operational requirements. Use before shipping a model to production.
Extract reusable insights from experiment results and write them as searchable learning documents. Use at project end to capture what worked, failed, and surprised.
Aeon API patterns for time series machine learning -- classification, regression, clustering, anomaly detection, segmentation, and similarity search. Use when /ds:experiment needs time-series-specific ML algorithms (ROCKET, InceptionTime, DTW classifiers), or /ds:eda needs temporal feature extraction (Catch22, ROCKET features) or change point detection. For classical statistical forecasting (ARIMA/SARIMAX) use statsmodels; for tabular ML pipelines use scikit-learn; for visualization use matplotlib.
Pre-model data preparation pipelines for cleaning, validation, transformation, and ETL orchestration. Use when raw data needs deduplication, schema validation, format conversion, or quality assurance before EDA or modeling.
Data quality validation with Great Expectations, dbt tests, and data contracts. Use when building formal validation rules, expectation suites, or data contracts for repeatable quality gates.
Systematic exploratory data analysis checklist covering structure, quality, distributions, relationships, and target analysis. Use when starting EDA on any dataset.
Standard format for logging ML experiments including hypothesis, config, results, and learnings. Use when running experiments to maintain a consistent record.
External network access
Connects to servers outside your machine
Uses power tools
Uses Bash, Write, or Edit tools
Own this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimOwn this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimBased on adoption, maintenance, documentation, and repository signals. Not a security audit or endorsement.
No model invocation
Executes directly as bash, bypassing the AI model
No model invocation
Executes directly as bash, bypassing the AI model
Data science and ML workflow tools that compound institutional knowledge. 9 agents, 8 commands, 19 skills for problem framing, preprocessing, validation, EDA, experimentation, review, deployment, and knowledge compounding.
Add the repo as a marketplace, then install:
/plugin marketplace add andikarachman/data-science-plugin
/plugin install ds@data-science-plugin
The plugin's skills and agents use Python libraries for data analysis. Install them into your active environment:
uv pip install pandas scikit-learn scipy statsmodels numpy
Optional libraries (visualization, advanced models, and high-performance DataFrames):
uv pip install matplotlib seaborn aeon xgboost lightgbm shap polars optuna
Run /ds:setup to check which libraries are installed.
Frame -> Preprocess -> Validate -> Explore -> Experiment -> Review -> Ship -> Compound -> Repeat
| Command | Purpose |
|---|---|
/ds:plan | Frame business questions as DS problems and plan approach |
/ds:preprocess | Clean, validate, and transform raw data with automated pipelines |
/ds:validate | Run formal data quality validation with expectation suites |
/ds:eda | Run structured exploratory data analysis |
/ds:experiment | Design and run rigorous ML experiments |
/ds:review | Peer review experiments for methodology and reproducibility |
/ds:ship | Assess deployment readiness and generate model cards |
/ds:compound | Capture learnings to make future projects faster |
Each cycle compounds: experiment learnings surface in future plans, error patterns inform feature engineering, and review feedback becomes institutional knowledge.
| Component | Count |
|---|---|
| Agents | 9 |
| Commands | 8 |
| Skills | 19 |
| Templates | 9 |
| MCP Servers | 1 |
| Agent | Description |
|---|---|
problem-framer | Frame business questions as structured DS problems |
data-profiler | Profile datasets for quality, structure, and anomalies |
feature-engineer | Design and evaluate feature transformations |
pipeline-builder | Assess raw data quality and design preprocessing pipelines |
| Agent | Description |
|---|---|
experiment-designer | Design rigorous experiments with hypotheses and evaluation plans |
model-evaluator | Evaluate performance with slicing, calibration, and fairness checks |
| Agent | Description |
|---|---|
documentation-synthesizer | Synthesize findings into reusable learning documents |
reproducibility-auditor | Audit experiments for reproducibility (seeds, versions, data hashes) |
deployment-readiness | Evaluate models for production deployment readiness |
| Command | Description |
|---|---|
/ds:plan | Search past learnings, frame the problem, plan the approach, output a plan doc |
/ds:preprocess | Assess data quality, design and execute preprocessing pipelines, output preprocessing report |
/ds:validate | Run data quality validation with Great Expectations, pandas, or data contracts, output validation report |
/ds:eda | Profile data, analyze distributions, check quality, output an EDA report |
/ds:experiment | Formulate hypothesis, design methodology, check for leakage, output experiment plan and results |
/ds:review | Peer review experiments for methodology, leakage, reproducibility, and statistical validity |
/ds:ship | Assess deployment readiness, generate model card and deployment documentation |
/ds:compound | Extract learnings from completed work, categorize, and save to docs/ds/learnings/ |
npx claudepluginhub andikarachman/data-science-plugin --plugin dsSkills to support Machine Learning experimentation using the Python ecosystem.
Self-documenting, self-improving framework for analytical repositories
Feature creation, selection, and transformation tools
Design experiments, profile datasets, build models, and audit them for bias before shipping
Evaluate and compare ML model performance metrics
Harness-native ECC operator layer - 67 agents, 271 skills, 92 legacy command shims, reusable hooks, rules, selective install profiles, and production-ready workflows for Claude Code, Codex, OpenCode, Cursor, and related agent harnesses