Search everything...

Stats

Actions

Available In

ds

Name: ds
Author: andikarachman

By andikarachman

Streamline end-to-end data science and ML workflows: frame business problems into ML tasks, preprocess and validate data with quality checks, perform EDA on diverse formats, design and execute experiments with hyperparameter tuning via Optuna and interpretability via SHAP, audit reproducibility and leakage, evaluate model performance and readiness for deployment, generate model cards, and extract structured learnings into docs.

npx claudepluginhub andikarachman/data-science-plugin --plugin ds

Popularity

Stars

Top 25%

Med: 0·Avg: 285

Installs

Top 10%

Med: 0·Avg: 1

What's Inside

Slash Commands8

ds:compound

/compound

Extract and categorize learnings from completed experiments into docs/ds/learnings/ for future retrieval

ds:eda

/eda

Profile a dataset for structure, quality, distributions, and anomalies, then output an EDA report

ds:experiment

/experiment

Design an ML experiment with hypothesis, split strategy, leakage check, and evaluation plan

ds:plan

/plan

Frame a data science problem and plan the approach, surfacing relevant past learnings

ds:preprocess

/preprocess

Clean, validate, and transform raw data using automated preprocessing pipelines

Agents9

problem-framer

/problem-framer

Translate business questions into DS problems with target variables, metrics, and constraints. Use when starting a project or when the objective needs sharpening.

experiment-designer

/experiment-designer

Define hypothesis, variables, split strategy, baselines, and comparison protocol. Use before running an experiment to lock down methodology.

model-evaluator

/model-evaluator

Compute metrics, slice by subgroups, check calibration, and flag fairness gaps. Use after training to decide ship/iterate/abandon.

deployment-readiness

/deployment-readiness

Evaluate whether an ML model is ready for production deployment by checking infrastructure, monitoring, rollback, and operational requirements. Use before shipping a model to production.

documentation-synthesizer

/documentation-synthesizer

Extract reusable insights from experiment results and write them as searchable learning documents. Use at project end to capture what worked, failed, and surprised.

Skills19

aeon

/aeon

Aeon API patterns for time series machine learning -- classification, regression, clustering, anomaly detection, segmentation, and similarity search. Use when /ds:experiment needs time-series-specific ML algorithms (ROCKET, InceptionTime, DTW classifiers), or /ds:eda needs temporal feature extraction (Catch22, ROCKET features) or change point detection. For classical statistical forecasting (ARIMA/SARIMAX) use statsmodels; for tabular ML pipelines use scikit-learn; for visualization use matplotlib.

data-preprocessing

/data-preprocessing

Pre-model data preparation pipelines for cleaning, validation, transformation, and ETL orchestration. Use when raw data needs deduplication, schema validation, format conversion, or quality assurance before EDA or modeling.

data-quality-frameworks

/data-quality-frameworks

Data quality validation with Great Expectations, dbt tests, and data contracts. Use when building formal validation rules, expectation suites, or data contracts for repeatable quality gates.

eda-checklist

/eda-checklist

Systematic exploratory data analysis checklist covering structure, quality, distributions, relationships, and target analysis. Use when starting EDA on any dataset.

experiment-tracking

/experiment-tracking

Standard format for logging ML experiments including hypothesis, config, results, and learnings. Use when running experiments to maintain a consistent record.

MCP Servers1

context7

External

Stats

Version2.6.0

LanguagePython

Stars11

Forks1

Copy clicks1

MaintenanceExcellent

LicenseMIT

Last CommitFeb 25, 2026

AddedMar 21, 2026

Actions

View on GitHub View README Plugin Marketplace JSON

Own this plugin?

Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).

Available In

data-science-plugin9

Safety Signals

Caution

External network access

Connects to servers outside your machine

Uses power tools

Uses Bash, Write, or Edit tools

README

Data Science Plugin

Data science and ML workflow tools that compound institutional knowledge. 9 agents, 8 commands, 19 skills for problem framing, preprocessing, validation, EDA, experimentation, review, deployment, and knowledge compounding.

Install

Add the repo as a marketplace, then install:

/plugin marketplace add andikarachman/data-science-plugin
/plugin install ds@data-science-plugin

Prerequisites

The plugin's skills and agents use Python libraries for data analysis. Install them into your active environment:

uv pip install pandas scikit-learn scipy statsmodels numpy

Optional libraries (visualization, advanced models, and high-performance DataFrames):

uv pip install matplotlib seaborn aeon xgboost lightgbm shap polars optuna

Run /ds:setup to check which libraries are installed.

Workflow

Frame -> Preprocess -> Validate -> Explore -> Experiment -> Review -> Ship -> Compound -> Repeat

Command	Purpose
`/ds:plan`	Frame business questions as DS problems and plan approach
`/ds:preprocess`	Clean, validate, and transform raw data with automated pipelines
`/ds:validate`	Run formal data quality validation with expectation suites
`/ds:eda`	Run structured exploratory data analysis
`/ds:experiment`	Design and run rigorous ML experiments
`/ds:review`	Peer review experiments for methodology and reproducibility
`/ds:ship`	Assess deployment readiness and generate model cards
`/ds:compound`	Capture learnings to make future projects faster

Each cycle compounds: experiment learnings surface in future plans, error patterns inform feature engineering, and review feedback becomes institutional knowledge.

Components

Component	Count
Agents	9
Commands	8
Skills	19
Templates	9
MCP Servers	1

Agents

Analysis (4)

Agent	Description
`problem-framer`	Frame business questions as structured DS problems
`data-profiler`	Profile datasets for quality, structure, and anomalies
`feature-engineer`	Design and evaluate feature transformations
`pipeline-builder`	Assess raw data quality and design preprocessing pipelines

Modeling (2)

Agent	Description
`experiment-designer`	Design rigorous experiments with hypotheses and evaluation plans
`model-evaluator`	Evaluate performance with slicing, calibration, and fairness checks

Review (3)

Agent	Description
`documentation-synthesizer`	Synthesize findings into reusable learning documents
`reproducibility-auditor`	Audit experiments for reproducibility (seeds, versions, data hashes)
`deployment-readiness`	Evaluate models for production deployment readiness

Commands

Command	Description
`/ds:plan`	Search past learnings, frame the problem, plan the approach, output a plan doc
`/ds:preprocess`	Assess data quality, design and execute preprocessing pipelines, output preprocessing report
`/ds:validate`	Run data quality validation with Great Expectations, pandas, or data contracts, output validation report
`/ds:eda`	Profile data, analyze distributions, check quality, output an EDA report
`/ds:experiment`	Formulate hypothesis, design methodology, check for leakage, output experiment plan and results
`/ds:review`	Peer review experiments for methodology, leakage, reproducibility, and statistical validity
`/ds:ship`	Assess deployment readiness, generate model card and deployment documentation
`/ds:compound`	Extract learnings from completed work, categorize, and save to docs/ds/learnings/

Skills

View full README on GitHub

ds

Popularity

What's Inside

Confidence

README

Data Science Plugin

Install

Prerequisites

Workflow

Components

Agents

Analysis (4)

Modeling (2)

Review (3)

Commands

Skills

Similar Plugins

probabl-skills

mycelium

feature-engineering-toolkit

data-scientist

model-evaluator

ecc

Data Science Plugin

Install

Prerequisites

Workflow

Components

Agents

Analysis (4)

Modeling (2)

Review (3)

Commands

Skills

Popularity

Health & Quality

Similar Plugins

probabl-skills

mycelium

feature-engineering-toolkit

data-scientist

model-evaluator

ecc