From data-preprocessing-pipeline
Generates and executes Python pipelines for automated data cleaning, transformation, validation, and ETL in ML workflows. Activates on data prep or ETL requests.
How this skill is triggered — by the user, by Claude, or both
Slash command
/data-preprocessing-pipeline:preprocessing-data-with-automated-pipelinesThis skill is limited to the following tools:
The summary Claude sees in its skill listing — used to decide when to auto-load this skill
Construct and execute automated data preprocessing pipelines for cleaning, transforming, and validating ML-ready datasets.
Construct and execute automated data preprocessing pipelines for cleaning, transforming, and validating ML-ready datasets.
construct and execute automated data preprocessing pipelines, ensuring data quality and readiness for machine learning. It streamlines the data preparation process by automating common tasks such as data cleaning, transformation, and validation.
This skill activates when you need to:
User request: "Preprocess the customer data from the CSV file to remove duplicates and handle missing values."
The skill will:
User request: "Create an ETL pipeline to transform the sensor data from the database into a format suitable for time series analysis."
The skill will:
This skill can be integrated with other Claude Code skills for data analysis, model training, and deployment. It provides a standardized way to prepare data for these tasks, ensuring consistency and reliability.
The skill produces structured output relevant to the task.
npx claudepluginhub jeremylongshore/claude-code-plugins-plus-skills --plugin data-preprocessing-pipelineProvides patterns for pre-model data preprocessing pipelines covering cleaning, validation, transformation, ETL orchestration. Use for raw data deduplication, schema validation, format conversion before EDA or modeling.
Builds end-to-end AutoML pipelines with data checks, feature engineering, model selection, hyperparameter tuning, evaluation, and deployment artifacts for repeatable ML workflows.
Guides end-to-end MLOps pipeline orchestration from data preparation through model training, validation, deployment, and monitoring using DAG patterns like Airflow.