From data-scientist
Profiles datasets for data quality issues: missing values, outliers, class imbalance, correlation problems, and schema drift. Provides detection methods and actionable recommendations.
How this skill is triggered — by the user, by Claude, or both
Slash command
/data-scientist:dataset-profilingThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
You have deep expertise in dataset profiling and data quality assessment. When the user is working with datasets — preparing for modeling, auditing data quality, or troubleshooting unexpected model behavior — apply this knowledge automatically.
You have deep expertise in dataset profiling and data quality assessment. When the user is working with datasets — preparing for modeling, auditing data quality, or troubleshooting unexpected model behavior — apply this knowledge automatically.
Missing-value analysis:
Outlier detection:
Class imbalance:
Correlation and leakage:
Schema drift and stability:
When assisting with dataset profiling tasks:
revenue per Little's MCAR test") not just "missing values found"Data quality findings, outlier flags, and bias indicators produced through this plugin are drafts based on the dataset description provided. Real data may exhibit different patterns. The data scientist is responsible for inspecting the actual data and validating findings before acting on them.
More data-science AI tools and resources at https://theaicareerlab.com/professions/data-scientist
npx claudepluginhub alexclowe/awesome-claude-cowork-plugins --plugin data-scientistGenerates data profiles for pandas DataFrames with column stats, correlations, and missing patterns. Use for EDA and data discovery on new datasets.
References data quality dimensions with qsv checks and provides remediation decision tree for tabular CSV assessment and fixes.
Profiles unfamiliar datasets: schema structure, column distributions, null rates, cardinality, outliers, table relationships, and temporal coverage. Onboard new data sources, audit freshness, or discover foreign keys.