From datascience
Data science project conventions and standards. Auto-triggered when working with data science projects, notebooks, datasets, ML models, PyTorch, Polars, sklearn, or any data analysis workflow. Applies naming conventions, file format standards, project structure rules, and code patterns.
How this skill is triggered — by the user, by Claude, or both
Slash command
/datascience:datascienceThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
Conventions for data science projects.
Conventions for data science projects.
Pattern: NN-initials-description.py (Jupytext percent format)
01, 02, 03kj for Konrad Jelendata-exploration, train-yolov8m01-kj-data-exploration.py, 04-kj-train-yolov8m.pySequential numbering within groupings. Archive obsolete to @archive/. Never delete. temp_ prefix for temporary notebooks excluded from Git.
Jupytext percent format (.py) = source of truth. # %% for code, # %% [markdown] for markdown. Add *.ipynb to .gitignore. Enables git diffs and code review.
data/raw/ # Original immutable datasets (never modify)
data/interim/ # Intermediate transformed data
data/processed/ # Final canonical datasets
data/external/ # Third-party data
notebooks/ # Jupytext notebooks
src/ # Reusable Python modules extracted from notebooks
models/ # Trained model artifacts
reports/ # Generated analysis and figures
models/<model_name>/
model.pt # TorchScript (torch.jit.load, no class needed, for inference)
checkpoint.pt # State dict (needs class definition, for retraining)
Folder rolling: current → -1 → -2, up to 5 versions.
__init__.py. Always explicit module importspurpose_df for DataFrames, purpose_lf for LazyFramesfrom rich.jupyter import printpl.LazyFrame + collect()) for large datasetssklearn.model_selection.train_test_split over manualfigsize=(12, 6), matplotlib + seabornStart thorough: data types, missing values, basic statistics, visual exploration (histograms, scatter, box plots), testable hypotheses. High-dimensional: try UMAP/t-SNE.
npx claudepluginhub stellarshenson/claude-code-plugins --plugin datascienceProvides UI/UX resources: 50+ styles, color palettes, font pairings, guidelines, charts for web/mobile across React, Next.js, Vue, Svelte, Tailwind, React Native, Flutter. Aids planning, building, reviewing interfaces.
Searches MemPalace before answering questions about past work, people, projects, or prior decisions. Returns verbatim stored content instead of guessing from model memory.