Make large gridded/meteorological array computes fast and portable on ANY HPC. Detects the cluster's backends (netCDF4, HDF5, zarr, NCZarr, PnetCDF), MPI/parallel-HDF5, filesystem (NFS/Lustre/GPFS), and core/NUMA/RAM layout, then recommends + applies the optimal chunking, read pattern, storage backend, and worker count. 1 skill + 3 commands + 4 CLI/JSON tools.
Detect the HPC environment (backends, MPI, filesystem, cores/NUMA) and recommend an I/O + parallel strategy
Diagnose a slow gridded-data pipeline and apply the detect → fix-I/O → parallelize workflow
Rechunk netCDF file(s) to their access pattern (data-preserving) so scattered reads are fast
A Claude Code skill that makes large gridded (climate / meteorological) array computes fast and portable on any HPC by fixing the two things that actually slow them down: I/O (chunking + read patterns) and parallelism (worker sizing + load balance). It detects your cluster's available backends and hardware, then recommends and applies the right strategy — so the same workflow works on NFS laptops, NUMA fat-nodes, and Lustre/MPI clusters alike.
The core lesson: a "slow parallel compute" is usually slow serial I/O (redundant decompression), not slow math. This skill makes you profile I/O before blaming compute or adding cores. In the case that motivated it, a 3.5-hour "compute" was 99 % netCDF decompression; a contiguous-slab read was 77× faster, and adding cores would have made it worse.
/plugin marketplace add yanxingjianken/hpc-netcdf-parallelization
/plugin install hpc-netcdf-parallelization
Then run /hpc-detect on your cluster, or just describe a slow netCDF/parallel job and the skill
activates. (Traditional flow: clone and symlink the skill dir into ~/.claude/skills/ or your agent's
skills path.)
detect_env.py probes netCDF4 / HDF5 / zarr / NCZarr / PnetCDF,
MPI + parallel-HDF5, the filesystem (NFS/Lustre/GPFS), and cores / NUMA / RAM, then prints a JSON
recommendation (write backend, worker count for memory- vs CPU-bound work, NUMA + multi-node path).isel(time=[…]); access-pattern
chunking on write (time=1); data-preserving rechunking of existing files; backend choice.pmap (process or Dask), combinable
per-pixel (sum, valid-count) accumulators for correct means, BLAS pinning, NUMA pinning, and
worker counts sized to the memory-bandwidth knee (not the core count).| Tool | Purpose |
|---|---|
scripts/detect_env.py | Environment + backend detection → JSON recommendation (--human for a table) |
scripts/read_slab.py | Contiguous-slab reader + bench (scattered vs slab on your file) |
scripts/rechunk.py | Data-preserving rechunk to the access pattern (nccopy/xarray, --verify) |
scripts/parallel_map.py | Load-balanced pmap + combinable-mean add_accumulators, BLAS-pinned |
/hpc-detect — report the cluster's backends + recommendation./hpc-rechunk — benchmark + rechunk file(s) to fast access-pattern chunks./hpc-parallelize — diagnose a slow pipeline and apply detect → fix-I/O → parallelize.PY=python # any env with xarray + netCDF4 (zarr/dask optional)
$PY hpc-netcdf-parallelization/scripts/detect_env.py --human
$PY hpc-netcdf-parallelization/scripts/read_slab.py bench myfile.nc --vars t u v --n 30
$PY hpc-netcdf-parallelization/scripts/rechunk.py in.nc out.nc --chunks "time/1,lev/9,lat/192,lon/288" --verify
from parallel_map import pmap, add_accumulators
import functools, numpy as np
parts = pmap(process_event, events, n_workers=96) # fine-grained, BLAS-pinned
total = functools.reduce(add_accumulators, parts) # (sum, count) merge
mean = {v: np.where(c > 0, s / c, np.nan) for v, (s, c) in total.items()}
hpc-netcdf-parallelization/
SKILL.md # the skill (when/how to use)
scripts/ # detect_env, read_slab, rechunk, parallel_map (CLI + JSON)
references/ # chunking.md, backends.md, parallel.md
examples/ # example_workflow.md (worked diagnosis)
commands/ # /hpc-detect, /hpc-rechunk, /hpc-parallelize
.claude-plugin/ # plugin.json + marketplace.json
MIT © Xingjian (Ken) Yan. Built from real CESM2-LENS2 / ERA5 blocking-pipeline debugging on the dolma HPC.
Own this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimOwn this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimBased on adoption, maintenance, documentation, and repository signals. Not a security audit or endorsement.
npx claudepluginhub yanxingjianken/hpc-netcdf-parallelization --plugin hpc-netcdf-parallelizationA growing collection of Claude-compatible academic workflow bundles. Covers scientific figures, manuscript writing and polishing, reviewer assessment, citation retrieval, data availability, paper reading, literature search, response letters, paper-to-PPTX conversion, and evidence-grounded Chinese invention patent drafting. Rules are organized as reusable skill folders with explicit workflows and quality checks.
Comprehensive skill pack with 66 specialized skills for full-stack developers: 12 language experts (Python, TypeScript, Go, Rust, C++, Swift, Kotlin, C#, PHP, Java, SQL, JavaScript), 10 backend frameworks, 6 frontend/mobile, plus infrastructure, DevOps, security, and testing. Features progressive disclosure architecture for 50% faster loading.
Comprehensive feature development workflow with specialized agents for codebase exploration, architecture design, and quality review
Harness-native ECC operator layer - 67 agents, 271 skills, 92 legacy command shims, reusable hooks, rules, selective install profiles, and production-ready workflows for Claude Code, Codex, OpenCode, Cursor, and related agent harnesses
Real-time statusline HUD for Claude Code - context health, tool activity, agent tracking, and todo progress
Tools to maintain and improve CLAUDE.md files - audit quality, capture session learnings, and keep project memory current.