Skill

aggregate-temporal

Roll up a weather-skills envelope Zarr along its time axis (or forecast step axis) into fixed windows (daily, weekly, dekadal, monthly) with a chosen reducer. Use whenever any dataset needs to be resampled to a canonical aggregation period before plotting or comparison.

Popularity

Stars

Invocation

How this skill is triggered — by the user, by Claude, or both

Slash command

/rhiza-forecasting:aggregate-temporal

User invocable

Model invocable

Inline context

Default effort

Context Preview

The summary Claude sees in its skill listing — used to decide when to auto-load this skill

Source-agnostic temporal aggregation. Works on:

Supporting Files

scripts/aggregate.pyscripts/aggregate.py.lock

SKILL.md

172 lines · ~2.2k tokens

Stats

LanguagePython

Stars1

MaintenanceExcellent

Last CommitJun 15, 2026

Actions

View Source View Plugin View on GitHub View README

Stats

Actions

aggregate-temporal

Source-agnostic temporal aggregation. Works on:

Observation envelopes with a time dim (e.g. CHIRPS, IMERG, TAHMO).
Forecast envelopes with a step dim (e.g. ECMWF S2S).

Autodetects which dim is present. For forecasts, aggregates ensemble members (number) independently.

When to use

Turning daily/half-hourly observations into weekly or dekadal totals.
Selecting weekly or dekadal subsets of a forecast initialized at multiple steps.

Usage

uv run --script ${CLAUDE_SKILL_DIR}/scripts/aggregate.py --input <in.zarr> --output <out.zarr> \
    --period daily|weekly|dekadal|monthly [--method sum|mean|max|min] \
    [--variable VAR ...] [--time-dim DIM] [--anchor-end YYYY-MM-DD]

Arguments

--input, -i — input Zarr.
--output, -o — output Zarr.
--period — window size: daily (1d), weekly (7d), dekadal (10d), monthly (calendar month for the default forward-anchored resample; 30-day approximation when combined with --anchor-end).
--method — reducer: sum (default for totals), mean, max, min.
--variable, -v — repeatable; restricts aggregation to the named data variable(s). Each name must be a data variable of the input; an unknown or non-data-variable name exits non-zero and lists the valid data variables. The selected data variable(s) are aggregated and relabeled as usual; the other data variables are dropped from the output (a stderr note lists which), and coordinates pass through. Default (unset) aggregates all data variables.
--time-dim — override; by default uses time if present, else step.
--anchor-end — ISO date (YYYY-MM-DD) used to anchor the LAST bin on the obs/time-resample path (no effect on the forecast step path). See "Anchor end" below.

Method and intensive quantities

--method sum adds the values within each window into a period total, which is meaningful only for an extensive quantity (an amount that accumulates, e.g. precipitation depth in mm or kg m**-2). When --method sum is requested on a variable that is clearly an intensive quantity, the skill exits with status 2 before any output is written and names the variable and the signal that marks it as intensive. Detection is conservative — it fires only on high-confidence signals and leaves ambiguous metadata to proceed:

a standard_name of air_temperature or any name ending in _temperature;
temperature units (K, degK, degC, Celsius, degree_Celsius, °C);
pressure units (Pa, hPa, mbar, bar) when the standard_name also indicates pressure;
a percentage (%, percent).

The other reducers (mean, max, min) are always accepted, and precipitation (mm, kg m**-2) with --method sum proceeds.

Units after sum

--method sum of a precipitation rate produces an accumulated depth: the sum of N daily mm/day values is the total depth in mm over those N days. So after a sum, any variable whose units are a recognized per-day depth rate is relabeled to the extensive depth, and a known precipitation-rate standard_name is remapped to its accumulated-amount form:

units mm/day (also mm day-1, mm/d, and kg m**-2/day spellings) → the bare depth (mm, kg m**-2);
standard_name is kept consistent with the new depth units (a rate name on depth units is invalid CF metadata): the known lwe_precipitation_rate → lwe_thickness_of_precipitation_amount; any other rate-shaped name (a _rate/_flux suffix) with no known amount equivalent is dropped with a stderr warning rather than left contradicting the units; a non-rate or absent standard_name is left unchanged.

This keeps downstream plot axis labels correct (e.g. a weekly total reads [mm], not [mm/day]) and keeps output metadata self-consistent for any input, including user-provided datasets. Scope is deliberately narrow:

Only --method sum relabels; mean/max/min keep the rate units (a window mean of mm/day is still a rate in mm/day).
Only the /day denominator family is recognized. The sum-equals-total identity holds only when each sample spans exactly one denominator unit, which is true for daily-cadence inputs but not for sub-daily cadence; /hr and /s rates are left untouched rather than silently mis-totaled.
Already-extensive units (mm, kg m**-2) and unrecognized units are left unchanged, as is any unrecognized or absent standard_name.

The numeric values are never altered — this is a metadata-only relabel.

Output

Same variables; the time/step axis is replaced by the aggregated window.

Provenance

The output stamps a JSON-encoded weather_skills_history attr: an append-only array of per-step entries {skill, version, args, input}. This skill reads the upstream input's weather_skills_history (default [] and stderr warning if absent) and appends its own entry. args is the argparse namespace minus the --input/--output path strings; input is a {basename, hash} dict — basename is the upstream zarr's filename and hash is a sha256 of its stored bytes, so a renamed-but-unchanged input still cache-hits and a same-named-but-modified input correctly cache-misses; version is the _SKILL_VERSION constant in scripts/aggregate.py, kept in lockstep with metadata.version in this SKILL.md by the CI version-bump workflow. Cache-hit comparison reads the existing output's weather_skills_history: a hit requires the upstream entries to match and the last entry's skill, args, and input to match the proposed new entry.

The args dict stores argparse dest names (underscored, e.g. time_dim, target_resolution, anchor_end), not the hyphenated CLI flag names (--time-dim, --target-resolution, --anchor-end). A consumer reconstructing a uv run --script ${CLAUDE_SKILL_DIR}/scripts/<skill>.py <args> invocation must translate underscore → hyphen.

Step coordinate convention

For forecast (step) inputs, the output step coord is the right edge of each aggregation bucket — i.e. each value is labeled with the end of the period it covers. Buckets are left-open and right-closed ((left, right]), so a step value sitting on a period boundary (e.g. step=7d for end-of-period labeled data like a deaccumulated forecast) lands in the bucket it physically belongs to. Trailing partial buckets that would extend past the input's last step are dropped rather than synthesized.

Anchor end

By default, the obs/time path delegates to xr.resample, which anchors bins forward from the start of the input time axis. Pass --anchor-end YYYY-MM-DD to instead anchor the LAST bin to end on that date: bins are period-day windows ((left, right]) synthesized backward from --anchor-end while their left edge is >= the input's earliest timestamp. Partial bins at the start whose left falls before the input range are dropped. The output time coord for each bin is the bin's right edge (matching the right-edge convention used for forecast step aggregation), so the last bin's label is exactly --anchor-end.

Caveat for monthly: with --anchor-end, monthly bins are 30-day fixed-width windows, not calendar months. Without --anchor-end, monthly continues to mean calendar months (xr.resample("MS")).

Example — anchor the last weekly bin to end on 2026-05-12:

uv run --script ${CLAUDE_SKILL_DIR}/scripts/aggregate.py -i /tmp/imerg.zarr -o /tmp/imerg_weekly.zarr \
    --period weekly --method sum --anchor-end 2026-05-12

Cumulative-since-init variables

For variables that are stored as cumulative-since-init (e.g. ECMWF S2S tp), run the deaccumulate skill before aggregate-temporal so each step value is per-period rather than cumulative. Running --method sum directly on an accumulated variable double-counts earlier steps and inflates totals.

Examples

uv run --script ${CLAUDE_SKILL_DIR}/scripts/aggregate.py -i /tmp/imerg.zarr -o /tmp/imerg_dekadal.zarr \
    --period dekadal --method sum

uv run --script ${CLAUDE_SKILL_DIR}/scripts/aggregate.py -i /tmp/ecmwf.zarr -o /tmp/ecmwf_weekly.zarr \
    --period weekly --method sum

aggregate-temporal

Popularity

Invocation

Context Preview

Supporting Files

SKILL.md

aggregate-temporal

Popularity

Invocation

Context Preview

Supporting Files

SKILL.md

aggregate-temporal

When to use

Usage

Arguments

Method and intensive quantities

Units after sum

Output

Provenance

Step coordinate convention

Anchor end

Cumulative-since-init variables

Examples

Similar Skills

aggregate-temporal

When to use

Usage

Arguments

Method and intensive quantities

Units after sum

Output

Provenance

Step coordinate convention

Anchor end

Cumulative-since-init variables

Examples

Similar Skills