Skill

arco-era5-fetch

Fetch ARCO-ERA5 reanalysis (temperature, wind, precipitation, pressure, and more) for a date range and region from the public, credential-free Google Cloud Zarr store, and write a weather-skills envelope Zarr. Use when a task needs multi-variable gridded reanalysis ground truth for comparison, verification, or downstream clipping/aggregation/plotting.

Popularity

Stars

Invocation

How this skill is triggered — by the user, by Claude, or both

Slash command

/rhiza-forecasting:arco-era5-fetch

User invocable

Model invocable

Inline context

Default effort

Context Preview

The summary Claude sees in its skill listing — used to decide when to auto-load this skill

Opens the [ARCO-ERA5](https://github.com/google-research/arco-era5) analysis-ready

Supporting Files

scripts/fetch.pyscripts/fetch.py.lock

SKILL.md

157 lines · ~2.2k tokens

Stats

LanguagePython

Stars1

MaintenanceExcellent

Last CommitJun 15, 2026

Actions

View Source View Plugin View on GitHub View README

Stats

Actions

arco-era5-fetch

Opens the ARCO-ERA5 analysis-ready Zarr store (gs://gcp-public-data-arco-era5/ar/full_37-1h-0p25deg-chunk-1.zarr-v3), subsets it by bounding box, time range, and variables, maps it onto the weather-skills envelope analysis shape, and writes a consolidated Zarr store. The store is a uniform 0.25° equiangular lat/lon grid, hourly, opened with anonymous Google Cloud access — no credentials and no API queue.

When to use

A task needs multi-variable reanalysis (2m temperature, winds, precipitation, geopotential, pressure-level fields) as gridded observations/ground truth.
A downstream skill will clip, aggregate, compare, or plot the result as a weather-skills envelope Zarr.

Not a forecast — ERA5 is reanalysis. For forecast grids use ecmwf-fetch or dynamical-fetch.

Usage

uv run --script ${CLAUDE_SKILL_DIR}/scripts/fetch.py --start <date> --end <date> [--bbox N/W/S/E] [-v VAR ...] -o <path.zarr>

Arguments

--start, --end — inclusive date range. Each value is one of:
- an absolute ISO date YYYY-MM-DD;
- now or today — the current UTC date;
- latest — the newest date with available data;
- an offset now-<int>{d|w} or latest-<int>{d|w} — the base minus N (w = 7 days). The offset is capped at 36525 days; a larger value, a future + offset, a month/year unit, or any malformed value exits 2 before any network call.
Boundary handling: absolute endpoints and ordinary relative ranges are inclusive of both ends. The duration idiom — start B-<int>{d|w} paired with end exactly its own base B (both now, or both latest) — yields an N-day window inclusive of B, with the far edge shifted in by one. For any invocation using a relative token, the resolved concrete window is echoed to stderr before fetching. latest discovery runs at most once per invocation and only when a token references latest. The cache key records the resolved absolute dates, never the relative token.
--bbox — spatial subset N/W/S/E decimal degrees. Longitudes are normalized to the [-180, 180) convention, so negative west/east values select correctly on ERA5's native 0..360 grid. The slice follows each axis's own order, so any region works. A bbox whose west is greater than its east is read as crossing the antimeridian (e.g. 10/170/-10/-170): the cells with lon >= west or lon <= east are kept and the interior band is dropped, preserving the grid's native ascending longitude order. The two dateline-flanking spans (the cells near +180 followed by the cells near -180) stay in ascending order — monotonic, though not physically contiguous across the dateline — so downstream tools that assume a sorted longitude keep working. Omit to fetch the full native grid. To fetch over a country, get its bbox from the resolve-region skill.
--variable, -v — restrict to one data variable; repeat once per variable (-v 2m_temperature -v total_precipitation). Almost always supply at least one: ARCO-ERA5 carries hundreds of surface and pressure-level fields, and omitting --variable selects all of them. An unknown name exits 2 and prints the available variables.
--output, -o — output Zarr path (overwritten if it exists).

Output

A consolidated, fully CF-1.13 compliant weather-skills envelope analysis Zarr with a time dimension and dims (time, latitude, longitude) — plus level when a pressure-level variable is selected. The weather-skills envelope is a CF superset: the output passes CF first, then carries the weather_skills_history provenance key.

CF stamping on the output:

Global attrs: Conventions="CF-1.13", plus title, institution, source (the ARCO-ERA5 store path), references, and history.
Coordinates: latitude (standard_name=latitude, units=degrees_north, axis=Y), longitude (standard_name=longitude, units=degrees_east, axis=X), time (standard_name=time, axis=T, with CF units and calendar=proleptic_gregorian carried in the on-disk encoding). When a pressure-level variable is selected, level carries standard_name=air_pressure, units=hPa, and positive=down.
Data variables: every variable carries a udunits-valid units and a long_name. The ARCO store's own units/long_name/standard_name are forwarded; a few udunits-invalid unit placeholders the store uses ((0 - 1), ~, dimensionless, m of water equivalent) are normalized to a CF-valid spelling. Each variable's final units string is then validated against udunits: a variable whose source units are missing, or are non-parseable and not covered by that normalization map, is a hard failure — the fetcher exits non-zero naming the variable and its offending units rather than writing invalid units under a Conventions="CF-1.13" claim. A missing-units variable is never silently relabeled dimensionless. standard_name is set when the store supplies one or a curated value applies, and omitted otherwise (CF permits this).

Stamped with weather_skills_source=arco-era5.

Memory, performance, and error handling

--bbox and --variable are the levers that set how much data is pulled: the selection size is variables × levels × bbox cells × window length, read and written once — xarray prunes to the selection before pulling any array bytes, so an unselected variable or out-of-bbox cell is never materialized.

The selection is materialized into host memory in a single eager .to_zarr() load, so a very large request (no --bbox, all variables, a pressure-level variable spanning all 37 levels, or a long window) is slow and may exhaust memory. The fetcher does not refuse a request on a size estimate — what to fetch is your call. Keep a request affordable by selecting the specific variables you need, keeping --bbox tight, and keeping the window to days or a small number of weeks; run clip-region afterward to shrink further.

Errors are reported reactively. Store-open, network, and path failures emit a one-line actionable message instead of a raw traceback. If the eager load actually runs out of memory and raises a catchable MemoryError, the fetcher reports it and suggests narrowing with -v, --bbox, or a shorter window (under Linux memory overcommit an OOM may instead kill the process outright, printing only "Killed").

Provenance

The output stamps a JSON-encoded weather_skills_history attr: an append-only array of per-step entries {skill, version, args, input}. For this fetcher it is a length-1 array with skill="arco-era5-fetch" and input=null; downstream zarr-writing skills append their own entry. args records every option except the --output path — start, end, bbox, and variable — with the resolved concrete dates substituted for any relative token. version is this skill's version, also printed by --help. Inspect a written output's provenance with the provenance skill.

The args dict stores option names with underscores, not the hyphenated CLI flag names. A consumer reconstructing a uv run --script ${CLAUDE_SKILL_DIR}/scripts/fetch.py <args> invocation must translate underscore → hyphen.

Examples

# 2m temperature over Kenya for two days
uv run --script ${CLAUDE_SKILL_DIR}/scripts/fetch.py --start 2026-01-01 --end 2026-01-02 \
  --bbox 7/32/-6/43 -v 2m_temperature -o /tmp/arco.zarr

# Last week ending at the newest available time, two variables
uv run --script ${CLAUDE_SKILL_DIR}/scripts/fetch.py --start latest-1w --end latest \
  --bbox 23/-20/-37/59 -v 2m_temperature -v total_precipitation -o /tmp/arco_week.zarr

# A pressure-level variable for one day — adds a CF `level` (air_pressure) dim
uv run --script ${CLAUDE_SKILL_DIR}/scripts/fetch.py --start 2026-01-01 --end 2026-01-01 \
  --bbox 7/32/-6/43 -v temperature -o /tmp/arco_level.zarr

arco-era5-fetch

Popularity

Invocation

Context Preview

Supporting Files

SKILL.md

arco-era5-fetch

Popularity

Invocation

Context Preview

Supporting Files

SKILL.md

arco-era5-fetch

When to use

Usage

Arguments

Output

Memory, performance, and error handling

Provenance

Examples

Similar Skills

arco-era5-fetch

When to use

Usage

Arguments

Output

Memory, performance, and error handling

Provenance

Examples

Similar Skills