Run `datacontract test` against ODCS contracts in the project to verify the live Snowflake data still conforms to schema, quality rules, and freshness. Handles two kinds of contracts with different semantics: output-port contracts under `models/output_ports/**/*.odcs.yaml` (tested against this project's Snowflake — "am I still producing what I promised?") and input-port contracts under `models/input_ports/*.odcs.yaml` (tested against the upstream Snowflake server — "is upstream still producing what I trusted?"). Trigger when the user asks to "test the data contracts", "verify the data product matches its contract", "are we still contract-conformant", "check upstream drift", or "run the contract tests".
How this skill is triggered — by the user, by Claude, or both
Slash command
/dataproduct-builder-demo:datacontract-testThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
Run the **Data Contract CLI** (`datacontract test`) against contracts in the project to check whether the data currently in Snowflake still matches the schema and quality rules declared in the contract.
Run the Data Contract CLI (datacontract test) against contracts in the project to check whether the data currently in Snowflake still matches the schema and quality rules declared in the contract.
Two kinds of contracts live in this project and they test against different Snowflake servers:
models/output_ports/v<N>/*.odcs.yaml — what this data product commits to produce. They test against this project's Snowflake target. A failure means we are no longer producing what we promised.models/input_ports/*.odcs.yaml — cached snapshots of what we trust upstream to produce. They test against the upstream provider's Snowflake server, using the server block from the upstream ODCS. A failure means upstream drifted from the contract we trusted; the consequence is that our output may break too. Treat input-port failures as an upstream incident, not a local bug.dataproduct-implement and want to re-test without re-running dbt → this skill.--logs.
${PLUGIN_ROOT}below refers to the root of this plugin — the directory that containsskills/. On Claude Code it is set automatically as${CLAUDE_PLUGIN_ROOT}; use that. On any other agent (Codex, Copilot CLI, etc.) it is unset; resolve it as../..relative to thisSKILL.mdfile's directory (i.e. the grandparent ofskills/<this-skill>/).
Before running Step 0, print this plan to the user verbatim:
Running datacontract-test. I'll:
- Pre-checks: confirm
datacontractis on PATH and Snowflake credentials are in the environment.- Pick which contract(s) to test — defaults to all
models/output_ports/**/*.odcs.yamlandmodels/input_ports/*.odcs.yaml.- Pick the server (defaults to
productionif the contract defines one).- Run
datacontract testper contract and capture the result.- Report pass/fail with per-rule detail; flag missing credentials separately from real failures.
Then proceed.
Confirm uv run --quiet datacontract --version succeeds from the project root. If it fails, run uv sync (the bootstrap template seeds datacontract-cli[snowflake] as a dev dep in pyproject.toml) and retry. If uv sync still doesn't make it available, stop and tell the user to verify datacontract-cli[snowflake] is listed in pyproject.toml's [dependency-groups].dev. Do not propose uv tool install here — per-project venv is the convention. Use uv run datacontract … for every CLI invocation in this skill.
Confirm at least one *.odcs.yaml exists under models/output_ports/**/ or models/input_ports/. If not, stop and tell the user there is nothing to test.
Confirm Snowflake credentials for the Data Contract CLI are in the environment. The CLI reads DATACONTRACT_SNOWFLAKE_* env vars, not ~/.dbt/profiles.yml. The minimum set:
DATACONTRACT_SNOWFLAKE_USERNAME
DATACONTRACT_SNOWFLAKE_PASSWORD # or use JWT / SSO (see below)
DATACONTRACT_SNOWFLAKE_ROLE
DATACONTRACT_SNOWFLAKE_WAREHOUSE
If any are unset, surface the list and ask whether to continue (the CLI will fail-fast on that server) or stop. Do not try to source credentials yourself.
models/output_ports/**/*.odcs.yaml and models/input_ports/*.odcs.yaml.CONTRACTS. For each entry, also remember its role (output or input) — Step 4 surfaces failures differently.For each contract in CONTRACTS:
production. If production is not defined, ask the user which one.--server all if the user explicitly asks to test every server.For each contract:
uv run datacontract test <path-to-contract>.odcs.yaml --server <server> --logs
Where <path-to-contract> is the file resolved in Step 1 — typically models/output_ports/v<N>/<file>.odcs.yaml for output contracts, or models/input_ports/<file>.odcs.yaml for input contracts. The CLI does not care which directory; the role only matters for how Step 4 reports the result.
--logs ensures per-rule failure detail is in stdout — without it the CLI only prints a summary.--output ./test-results/<contract>.xml --output-format junit.--publish $API/test-results where $API is the Entropy Data host. Do not publish by default — it writes server-side state.Run sequentially, not in parallel — the warehouse is the bottleneck and parallel runs muddy the log output.
End with this two-part recap. Use the Status enum: passed, failed, skipped (missing creds).
Part 1 — outcome table. One row per contract tested. Group the rows: output-port contracts first, then input-port contracts under a sub-header (so the reader sees the two roles at a glance).
| Contract | Role | Server | Result | Failures | Details |
|---|---|---|---|---|---|
<contract-file> | output / input | <server> | passed / failed / skipped | count or — | one line per failing rule (field + rule), or "missing env var: …" if skipped |
Part 2 — next steps. Bullet list, include only what applies. Treat output vs. input failures differently:
orders.order_id: not_null violated for 17 rows). The fix is in this project — either the dbt model is wrong, the contract is wrong, or the data is wrong. If the user wants a follow-up SQL to find the offending rows, suggest the shape but do not run it.dataproduct-implement once upstream republishes a corrected contract, so the cached snapshot under models/input_ports/ refreshes.skipped row, the exact env vars the user needs to set, and where to get them (usually the warehouse admin).If everything passed, write a single line: All <N> contracts pass against <server>.
The Data Contract CLI reads credentials from environment variables, not from the contract file. Only the connection topology (account, database, schema, warehouse) belongs in the servers block.
ODCS server block:
servers:
- server: production
type: snowflake
account: abcdefg-xn12345
database: ORDER_DB
schema: ORDERS_PII_V2
Any env var prefixed DATACONTRACT_SNOWFLAKE_ is forwarded to the Snowflake connector with the prefix stripped and the rest lowercased, so any Snowflake/Soda parameter can be passed this way. Three auth modes:
Password auth
export DATACONTRACT_SNOWFLAKE_USERNAME=...
export DATACONTRACT_SNOWFLAKE_PASSWORD=...
export DATACONTRACT_SNOWFLAKE_WAREHOUSE=COMPUTE_WH
export DATACONTRACT_SNOWFLAKE_ROLE=DATA_CONTRACT_TEST
Private key (JWT) auth — used for service accounts and CI:
export DATACONTRACT_SNOWFLAKE_USERNAME=SVC_DATACONTRACT
export DATACONTRACT_SNOWFLAKE_AUTHENTICATOR=SNOWFLAKE_JWT
export DATACONTRACT_SNOWFLAKE_PRIVATE_KEY_PATH=/secrets/snowflake_rsa.p8
# Only if the key is encrypted:
export DATACONTRACT_SNOWFLAKE_PRIVATE_KEY_PASSPHRASE=...
export DATACONTRACT_SNOWFLAKE_WAREHOUSE=COMPUTE_WH
export DATACONTRACT_SNOWFLAKE_ROLE=DATA_CONTRACT_TEST
External browser SSO — interactive, for local runs against an IdP-backed account:
export [email protected]
export DATACONTRACT_SNOWFLAKE_AUTHENTICATOR=externalbrowser
export DATACONTRACT_SNOWFLAKE_WAREHOUSE=COMPUTE_WH
export DATACONTRACT_SNOWFLAKE_ROLE=DATA_CONTRACT_TEST
Not usable in CI — it opens a browser window.
datacontract test which executes SELECT queries; it never writes. Do not invoke datacontract publish, datacontract export, or entropy-data datacontracts put from this skill..env, dotfiles, or anywhere else on the user's behalf.npx claudepluginhub entropy-data/dataproduct-builder-demo --plugin dataproduct-builder-demoSearches MemPalace before answering questions about past work, people, projects, or prior decisions. Returns verbatim stored content instead of guessing from model memory.
Guides Payload CMS config (payload.config.ts), collections, fields, hooks, access control, APIs. Debugs validation errors, security, relationships, queries, transactions, hook behavior.
Implements vector databases with Pinecone, Weaviate, Qdrant, Milvus, pgvector for semantic search, RAG, recommendations, and similarity systems. Optimizes embeddings, indexing, and hybrid search.