Skill

midnight-verify:zkir-regression

Run a curated set of verification claims against the current toolchain to detect behavioral changes. Each claim is verified through the normal verification pipeline (classify → dispatch agent → verify). Supports full sweep (all categories) and targeted sweep (single category). Invocable as /midnight-verify:zkir-regression or loadable as a sense-check when toolchain issues are suspected.

Popularity

Parent stars

Parent forks

Invocation

How this skill is triggered — by the user, by Claude, or both

Slash command

/midnight-verify:zkir-regression

User invocable

Model invocable

Inline context

Default effort

Context Preview

The summary Claude sees in its skill listing — used to decide when to auto-load this skill

Run a curated set of verification claims against the current toolchain to detect behavioral changes. Each claim is verified through the full pipeline — claim classification, contract writing, compilation, execution, and PLONK proof verification.

SKILL.md

115 lines · ~1.2k tokens

Stats

LanguageTypeScript

Parent stars10

Parent forks1

MaintenanceExcellent

Last CommitApr 3, 2026

Actions

View Source View Plugin View on GitHub View README

Stats

Actions

ZKIR Regression Sweep

Use this when:

A new compiler version or checker package version is released
An agent suspects unexpected behavior from the toolchain
You want a confidence check before presenting Midnight-specific claims

Step 1: Determine Mode

If $ARGUMENTS is empty → full sweep (all categories).

If $ARGUMENTS contains a category name → targeted sweep (that category only).

Valid categories: arithmetic, types, state, privacy, zk-proof, transcript.

Step 2: Record Toolchain Versions

compact compile --language-version
compact --version

Record both for the report header.

Step 3: Run Claims

For each claim in the list below (filtered by category if targeted):

Load the midnight-verify:verify-correctness skill to classify the claim domain
Load the appropriate domain skill
Dispatch the sub-agent(s) indicated by the domain skill's routing table
Collect the verdict

This follows the same flow as the /midnight-verify:verify command — you are the orchestrator for each claim.

Claim List

ID	Category	Claim	Expected Verdict
arith-1	arithmetic	A pure circuit that adds two Uint32 values (3 + 4) returns the correct sum (7)	Confirmed (tested)
arith-2	arithmetic	A pure circuit that multiplies two Uint32 values (3 * 5) returns the correct product (15)	Confirmed (tested)
types-1	types	Assigning a value of 256 to a Uint8 variable produces a compiler error	Confirmed (tested)
types-2	types	A pure circuit returning a tuple allows 0-indexed access to each element	Confirmed (tested)
state-1	state	A counter contract's increment circuit updates the ledger state by the specified amount	Confirmed (tested)
state-2	state	Reading a counter ledger value returns the current on-chain state	Confirmed (tested)
privacy-1	privacy	A circuit that writes to the ledger requires a disclose() call	Confirmed (tested)
zk-1	zk-proof	A counter contract's increment circuit passes the full PLONK proof verification	Confirmed (zkir-checked)
zk-2	zk-proof	Tampering with the public transcript of a verified circuit causes PLONK checker rejection	Confirmed (zkir-checked)
zk-3	zk-proof	The PLONK checker error for a tampered transcript identifies the exact mismatched input	Confirmed (zkir-checked)
transcript-1	transcript	A counter increment circuit encodes ledger operations in the publicTranscript	Confirmed (zkir-inspected)
transcript-2	transcript	The compiled ZKIR for a counter increment contains declare_pub_input instructions	Confirmed (zkir-inspected)

Step 4: Compare Results

For each claim, compare the actual verdict against the expected verdict:

Verdict matches expected → PASS
Verdict does not match → FAIL
Verification returned Inconclusive but expected Confirmed → FAIL (toolchain may be unavailable)

Step 5: Report

## ZKIR Regression Report

**Toolchain:** compact CLI vX.Y.Z, language version A.B.C
**Date:** YYYY-MM-DD
**Mode:** [full sweep / targeted: <category>]
**Ran:** N claims

### Results

| Category | Passed | Failed | Total |
|---|---|---|---|
| arithmetic | N | N | N |
| types | N | N | N |
| state | N | N | N |
| privacy | N | N | N |
| zk-proof | N | N | N |
| transcript | N | N | N |
| **Total** | **N** | **N** | **N** |

### Failures (if any)

**<claim-id>:** Expected <expected verdict>, got <actual verdict>
- Claim: "<claim text>"
- Actual result: [what the verification pipeline returned]
- Interpretation: [what this failure suggests about toolchain changes]

If there are zero failures, end with:

All N claims passed. Toolchain behavior matches expectations.

Adding New Claims

Add a row to the claim list table above. Each claim should:

Be verifiable through the normal /midnight-verify:verify pipeline
Have a deterministic expected verdict
Test a specific, observable behavior
Include the expected verdict qualifier (tested, zkir-checked, zkir-inspected, source-verified)

midnight-verify:zkir-regression

Popularity

Invocation

Context Preview

SKILL.md

midnight-verify:zkir-regression

Popularity

Invocation

Context Preview

SKILL.md

ZKIR Regression Sweep

Step 1: Determine Mode

Step 2: Record Toolchain Versions

Step 3: Run Claims

Claim List

Step 4: Compare Results

Step 5: Report

Adding New Claims

Similar Skills

ZKIR Regression Sweep

Step 1: Determine Mode

Step 2: Record Toolchain Versions

Step 3: Run Claims

Claim List

Step 4: Compare Results

Step 5: Report

Adding New Claims

Similar Skills