Verify is the verification layer for coding agents.

Verify checks what a coding agent claims it did — tests, files, commits, protected content — and makes it correct the record when the claim and the repository disagree.

Linters check your code. Test runners check your code. Nothing checks the agent's report of what it did, and that report is the thing you act on. You read "tests pass, branch pushed, nothing leaked," and you move on. If the report is wrong, everything downstream of it inherits the error. Verify checks the report against reality.

Built by Orthogon AI Labs.

V1 targets Claude Code. Codex is supported through a notification workflow, and a Cursor adapter is on the roadmap.

What makes it trustworthy

Verify is the thing that grades your agent, so it holds itself to the same standard.

It never reports a failure it didn't observe. When Verify can't check a claim — no test command, no python3, no upstream branch — it returns inconclusive and says so, instead of guessing. A tool that polices honesty has to be honest about its own limits.
It surfaces, it never fixes. Verify reports the mismatch and makes the agent revise its answer. It never rewrites your code. A verifier that edits would just produce more work that needs verifying.
It blocks once, then gets out of the way. One Stop-hook block, one correction. It stays quiet when no claims are made or when every claim checks out.
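
The first principle above amounts to a three-state result rather than a pass/fail boolean. The sketch below is one way to model it, not Verify's actual code: the names `Status`, `Observation`, `CheckResult`, and `judge` are hypothetical, and the only behavior taken from this README is that a check that could not run is reported as inconclusive rather than failed.

```typescript
// Hypothetical types for illustration; not Verify's actual API.
type Status = "verified" | "failed" | "inconclusive";

interface CheckResult {
  claim: string;
  status: Status;
  detail: string;
}

// What a verifier managed to observe. `command` or `exitCode` is left
// undefined when nothing could be run (for example, no test command found).
interface Observation {
  claim: string;
  command?: string;
  exitCode?: number;
}

function judge(obs: Observation): CheckResult {
  // Nothing was run, so nothing was observed: never report this as a failure.
  if (obs.command === undefined || obs.exitCode === undefined) {
    return {
      claim: obs.claim,
      status: "inconclusive",
      detail: "check could not be run",
    };
  }
  if (obs.exitCode === 0) {
    return {
      claim: obs.claim,
      status: "verified",
      detail: `\`${obs.command}\` exited 0`,
    };
  }
  // A failure is reported only with the evidence that was actually seen.
  return {
    claim: obs.claim,
    status: "failed",
    detail: `\`${obs.command}\` exited ${obs.exitCode}`,
  };
}
```

Keeping "inconclusive" as its own state means a missing dependency can never be folded into either "verified" or "failed" by accident.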

Install for Claude Code

From this repo:

claude --plugin-dir .

Once loaded, Verify runs automatically through Claude Code hooks. It stays quiet unless it catches a mismatch.

What Verify checks

Tests — detects claims like "tests pass" and runs your configured or autodetected test command.
Files — detects claims like "updated src/foo.ts" and checks the file was touched this session.
Git and PRs — detects claims like "committed", "pushed", or "opened a PR" and checks local git or gh when available.
Protected sections — detects claims like "protected sections are intact" and checks that blocks the user marked protected weren't silently overwritten. Pairs with the canon marker syntax; works standalone via a vendored checker.
Secrets — detects claims like "no secrets committed" or "safe to push" and scans the staged diff (and commits ahead of upstream on a push claim) for credential patterns. Reports the file, line, and pattern name — never the secret value.
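
As an illustration of the Secrets check, here is a minimal sketch of scanning a unified diff and reporting only the file, line, and pattern name. It is written under assumptions: `scanDiff`, `Finding`, and the two patterns are hypothetical stand-ins, and Verify's real pattern set and diff handling are not documented in this README.

```typescript
// Hypothetical sketch; pattern names and function names are illustrative.
interface Finding {
  file: string;
  line: number;    // line number in the new version of the file
  pattern: string; // pattern name only, never the matched text
}

const PATTERNS: { name: string; regex: RegExp }[] = [
  { name: "aws-access-key-id", regex: /AKIA[0-9A-Z]{16}/ },
  { name: "private-key-header", regex: /-----BEGIN (?:[A-Z]+ )?PRIVATE KEY-----/ },
];

function scanDiff(diff: string): Finding[] {
  const findings: Finding[] = [];
  let file = "";
  let line = 0;
  let inHunk = false;

  for (const raw of diff.split("\n")) {
    // A new file section resets hunk state so headers are parsed as headers.
    if (/^diff --git /.test(raw)) { inHunk = false; continue; }

    // Hunk header: "@@ -old,count +new,count @@" gives the new-file start line.
    const hunk = /^@@ -\d+(?:,\d+)? \+(\d+)(?:,\d+)? @@/.exec(raw);
    if (hunk) { line = Number(hunk[1]); inHunk = true; continue; }

    if (!inHunk) {
      // "+++ b/path" names the new version of the file.
      const header = /^\+\+\+ (?:b\/)?(.+)$/.exec(raw);
      if (header) file = header[1];
      continue;
    }

    const marker = raw.charAt(0);
    if (marker === "+") {
      const text = raw.slice(1);
      for (const p of PATTERNS) {
        // Record where the match is, but deliberately drop the matched value.
        if (p.regex.test(text)) findings.push({ file, line, pattern: p.name });
      }
      line += 1;
    } else if (marker === " ") {
      line += 1; // context lines exist in the new file too
    }
    // Removed ("-") lines do not advance the new-file line counter.
  }
  return findings;
}
```

Because a `Finding` has no field for the matched text, a report built from it cannot leak the credential it is warning about.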

Tests and types fail loudly; you see red and fix it. Verify is built for the silent failures — a quietly overwritten voice-rules block, a "pushed" that never happened. Those are the ones that cost you a week.

If Verify catches a mismatch, it blocks the Stop hook once and tells the agent to revise:

Verify found claim mismatches. Revise your final answer to include these verification results:
- Claimed tests passed, but `npm test` exited 1.
- Claimed protected sections were intact, but 1 block was modified: docs/voice.md (block: voice-rules).

Do not claim failed or unverified work succeeded.
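
A sketch of how a hook could produce that message and block only once. Two assumptions are in play. First, the `stop_hook_active` input field and the `{ decision: "block", reason }` output shape reflect Claude Code's Stop hook contract as commonly documented; check the current hooks reference before relying on them. Second, using `stop_hook_active` to enforce "block once" is a guess at the mechanism, since this README does not say how Verify tracks it. `decideStop` is a hypothetical name.

```typescript
// Field names follow Claude Code's Stop hook contract as I understand it
// (an assumption; confirm against the current hooks documentation).
interface StopHookInput {
  stop_hook_active?: boolean; // true when a Stop hook already blocked this turn
}

interface StopHookOutput {
  decision: "block";
  reason: string;
}

// Hypothetical helper: returns a block decision, or null to stay quiet.
function decideStop(
  input: StopHookInput,
  mismatches: string[],
): StopHookOutput | null {
  // No mismatches: every claim checked out or no claims were made.
  if (mismatches.length === 0) return null;

  // Already blocked once: get out of the way instead of looping.
  if (input.stop_hook_active) return null;

  const reason = [
    "Verify found claim mismatches. Revise your final answer to include these verification results:",
    ...mismatches.map((m) => `- ${m}`),
    "",
    "Do not claim failed or unverified work succeeded.",
  ].join("\n");

  return { decision: "block", reason };
}
```

A real hook script would read the input as JSON from stdin and, when the result is not null, print it as JSON on stdout.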

Codex notification workflow

Codex does not currently use the Claude Code Stop hook. The Codex plugin runs Verify as a notification workflow instead: it checks a final-answer draft and reports what was false, missing, or inconclusive. The example uses npm.cmd, the Windows launcher for npm; on macOS or Linux, run npm instead.

npm.cmd run codex:notify -- --message "I have run the tests and they all pass."

Verify notification: not done or unverified:
- FAILED: Claimed tests passed, but `npm test` exited 1.

The Codex plugin skill lives in skills/verify-claims/ and tells Codex to run this notification before any final answer that claims tests, file changes, commits, pushes, pull requests, or protected-section preservation.
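
To make "a final answer that claims tests, file changes, commits, pushes, pull requests, or protected-section preservation" concrete, here is a minimal sketch of claim detection over a draft. The regular expressions and the names `ClaimKind` and `detectClaims` are hypothetical; Verify's real detection rules are not shown in this README and are likely more careful than these.

```typescript
// Hypothetical claim kinds, named after the verifiers listed in this README.
type ClaimKind = "tests" | "files" | "git" | "protected";

// Illustrative patterns only; real detection would need to handle negation
// ("tests do not pass"), quoting, and many more phrasings.
const CLAIM_PATTERNS: { kind: ClaimKind; regex: RegExp }[] = [
  { kind: "tests", regex: /\btests?\b[^.]*\bpass(?:es|ed|ing)?\b/i },
  { kind: "files", regex: /\b(?:updated|created|edited|modified)\s+\S+\.\w+/i },
  { kind: "git", regex: /\b(?:committed|pushed|opened a (?:PR|pull request))\b/i },
  { kind: "protected", regex: /\bprotected sections?\b/i },
];

// Returns the kinds of claim found in the draft, in the order listed above.
function detectClaims(draft: string): ClaimKind[] {
  return CLAIM_PATTERNS
    .filter((p) => p.regex.test(draft))
    .map((p) => p.kind);
}
```

An empty result corresponds to the quiet path: with no claims detected, there is nothing to verify and nothing to report.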

Configuration

Add verify.config.json to the project being worked on:

{
  "test": {
    "command": "npm test",
    "timeoutMs": 120000
  },
  "enabledVerifiers": ["tests", "files", "git", "protected", "secrets"],
  "reportMode": "failures-only",
  "protected": {
    "allowed": [],
    "skipPaths": ["node_modules", "dist", "_archive"],
    "checkerPath": null
  },
  "secrets": {
    "skipPaths": ["node_modules", "dist", "_archive", "*.example", "*.test.*"],
    "allowPatterns": []
  },
  "receipt": {
    "history": false,
    "path": ".verify"
  }
}

Config precedence:

1. verify.config.json
2. .verify/config.json
3. autodetected test command

If no test command can be found, Verify marks the test check inconclusive instead of failing the run. The same rule holds for every verifier: a missing dependency is never reported as a lie.
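
The lookup order and the inconclusive fallback can be sketched as follows. This assumes the precedence list is ordered highest first and that the first source providing a test command wins outright; whether Verify merges the two config files key by key is not stated here. `resolveTestCommand`, `readJson`, and `autodetect` are hypothetical names, injected as parameters so the sketch stays free of filesystem code.

```typescript
// Hypothetical result type for illustration.
type Resolution =
  | { status: "found"; source: string; command: string }
  | { status: "inconclusive"; reason: string };

// Assumed to be ordered highest precedence first.
const CONFIG_PATHS = ["verify.config.json", ".verify/config.json"];

// Pulls `test.command` out of a parsed config, tolerating any shape.
function commandFrom(config: unknown): string | undefined {
  if (typeof config !== "object" || config === null) return undefined;
  const test = (config as { test?: unknown }).test;
  if (typeof test !== "object" || test === null) return undefined;
  const command = (test as { command?: unknown }).command;
  return typeof command === "string" && command.trim() !== "" ? command : undefined;
}

function resolveTestCommand(
  readJson: (path: string) => unknown, // returns undefined if the file is absent
  autodetect: () => string | undefined, // e.g. inspects the project's manifest
): Resolution {
  for (const path of CONFIG_PATHS) {
    const command = commandFrom(readJson(path));
    if (command !== undefined) return { status: "found", source: path, command };
  }
  const detected = autodetect();
  if (detected !== undefined) {
    return { status: "found", source: "autodetect", command: detected };
  }
  // No command anywhere: the test check is inconclusive, not failed.
  return { status: "inconclusive", reason: "no test command configured or detected" };
}
```

Returning a distinct inconclusive value, rather than throwing or defaulting to a guess, keeps a missing test command from being reported as a failed claim.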
