Skill

checkpoint

Phase boundary audit that runs all gates and all audit agents on the entire codebase, establishes quality baselines, and enforces ratcheting (finding count cannot increase between phases). Use when the user says "phase is done", "wrap up this phase", "milestone check", or when all work orders in a phase are complete.

Popularity

Parent stars

Parent forks

Invocation

How this skill is triggered — by the user, by Claude, or both

Slash command

/vibeos:checkpoint

User invocable

Model invocable

Inline context

Default effort

Tool Access

This skill is limited to the following tools:

ReadWriteEditGlobGrepBashAgentAskUserQuestion

Context Preview

The summary Claude sees in its skill listing — used to decide when to auto-load this skill

Run all quality gates and all audit agents on the entire codebase at phase boundaries. Establish baselines, enforce quality ratcheting, and check for product/standards drift.

SKILL.md

224 lines · ~2.1k tokens

Stats

LanguageShell

Parent stars2

Parent forks1

MaintenanceExcellent

Last CommitApr 5, 2026

Actions

View Source View Plugin View on GitHub View README

Stats

Actions

/vibeos:checkpoint — Phase Boundary Audit (Layer 5)

Run all quality gates and all audit agents on the entire codebase at phase boundaries. Establish baselines, enforce quality ratcheting, and check for product/standards drift.

Communication Contract

Follow the full USER-COMMUNICATION-CONTRACT.md (docs/USER-COMMUNICATION-CONTRACT.md). Key rules:

Lead with outcome, follow with mechanism
Present decisions with consequences
Introduce every concept on first use with plain English definition

Skill-specific addenda:

Report progress as each gate suite and auditor completes
Explain any ratchet violations clearly: what regressed, by how much, which files
Explain outcomes in plain English first; add technical detail only when it helps the user understand the regression
Every checkpoint decision must include options, pros, cons, and a recommendation

Prerequisites

Before starting, verify these exist:

project-definition.json
docs/planning/DEVELOPMENT-PLAN.md
Source code in the project

If no source code exists, report "No source code to checkpoint" and stop.

Checkpoint Flow

Step 1: Determine Phase

If $ARGUMENTS specifies a phase number, use that. Otherwise:

Read docs/planning/DEVELOPMENT-PLAN.md
Find the most recently completed phase (all WOs Complete)
Use that phase number

Step 2: Run All Quality Gates

Run the full gate suite on the entire codebase:

bash ".vibeos/scripts/gate-runner.sh" pre_commit --project-dir "${CLAUDE_PROJECT_DIR:-.}"

Collect results: pass/fail per gate, total pass count.

Step 3: Run Full Audit Cycle

Dispatch all 8 audit agents following the same protocol as skills/audit/SKILL.md:

Dispatch all 8 agents (security, architecture, correctness, test, evidence, product-drift, red-team, contract-validator)
Collect structured findings from each
Apply consensus logic (2+ agents = true positive, 1 = warning)
Generate composite findings list

Scale-aware dispatch: For codebases over 15K lines, use module-targeted dispatch:

Identify the top-level modules (e.g., src/gateway/, src/orchestrator/, frontend/src/)
Dispatch auditors per-module rather than per-audit-type
Each auditor gets a focused scope
Aggregate findings across modules after all agents complete
Run cross-cutting checks (circular dependencies, contract mismatches) as a separate pass
Timeout scaling: base 120s per module per auditor, +30s per 5K lines, max 600s per agent

Step 4: Load Previous Baseline

Check for previous phase baseline:

.vibeos/baselines/phase-[N-1]-baseline.json

If no previous baseline exists (first checkpoint), skip ratchet comparison.

Baseline schema:

{
  "phase": N,
  "date": "ISO-8601",
  "gates": {
    "total": N,
    "passed": N,
    "failed": N,
    "gate_results": [{"name": "gate-name", "status": "pass|fail"}]
  },
  "findings": {
    "critical": N,
    "high": N,
    "medium": N,
    "low": N,
    "info": N,
    "total": N,
    "true_positives": N,
    "warnings": N
  },
  "auditors": {
    "security": {"status": "complete|failed", "findings": N},
    "architecture": {"status": "complete|failed", "findings": N},
    "correctness": {"status": "complete|failed", "findings": N},
    "test": {"status": "complete|failed", "findings": N},
    "evidence": {"status": "complete|failed", "findings": N},
    "product-drift": {"status": "complete|failed", "findings": N}
  }
}

Step 5: Apply Ratchet

Compare current results against previous baseline using dual ratchet (count-based + finding-level):

5a. Count-based ratchet (aggregate quality):

Gate pass count must be >= previous (cannot have more gate failures)
Critical finding count must be <= previous (cannot introduce critical issues)
High finding count must be <= previous
Total finding count must be <= previous (overall quality cannot decrease)

5b. Finding-level ratchet (precision tracking, if .vibeos/findings-registry.json exists):

bash ".vibeos/convergence/baseline-check.sh" check \
  --mode finding-level \
  --baseline-file ".vibeos/baselines/midstream-baseline.json" \
  --current-findings-file ".vibeos/findings-registry.json"

This detects:

New findings: Not in baseline — blocks even if total count unchanged (swap detection)
Fixed findings: In baseline but no longer present — locked in via ratchet
Tracked findings: In baseline and still present — allowed, not blocking

Both ratchets run in parallel: count-based ensures aggregate quality only improves, finding-level ensures specific findings are tracked and swaps are detected.

Ratchet result:

PASS: Both ratchets pass (counts improved or stable AND no new findings)
FAIL: Either ratchet fails

If ratchet fails, report which categories regressed with consequences:

"Quality regression detected — your codebase has more issues now than at the end of the previous phase:

Critical findings: [prev] → [current] (+[delta])

[category]: [prev] → [current] (+[delta])

New findings detected: [finding IDs] (not in baseline, even though total count unchanged)

Your options:

Fix the regressions — I'll identify the changes that introduced the new issues and correct them.

Pros: keeps quality improving and preserves the ratchet

Cons: adds more work before the phase can close

Technical note: the current phase baseline stays unchanged until the regressions are cleared

Review and accept — You inspect each new finding and explicitly choose which ones to keep.

Pros: useful when a trade-off is intentional and understood

Cons: those issues become the new normal and will stop being flagged as regressions

Technical note: accepted findings update the baseline

Roll back and rebuild — Revert the changes that introduced regressions and try a different implementation path.

Pros: cleanest recovery path when the current approach is fundamentally flawed

Cons: loses recent work and takes longer

Technical note: this restores the codebase to the previous known-good state before rebuilding

I recommend option 1 because regressions are usually unintentional and fixing them now prevents compounding issues in later phases."

Step 6: Store New Baseline

Save current results as the baseline for this phase:

mkdir -p .vibeos/baselines

Write to .vibeos/baselines/phase-[N]-baseline.json using the schema above.

Step 7: Generate Phase Report

Write the report to stdout and save to .vibeos/baselines/phase-[N]-report.md:

## Phase [N] Checkpoint Report

**Date:** [today]
**Phase:** [N] — [phase name]
**WOs completed:** [list]

### Gate Results

| Gate | Status |
|---|---|
| [gate-name] | PASS/FAIL |

**Total:** [passed]/[total] gates passing

### Audit Findings

| Severity | Count | Consensus | Warnings |
|---|---|---|---|
| Critical | [N] | [N] true positives | [N] warnings |
| High | [N] | [N] true positives | [N] warnings |
| Medium | [N] | [N] true positives | [N] warnings |
| Low | [N] | [N] true positives | [N] warnings |

### Baseline Comparison

| Metric | Previous (Phase [N-1]) | Current (Phase [N]) | Delta | Status |
|---|---|---|---|---|
| Gates passing | [N] | [N] | [+/-N] | PASS/FAIL |
| Critical findings | [N] | [N] | [+/-N] | PASS/FAIL |
| High findings | [N] | [N] | [+/-N] | PASS/FAIL |
| Total findings | [N] | [N] | [+/-N] | PASS/FAIL |

### Ratchet Status: [PASS/FAIL]

### Overall Assessment

[1-3 sentence plain English assessment]

**Recommendation:** [proceed to Phase N+1 / fix regressions first]

Error Handling

If gate-runner fails: log error, continue with audit agents
If an audit agent fails: log failure, continue with remaining agents
If all agents fail: report failure, suggest checking project-definition.json
If baseline file is corrupted: treat as first checkpoint (no ratchet comparison)

checkpoint

Popularity

Invocation

Tool Access

Context Preview

SKILL.md

checkpoint

Popularity

Invocation

Tool Access

Context Preview

SKILL.md

/vibeos:checkpoint — Phase Boundary Audit (Layer 5)

Communication Contract

Prerequisites

Checkpoint Flow

Step 1: Determine Phase

Step 2: Run All Quality Gates

Step 3: Run Full Audit Cycle

Step 4: Load Previous Baseline

Step 5: Apply Ratchet

Step 6: Store New Baseline

Step 7: Generate Phase Report

Error Handling

Similar Skills

/vibeos:checkpoint — Phase Boundary Audit (Layer 5)

Communication Contract

Prerequisites

Checkpoint Flow

Step 1: Determine Phase

Step 2: Run All Quality Gates

Step 3: Run Full Audit Cycle

Step 4: Load Previous Baseline

Step 5: Apply Ratchet

Step 6: Store New Baseline

Step 7: Generate Phase Report

Error Handling

Similar Skills