Skill

measure

Run a workflow N times and compute aggregate metrics (success rate, duration, failures). Standalone — does not invoke other skills.

Invocation

How this skill is triggered — by the user, by Claude, or both

Slash command

/workflow-optimizer:measure

User invocable

Model invocable

Inline context

Default effort

Tool Access

This skill is limited to the following tools:

BashReadWriteGlobAgent

Context Preview

The summary Claude sees in its skill listing — used to decide when to auto-load this skill

1. **workflow-path** — path to the workflow definition (workflow.md)

SKILL.md

80 lines · ~560 tokens

Stats

Stars0

MaintenanceGood

Last CommitMar 18, 2026

Actions

View Source View Plugin View on GitHub View README

Stats

Actions

Measure — Run N Times, Collect Metrics

Input Parsing

workflow-path — path to the workflow definition (workflow.md)
--runs N (default: 5) — number of executions

Phase 1: Load Workflow

Read the workflow.md file. Extract:

Run Command — shell command or agent prompt template
Success Criteria — exit code, output pattern, URL pattern
Fixtures — test inputs (cycled round-robin)
Constraints — timeout (ms)

If the workflow has a setup step, run it first.

Phase 2: Execute Runs

For each run (1..N):

Pick fixture: fixtures[i % fixtures.length]
Substitute fixture values into the run command
Start timer
Execute via Bash tool (shell commands) or describe the agent invocation
Stop timer, record duration
Check success criteria against output
Record result

Run Result

Run {i}:
  success: PASS / FAIL
  fixture: {fixture_id}
  duration_ms: {wall clock time}
  error: {error message or "none"}
  output: {first 500 chars of stdout}

If a run exceeds the timeout, mark it as FAIL with error "TIMEOUT".

Parallel Runs

For workflows with no shared state between runs, use the Agent tool to run subsets concurrently (up to 3 agents). Each agent writes results to .workflow-optimizer/{workflow-id}/runs/run-{i}.md.

Phase 3: Aggregate

Compute across all N runs:

Metric	Formula
Success rate	pass_count / N
Avg duration	mean(duration_ms)
Failure distribution	count per error type

Phase 4: Report

============================================================
  {workflow-name} — {N} runs
============================================================
  Success Rate : XX.X%
  Avg Duration : XX.Xs
  Failures:
    TIMEOUT        2
    TIMING         1
============================================================

Output

Write results to .workflow-optimizer/{workflow-id}/baseline.md with the aggregate metrics and per-run details.

measure

Invocation

Tool Access

Context Preview

SKILL.md

measure

Invocation

Tool Access

Context Preview

SKILL.md

Measure — Run N Times, Collect Metrics

Input Parsing

Phase 1: Load Workflow

Phase 2: Execute Runs

Run Result

Parallel Runs

Phase 3: Aggregate

Phase 4: Report

Output

Similar Skills

Measure — Run N Times, Collect Metrics

Input Parsing

Phase 1: Load Workflow

Phase 2: Execute Runs

Run Result

Parallel Runs

Phase 3: Aggregate

Phase 4: Report

Output

Similar Skills