Skill

executor

Execute one micro-task from tasks.json using TDD. Use when user says "execute", "run next task", "implement next", "continue execution", or runs /execute. This is the fourth phase of Track 2. Enforces TDD gate: failing tests FIRST, then implementation.

Invocation

How this skill is triggered — by the user, by Claude, or both

Slash command

/harness-engineering:executor

User invocable

Model invocable

Context Preview

The summary Claude sees in its skill listing — used to decide when to auto-load this skill

Execute exactly ONE micro-task from tasks.json. Enforces TDD (failing tests first)

Supporting Files

scripts/mark_complete.py

scripts/select_next.py

SKILL.md

126 lines · ~979 tokens

Stats

Language: Python

Stars: 0

Forks: 1

Maintenance: Excellent

Last Commit: Apr 3, 2026

Executor (Track 2 — Phase 4)

Purpose

Execute exactly ONE micro-task from tasks.json. Enforces TDD (failing tests first) and Chain of Verification (CoVe) before marking complete.

Process

Step 1: Select Next Task

python3 ${CLAUDE_PLUGIN_ROOT}/skills/executor/scripts/select_next.py

The script returns the next unblocked task (about 100 tokens of output). If no tasks are available, tell the user.
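The selection logic might look roughly like the sketch below. The tasks.json schema is not shown in this skill, so the field names `id`, `status`, and `depends_on` are assumptions, and `select_next` is a hypothetical stand-in rather than the actual contents of select_next.py.

```python
from typing import Optional


def select_next(tasks: list[dict]) -> Optional[dict]:
    """Return the first pending task whose dependencies are all done.

    Assumed (hypothetical) task shape:
    {"id": "T001", "status": "pending" | "done", "depends_on": ["T000"]}
    """
    done = {t["id"] for t in tasks if t.get("status") == "done"}
    for task in tasks:
        if task.get("status") != "pending":
            continue
        # A task is unblocked once every dependency has been completed.
        if all(dep in done for dep in task.get("depends_on", [])):
            return task
    return None  # nothing unblocked: tell the user
```

Returning `None` rather than raising keeps the "no tasks available" case an ordinary outcome that the caller reports to the user.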

Step 2: Read Task Context

Read ONLY the files listed in the task's files array. Do not read unrelated files. If the task has annotations, read those carefully — they contain human reviewer decisions.
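As a minimal sketch of the read restriction, assuming a task is a dict whose `files` entry is a list of relative paths (the helper name `may_read` is hypothetical):

```python
from pathlib import PurePosixPath


def may_read(path: str, task: dict) -> bool:
    """True only if `path` is listed in the task's files array."""
    allowed = {PurePosixPath(p) for p in task.get("files", [])}
    return PurePosixPath(path) in allowed
```

A task with no `files` array permits no reads under this sketch, which errs on the side of keeping context small.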

Step 3: TDD Gate (DO NOT SKIP)

Write failing tests FIRST:

1. Based on the task's verification spec, write test(s) that define expected behavior
2. Run the tests. They MUST fail (if they pass, either the feature already exists or the tests do not exercise the new behavior)
3. Show the user the failing tests
4. Wait for user validation (for M/L scope tasks)
   - For S scope tasks, proceed automatically

Only after tests are validated:

5. Write implementation code until tests pass
6. Run tests again. They MUST pass now
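The gate can be read as a small decision function. The sketch below is illustrative only: `tdd_gate` and its return strings are hypothetical names, and the scope values `S`, `M`, and `L` are taken from the text above.

```python
def tdd_gate(scope: str, tests_failed: bool, user_validated: bool = False) -> str:
    """Decide the next action after running the newly written tests."""
    if not tests_failed:
        # Tests that pass before any implementation do not satisfy the gate.
        return "stop: tests already pass"
    if scope == "S" or user_validated:
        # S scope proceeds automatically; M/L scope needs user validation.
        return "implement"
    return "wait for user validation"
```

For an M scope task, `tdd_gate("M", True)` asks for validation, and calling it again with `user_validated=True` allows implementation to start.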

Step 4: Chain of Verification (CoVe)

After implementation, run this 4-step self-check:

1. Generate answer: The implementation code (already written)
2. Verification questions (ask yourself):
   - Does this handle the edge cases mentioned in the task?
   - Is this consistent with the existing codebase patterns?
   - Does this break any existing tests?
   - Are there security concerns (injection, XSS, etc.)?
3. Answer independently: Check each question against the actual code
4. Revise: Fix any issues found
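One way to keep the self-check honest is to record each question's outcome explicitly. The sketch below is a hypothetical structure, not part of the skill's scripts: `run_cove` takes a `check` callable (an assumption) that inspects the code for one question and returns a description of the issue, or `None` if there is none.

```python
from dataclasses import dataclass, field
from typing import Callable, Optional

COVE_QUESTIONS = [
    "Does this handle the edge cases mentioned in the task?",
    "Is this consistent with the existing codebase patterns?",
    "Does this break any existing tests?",
    "Are there security concerns (injection, XSS, etc.)?",
]


@dataclass
class CoVeResult:
    # Maps each question that surfaced a problem to a description of it.
    findings: dict[str, str] = field(default_factory=dict)

    @property
    def needs_revision(self) -> bool:
        return bool(self.findings)


def run_cove(check: Callable[[str], Optional[str]]) -> CoVeResult:
    """Answer every question independently and collect any issues found."""
    result = CoVeResult()
    for question in COVE_QUESTIONS:
        issue = check(question)
        if issue:
            result.findings[question] = issue
    return result
```

The collected `findings` are what Step 6 would report as CoVe findings once they have been fixed.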

Step 5: Mark Complete

python3 ${CLAUDE_PLUGIN_ROOT}/skills/executor/scripts/mark_complete.py \
  <task_id> [--commit-sha <sha>]

This updates tasks.json and appends to claude-progress.txt.
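The two side effects can be sketched as follows. This is not the real mark_complete.py: it assumes tasks.json is a top-level JSON list of objects with `id` and `status` fields, and the `commit_sha` field and the progress-line format are likewise assumptions.

```python
import json
from datetime import datetime, timezone
from pathlib import Path
from typing import Optional


def mark_complete(
    task_id: str,
    commit_sha: Optional[str] = None,
    tasks_path: Path = Path("tasks.json"),
    progress_path: Path = Path("claude-progress.txt"),
) -> None:
    """Mark one task done and append a line to the progress log."""
    tasks = json.loads(tasks_path.read_text())
    for task in tasks:
        if task["id"] == task_id:
            task["status"] = "done"
            if commit_sha:
                task["commit_sha"] = commit_sha
            break
    else:
        # The loop finished without finding the task.
        raise KeyError(f"unknown task id: {task_id}")
    tasks_path.write_text(json.dumps(tasks, indent=2) + "\n")

    stamp = datetime.now(timezone.utc).isoformat(timespec="seconds")
    suffix = f" ({commit_sha})" if commit_sha else ""
    with progress_path.open("a") as log:
        log.write(f"{stamp} completed {task_id}{suffix}\n")
```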

Step 6: Report

Tell the user:

What was implemented
Tests written and their status
CoVe findings (if any issues were found and fixed)
"Context can be safely cleared. Run /execute for the next task."

2-Pass Debugging Rule

If tests don't pass after 2 implementation attempts:

Do NOT try a 3rd time

Log state:

python3 ${CLAUDE_PLUGIN_ROOT}/scripts/progress.py append \
  "2-Pass limit reached on <task_id>: <what was tried>"

Tell the user: "2-Pass limit reached. Recommend context clear and fresh approach."
STOP
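The rule amounts to a bounded retry loop. In the sketch below, `attempt` is a hypothetical callable that performs one implementation attempt and returns whether the tests pass; the names `MAX_PASSES` and `implement_with_limit` are illustrative.

```python
from typing import Callable

MAX_PASSES = 2


def implement_with_limit(attempt: Callable[[int], bool]) -> str:
    """Run at most MAX_PASSES attempts; attempt(n) returns True if tests pass."""
    for n in range(1, MAX_PASSES + 1):
        if attempt(n):
            return f"passed on attempt {n}"
    # No third attempt: report the limit and stop.
    return "2-Pass limit reached. Recommend context clear and fresh approach."
```

Keeping the limit in one constant makes it hard to slip in a third attempt by accident.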

Examples

Example 1: Simple Task (S scope)

Task: T001 - Add auth middleware to /api/users endpoint

Process:

select_next.py returns T001
Read the endpoint file
Write test: test_users_endpoint_requires_auth()
Run test -> FAILS (no auth yet) -- auto-proceed (S scope)
Add middleware import and decorator
Run test -> PASSES
CoVe: edge cases OK, consistent with other endpoints
mark_complete.py T001 --commit-sha abc123

Example 2: Complex Task (M scope) with 2-Pass

Task: T003 - Implement token refresh logic

Process:

select_next.py returns T003
Read token management files
Write test: test_token_refresh_extends_session()
Run test -> FAILS -- show user, wait for validation
User validates test
First attempt: implement refresh -- tests still fail
Second attempt: fix edge case -- tests PASS
CoVe: security check on token handling
mark_complete.py T003

Anti-Patterns

Do NOT implement without writing tests first
Do NOT execute multiple tasks in one session
Do NOT skip CoVe
Do NOT attempt a 3rd debugging pass (2-Pass Rule)
Do NOT read files not listed in the task's files array
