Skill

plan-quality

From st

Use when creating any implementation plan (task or phase). Enforces goal-backward plan design, quality gates before execution, anti-vagueness rules, and plan-checker agent verification loop.

Invocation

Slash command

/st:plan-quality

Supporting Files

references/plan-checker-prompt.md

SKILL.md

361 lines · ~4.2k tokens

Stats

Language: JavaScript

Stars: 0

Maintenance: Excellent

Last Commit: Apr 8, 2026

Plan Quality

Overview

Plan Quality ensures every implementation plan is concrete enough to execute without guesswork and complete enough to achieve its goal. It is the methodology used during plan creation, not during execution (that is `verification`) and not for parallelism (that is `wave-parallelism`).

Two responsibilities:

1. Methodology: goal-backward plan design, quality criteria checklist, must-haves derivation, plan-checker agent verification loop.
2. Discipline: anti-vagueness rules that resist Claude's default "optimistic, under-specified" planning behavior.

Core Principle

EVERY PLAN STEP MUST BE EXECUTABLE WITHOUT ASKING QUESTIONS.

If a step says "implement the feature" — it's not a plan, it's a wish.
If a step doesn't name exact files — the planner didn't think it through.
If a step has no acceptance criteria — how do you know it's done?

A plan is a CONTRACT with the executor.
Vague contracts produce vague results.

This is non-negotiable. No complexity level, no time pressure, no "obvious" task overrides this.

When Plan Quality Applies

Always applies to:

- `/st:plan` output (task-level plans)
- `/st:phase-plan` output (phase-level plans)
- Any plan passed to `/st:execute` or `/st:phase-execute`

Does NOT apply to:

- Exploratory brainstorming (ideas, not plans)
- Research summaries (information, not instructions)
- User's own plan documents (unless the user asks for a quality check)

Goal-Backward Plan Design

Plans are designed BACKWARDS from the desired outcome, not FORWARDS from "what should we do first."

Step 1: GOAL — What is the desired end state?
  → One sentence: "User can X" / "System does Y" / "Feature Z works"

Step 2: MUST-HAVES — What must be TRUE for goal to be achieved?
  → 3-7 observable truths (not tasks, not code — TRUTHS)

Step 3: ARTIFACTS — What must EXIST for must-haves to hold?
  → Concrete files, functions, endpoints, configs with exact paths

Step 4: TASKS — What steps produce those artifacts?
  → Every task traces back to a must-have
  → Any must-have without a task = gap in plan

Why backward, not forward?

Forward planning asks "what should we do?" and produces activity-oriented plans ("set up project", "create files", "add logic"). These plans feel productive but often miss the goal.

Backward planning asks "what must be true?" and produces outcome-oriented plans. Every task exists because a must-have requires it.

Example:

| Direction | Plan for "add dark mode" |
| --- | --- |
| Forward (bad) | 1. Research dark mode. 2. Create theme file. 3. Add toggle. 4. Update components. 5. Test. |
| Backward (good) | Goal: User can toggle dark/light mode. Must-haves: (a) toggle persists across sessions, (b) all components respect theme, (c) system preference detected on first visit, (d) no flash of wrong theme. Artifacts: theme.ts, ThemeToggle.tsx, useTheme hook, CSS variables. Tasks: each producing specific artifacts with verification. |

The forward plan sounds reasonable but "update components" is unbounded. The backward plan names every artifact and traces each to a must-have.
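
The tracing rule (every task traces to a must-have, every must-have has a task) can be checked mechanically. The following is a minimal sketch, assuming a plan is held as a list of must-have ids plus tasks that each name the must-haves they address. The `Task` class and `trace_plan` function are hypothetical names for illustration, not part of the skill.

```python
from dataclasses import dataclass, field


@dataclass
class Task:
    """Hypothetical plan task: a name plus the must-have ids it addresses."""
    name: str
    must_haves: list[str] = field(default_factory=list)


def trace_plan(must_haves: list[str], tasks: list[Task]) -> tuple[list[str], list[str]]:
    """Return (gaps, orphans).

    gaps    -- must-haves that no task addresses
    orphans -- names of tasks that trace to no known must-have
    """
    known = set(must_haves)
    covered = {m for t in tasks for m in t.must_haves if m in known}
    gaps = [m for m in must_haves if m not in covered]
    orphans = [t.name for t in tasks if not known.intersection(t.must_haves)]
    return gaps, orphans


# Dark-mode example: must-haves (a)-(d) from the backward plan above.
must_haves = ["a", "b", "c", "d"]
tasks = [
    Task("Create theme.ts with CSS variables", ["b", "d"]),
    Task("Create ThemeToggle.tsx and useTheme hook", ["a"]),
    Task("Research dark mode", []),  # forward-style task, traces to nothing
]
gaps, orphans = trace_plan(must_haves, tasks)
print(gaps)     # ['c']
print(orphans)  # ['Research dark mode']
```

Here must-have (c), system preference detection, has no task, and the research task serves no must-have. Both are exactly the defects the backward method is meant to expose.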

Must-Haves Derivation

How to derive must-haves depends on plan type:

For /st:plan (task-level):

The AI derives must-haves from the goal:

Goal: "Add dark mode to the app"
Must-haves:
  1. User can toggle between dark and light mode
  2. Theme preference persists across sessions
  3. System color scheme preference detected on first visit
  4. All existing components render correctly in both themes
  5. No flash of wrong theme on page load

For /st:phase-plan (phase-level):

Must-haves are the success criteria of the corresponding phase in `ROADMAP.md`:

Phase 3: Authentication System
Success criteria (= must-haves):
  1. User can register with email/password
  2. User can login and receive JWT token
  3. Password reset flow sends email and works
  4. Session expires after configured timeout

Rules:

- Must-haves are TRUTHS, not tasks. "User can login" not "Create login endpoint."
- 3-7 per plan. Fewer = underspecified. More = split the plan.
- Each must-have should be independently verifiable.
- Must-haves become the verification checklist for `superteam:verification` post-execution.
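
The first two rules lend themselves to a quick automated sanity check. The sketch below is a heuristic only: the `TASK_VERBS` list is an assumption (imperative verbs that usually signal a task rather than a truth), and `check_must_haves` is a hypothetical helper, not something the skill ships.

```python
# Assumed list of imperative verbs that usually signal a task rather than a truth.
TASK_VERBS = {"create", "add", "implement", "build", "write", "set", "configure", "refactor"}


def check_must_haves(must_haves: list[str]) -> list[str]:
    """Return a list of human-readable problems; an empty list means the rules hold."""
    problems = []
    if len(must_haves) < 3:
        problems.append(f"only {len(must_haves)} must-haves: plan is underspecified")
    elif len(must_haves) > 7:
        problems.append(f"{len(must_haves)} must-haves: split the plan")
    for item in must_haves:
        words = item.split()
        first_word = words[0].lower() if words else ""
        if first_word in TASK_VERBS:
            problems.append(f"reads like a task, not a truth: {item!r}")
    return problems


print(check_must_haves([
    "User can register with email/password",
    "User can login and receive JWT token",
    "Create password reset endpoint",
]))
# ["reads like a task, not a truth: 'Create password reset endpoint'"]
```

A verb list cannot judge whether a must-have is independently verifiable; that rule still needs a reader.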

Plan Quality Checklist

Every plan must pass ALL items before execution. This is the checklist the plan-checker agent uses.

PLAN QUALITY GATES:

□ GOAL STATED — One sentence, observable outcome
□ MUST-HAVES DERIVED — 3-7 truths that prove goal achieved
□ EVERY MUST-HAVE COVERED — Each has ≥1 task addressing it
□ NO ORPHAN TASKS — Every task traces to a must-have
□ EXACT FILE PATHS — Every task names files to create/modify/test
□ READ-FIRST FILES — Every modify-task lists files to read before editing
□ ACCEPTANCE CRITERIA — Every task has grep-verifiable conditions
□ EXPECTED OUTPUT — Tasks with runtime behavior have command + expected result
□ DEPENDENCIES DECLARED — Task ordering is explicit, not implied
□ GRANULARITY APPROPRIATE — Steps match complexity (COARSE/STANDARD/FINE)
□ NO VAGUE STEPS — Zero instances of banned phrases without specifics
□ CONCRETE VALUES — Config values, API paths, function signatures specified
□ TDD INTEGRATION — Logic tasks include test steps per granularity level
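
One way to picture the all-gates-must-pass rule is as a set of predicates over a plan. This is a rough sketch under stated assumptions: the plan shape (a dict with `goal`, `must_haves`, `tasks`) and the names `GATES`, `run_gates`, and `plan_passes` are hypothetical, and only three of the thirteen gates are modeled.

```python
from typing import Callable

# Hypothetical plan shape:
# {"goal": str, "must_haves": [str], "tasks": [{"name": str, "files": [str]}]}
Plan = dict
Gate = Callable[[Plan], bool]

# Three of the thirteen gates as simple predicates; the rest would follow the same pattern.
GATES: dict[str, Gate] = {
    "GOAL STATED": lambda p: bool(p.get("goal", "").strip()),
    "MUST-HAVES DERIVED": lambda p: 3 <= len(p.get("must_haves", [])) <= 7,
    "EXACT FILE PATHS": lambda p: bool(p.get("tasks")) and all(t.get("files") for t in p["tasks"]),
}


def run_gates(plan: Plan) -> dict[str, str]:
    """Evaluate every gate and return PASS or FAIL per gate name."""
    return {name: "PASS" if gate(plan) else "FAIL" for name, gate in GATES.items()}


def plan_passes(plan: Plan) -> bool:
    """A plan may be executed only if ALL gates pass."""
    return all(result == "PASS" for result in run_gates(plan).values())


plan = {
    "goal": "User can toggle dark/light mode",
    "must_haves": ["toggle persists", "components respect theme", "no flash of wrong theme"],
    "tasks": [
        {"name": "Create theme module", "files": ["theme.ts"]},
        {"name": "Update components", "files": []},  # names no files
    ],
}
print(run_gates(plan))
# {'GOAL STATED': 'PASS', 'MUST-HAVES DERIVED': 'PASS', 'EXACT FILE PATHS': 'FAIL'}
print(plan_passes(plan))  # False
```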

Task Anatomy

Every task in a plan must contain these fields. Missing fields = plan fails quality gate.

### Task N: [Specific Component Name]

Files:
  - Create: exact/path/to/file.ts
  - Modify: exact/path/to/existing.ts (lines ~XX-YY if known)
  - Test: tests/exact/path/to/test.ts
  - Read-first: exact/path/to/dependency.ts (understand before editing)

Steps:
  - [ ] Step 1: [concrete action with concrete values]
  - [ ] Step 2: [verification step with command + expected output]

Acceptance criteria:
  - grep "export function X" src/path/file.ts → match
  - npm test -- --filter="X" → 0 failures

Dependencies: Task [M] (needs X to exist)
Must-have: [which must-have this task addresses]
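
To illustrate what "grep-verifiable" means for an acceptance criterion, here is a small sketch that performs the same check in code. It assumes fixed-string matching (comparable to `grep -F`) and uses a throwaway file in place of a real project file. `grep_matches` is a hypothetical helper name.

```python
import tempfile
from pathlib import Path


def grep_matches(needle: str, path: Path) -> bool:
    """Return True if any line of the file contains `needle` (fixed-string match)."""
    if not path.is_file():
        return False
    with path.open(encoding="utf-8") as handle:
        return any(needle in line for line in handle)


# Demo against a temporary file standing in for the file a task creates.
with tempfile.TemporaryDirectory() as tmp:
    target = Path(tmp) / "login.ts"
    target.write_text(
        "export function loginUser(email: string, password: string) {}\n",
        encoding="utf-8",
    )
    found = grep_matches("export function loginUser", target)
    missing = grep_matches("export function logoutUser", target)

print(found)    # True
print(missing)  # False
```

A criterion of this kind has exactly two outcomes, which is what makes "done" unambiguous for the executor.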

Required fields by granularity:

| Field | COARSE | STANDARD | FINE |
| --- | --- | --- | --- |
| Files (create/modify) | Required | Required | Required |
| Read-first | Optional | Required | Required |
| Test files | Optional | Required | Required |
| Acceptance criteria (grep) | Required | Required | Required |
| Expected output (runtime) | Optional | Required | Required |
| TDD steps (red-green) | Skip | Logic tasks only | All tasks |
| Dependencies | Required | Required | Required |
| Must-have trace | Required | Required | Required |
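
The requirement levels above can be encoded as data and applied to a task. This is a sketch, assuming tasks are dicts with hypothetical field keys (`files`, `read_first`, `is_logic`, and so on), and that an empty value counts as missing, so "no dependencies" has to be declared explicitly.

```python
# Requirement levels copied from the table above; the field keys are hypothetical identifiers.
REQUIRED_FIELDS = {
    "COARSE": {"files", "acceptance_criteria", "dependencies", "must_have"},
    "STANDARD": {"files", "read_first", "test_files", "acceptance_criteria",
                 "expected_output", "dependencies", "must_have"},
    "FINE": {"files", "read_first", "test_files", "acceptance_criteria",
             "expected_output", "tdd_steps", "dependencies", "must_have"},
}


def missing_fields(task: dict, granularity: str) -> list[str]:
    """Return the sorted names of required fields that are absent or empty in the task."""
    required = set(REQUIRED_FIELDS[granularity])
    # STANDARD requires TDD steps for logic tasks only.
    if granularity == "STANDARD" and task.get("is_logic"):
        required.add("tdd_steps")
    return sorted(name for name in required if not task.get(name))


task = {
    "files": ["src/auth/login.ts"],
    "acceptance_criteria": ['grep "export function loginUser" src/auth/login.ts'],
    "dependencies": ["none"],  # declared explicitly, even when there are none
    "must_have": "User can login and receive JWT token",
    "is_logic": True,
}
print(missing_fields(task, "COARSE"))    # []
print(missing_fields(task, "STANDARD"))  # ['expected_output', 'read_first', 'tdd_steps', 'test_files']
```

The same task passes at COARSE and fails at STANDARD, which is the point of tying required fields to granularity.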

Anti-Vagueness Rules

These are the specific vague patterns Claude produces in plans and the concrete replacements required.

| Banned Phrase | Why It Fails | Replace With |
| --- | --- | --- |
| "Implement the feature" | No specifics. What feature? Which files? | "Create `src/auth/login.ts` with `loginUser(email, password): Promise<Token>`" |
| "Add validation" | Which fields? What rules? What error messages? | "Add email format check: reject if not matching `/^[^@]+@[^@]+$/`, return `{error: 'Invalid email format'}`" |
| "Handle errors" | Which errors? How? What response? | "Catch `DatabaseConnectionError` → return 503 with `{error: 'Service unavailable', retry_after: 30}`" |
| "Set up the project" | What setup? Which tools? What config? | "Create `vite.config.ts` with React plugin, path alias `@/` → `src/`" |
| "Update components" | Which components? What changes? | "Modify `src/components/Header.tsx`: add `useTheme()` hook, apply `className={theme.header}`" |
| "Test it" | What test? What assertion? | "Write test: `loginUser('bad@email', 'pass')` → rejects with `InvalidCredentialsError`" |
| "Configure [tool]" | Which settings? What values? | "In `tsconfig.json`, set `compilerOptions.strict: true` and `compilerOptions.paths: { "@/*": ["src/*"] }`" |
| "Integrate with [service]" | What endpoint? What data? What auth? | "POST to `/api/v2/users` with `{name, email}`, Bearer token in header, expect 201 with `{id, created_at}`" |
| "Refactor [code]" | What change? Why? What's the target state? | "Extract `validateInput()` from `handleSubmit()` in `form.ts:45-67` into `src/utils/validate.ts`" |
| "Clean up" | What cleanup? Which files? | "Remove unused imports in `api.ts` (lines 3, 7, 12), rename `temp` → `userSession`" |

Detection rule: If a step can be interpreted 3+ different ways by different developers, it is vague. Rewrite until there is only one interpretation.
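
A banned-phrase scan is simple to approximate. The sketch below assumes one crude heuristic for "specifics": a step counts as specific if it contains at least one backticked identifier, path, or value, as the replacements in the table do. That heuristic and the `vague_steps` name are assumptions for illustration. A real checker would also need the human judgment described in the detection rule.

```python
import re

# Banned phrases from the table above, matched case-insensitively on word boundaries.
BANNED_PHRASES = [
    "implement the feature", "add validation", "handle errors", "set up the project",
    "update components", "test it", "configure", "integrate with", "refactor", "clean up",
]
BANNED_RE = re.compile(
    r"\b(" + "|".join(map(re.escape, BANNED_PHRASES)) + r")\b",
    re.IGNORECASE,
)


def vague_steps(steps: list[str]) -> list[tuple[str, str]]:
    """Return (step, banned phrase) pairs for steps that use a banned phrase without specifics."""
    flagged = []
    for step in steps:
        hit = BANNED_RE.search(step)
        # Assumed heuristic: a backtick anywhere in the step signals concrete specifics.
        if hit and "`" not in step:
            flagged.append((step, hit.group(1).lower()))
    return flagged


steps = [
    "Add validation",
    "Add validation: reject emails not matching `/^[^@]+@[^@]+$/`",
    "Refactor the form code",
]
for step, phrase in vague_steps(steps):
    print(f"VAGUE ({phrase}): {step}")
# VAGUE (add validation): Add validation
# VAGUE (refactor): Refactor the form code
```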

Granularity Guidelines

Plan-quality does not choose granularity (the planner agent does), but it enforces minimum quality per level.

| Granularity | When | Task Count | Minimum Quality |
| --- | --- | --- | --- |
| COARSE | Simple: rename, config, styling | 1-3 | File paths + acceptance criteria + must-have trace |
| STANDARD | Medium: new component, API endpoint | 5-8 | + read-first + expected output + TDD for logic |
| FINE | Complex: auth, migration, payment | 10-20 | + TDD every task + code snippets + verification between steps |

User override: a request for more detail (for example the Vietnamese "Chi tiết hơn") increases granularity; a request for a more concise plan (for example "Gọn hơn") decreases it. Plan quality gates still apply at any level.
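
The levels and the user override can be sketched as a small lookup. Two things here are assumptions not stated in the skill: that an override moves exactly one level at a time, and that it clamps at COARSE and FINE. The names `adjust` and `task_count_ok` are hypothetical.

```python
LEVELS = ["COARSE", "STANDARD", "FINE"]
# Task-count ranges as listed for each granularity level.
TASK_COUNT = {"COARSE": (1, 3), "STANDARD": (5, 8), "FINE": (10, 20)}


def adjust(level: str, more_detail: bool) -> str:
    """Move one level toward FINE (more detail) or COARSE (less), clamping at the ends."""
    index = LEVELS.index(level) + (1 if more_detail else -1)
    return LEVELS[max(0, min(index, len(LEVELS) - 1))]


def task_count_ok(level: str, count: int) -> bool:
    """Check a plan's task count against the range listed for its level."""
    low, high = TASK_COUNT[level]
    return low <= count <= high


print(adjust("STANDARD", more_detail=True))  # FINE
print(adjust("COARSE", more_detail=False))   # COARSE
print(task_count_ok("STANDARD", 4))          # False
```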

Plan-Checker Agent Protocol

After the planner agent produces a plan, the plan-checker agent verifies quality.

PLAN-CHECKER PROTOCOL:

1. DISPATCH plan-checker agent with:
   - Plan file path
   - Goal statement
   - Must-haves list
   - Quality checklist (from this skill)

2. CHECKER evaluates against checklist:
   - Each gate: PASS / FAIL with evidence
   - Vagueness scan: flag any banned phrases
   - Must-have coverage: matrix showing which tasks cover which must-haves
   - Orphan task detection: tasks not linked to any must-have

3. CHECKER returns:
   - Status: APPROVED / ISSUES FOUND
   - Issues (if any): specific, actionable (not "add more detail")
   - Must-have coverage matrix
   - Recommendations

4. IF ISSUES FOUND:
   - Planner fixes issues (same agent, preserves context)
   - Re-dispatch checker
   - Max 3 iterations
   - After 3: surface remaining issues to user

5. CONFIG GATE:
   - workflow.plan_check: true → run checker (default)
   - workflow.plan_check: false → skip checker, planner self-checks

Checker calibration: Only flag issues that would cause real problems during execution. An implementer building the wrong thing or getting stuck is an issue. Minor wording preferences are not. Approve unless there are serious gaps.
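
The control flow of the protocol (check, fix, re-check, give up after three rounds) can be sketched as a bounded loop. This is an illustration under assumptions, not the plugin's implementation: the checker and planner are passed in as plain callables, one iteration is taken to mean one checker run, and the `SKIPPED` status for the `workflow.plan_check: false` case is a hypothetical label.

```python
from typing import Callable

MAX_ITERATIONS = 3


def check_loop(
    plan: str,
    checker: Callable[[str], list[str]],
    fixer: Callable[[str, list[str]], str],
    plan_check: bool = True,
) -> tuple[str, str, list[str]]:
    """Run the checker up to MAX_ITERATIONS times, letting the planner fix issues in between.

    Returns (status, final_plan, remaining_issues).
    """
    if not plan_check:
        # workflow.plan_check: false -> skip the checker; the planner self-checks.
        return "SKIPPED", plan, []
    issues: list[str] = []
    for attempt in range(1, MAX_ITERATIONS + 1):
        issues = checker(plan)
        if not issues:
            return "APPROVED", plan, []
        if attempt < MAX_ITERATIONS:
            plan = fixer(plan, issues)
    # Iterations exhausted: surface what is left to the user.
    return "ISSUES FOUND", plan, issues


def stub_checker(plan: str) -> list[str]:
    """Stand-in checker that flags a single banned phrase."""
    return ["vague step: 'handle errors'"] if "handle errors" in plan else []


def stub_fixer(plan: str, issues: list[str]) -> str:
    """Stand-in planner fix that replaces the vague phrase with a concrete one."""
    return plan.replace("handle errors", "catch `DatabaseConnectionError` -> return 503")


status, final_plan, remaining = check_loop("Step 1: handle errors", stub_checker, stub_fixer)
print(status)      # APPROVED
print(final_plan)  # Step 1: catch `DatabaseConnectionError` -> return 503
```

Bounding the loop keeps a planner and checker that disagree from cycling indefinitely; whatever remains after the third check goes to the user.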

Anti-Shortcut System

Red Flags — STOP

These thoughts mean you are about to produce a low-quality plan:

| Thought | What to do instead |
| --- | --- |
| "The implementation is obvious, I don't need a detailed plan" | Obvious implementations have non-obvious edge cases. Write the plan. |
| "I'll figure out the details during execution" | Execution is not planning. Details belong in the plan. |
| "This step is self-explanatory" | To you, not to the executor. Name the files, values, and criteria. |
| "I already know how to do this" | Knowledge is not a plan. Write it down. |
| "The plan is getting too long" | Long plan > vague plan. Completeness over brevity. |
| "I'll add acceptance criteria later" | You won't. Write them now. They take 10 seconds. |
| "This is just boilerplate, no need to specify" | Boilerplate varies by framework. Specify the exact boilerplate. |
| "The user will understand what I mean" | The executor might be a subagent with zero context. Be explicit. |
| "One big task is fine, I'll break it down in execution" | Big tasks hide complexity. Break it down now. |
| "Let me start coding, I'll plan as I go" | That's not planning. `/st:plan` exists for a reason. |

Common Rationalizations

| Excuse | Reality |
| --- | --- |
| "Plans change during execution anyway" | Starting with a good plan means smaller changes. Starting with a vague plan means rewriting everything. |
| "Over-planning wastes time" | Under-planning wastes MORE time: wrong direction, rework, missed requirements. |
| "The codebase is simple, I don't need all this" | Simple codebases become complex. Plans document decisions for future readers. |
| "Must-haves are obvious" | If they're obvious, writing them takes 30 seconds. If they're not, you just proved why you need them. |
| "I'll add file paths when I get to that task" | Writing file paths during planning forces you to think about structure. Deferring them to execution forces you to context-switch. |
| "Acceptance criteria slow me down" | Acceptance criteria ARE the plan. Without them, how do you know you're done? |
| "This is a one-person project, no need for formal plans" | The executor might be a subagent. Or future-you with no context. Plans are for anyone who executes. |

Quick Reference

CORE RULE:
  Every plan step must be executable without asking questions.
  Goal-backward: goal → must-haves → artifacts → tasks.

GOAL-BACKWARD:
  1. GOAL: one sentence, observable outcome
  2. MUST-HAVES: 3-7 truths (not tasks)
  3. ARTIFACTS: concrete files/functions/endpoints
  4. TASKS: steps producing those artifacts

QUALITY GATES (plan-checker verifies):
  □ Goal stated
  □ Must-haves derived (3-7)
  □ Every must-have covered by ≥1 task
  □ No orphan tasks
  □ Exact file paths in every task
  □ Read-first files listed
  □ Acceptance criteria (grep-verifiable)
  □ Expected output (runtime)
  □ Dependencies declared
  □ Granularity appropriate (COARSE/STANDARD/FINE)
  □ No vague steps (banned phrases check)
  □ Concrete values specified
  □ TDD per granularity

TASK ANATOMY:
  Files: create / modify / test / read-first
  Steps: concrete action + verification
  Criteria: grep-verifiable + expected output
  Dependencies + must-have trace

MUST-HAVES:
  /st:plan → AI derives from goal (3-7 truths)
  /st:phase-plan → success criteria from ROADMAP.md

PLAN-CHECKER:
  Dispatch after planner. Max 3 iterations.
  Config: workflow.plan_check (default true)
  Calibration: flag real problems, not style

BANNED: "implement", "add validation", "handle errors",
        "set up", "update components", "test it",
        "configure", "integrate", "refactor", "clean up"
        — without specifics.

Common Mistakes

| Mistake | Fix |
| --- | --- |
| Skipping goal-backward, going straight to tasks | Always start from goal. Derive must-haves before writing any tasks. |
| Must-haves that are tasks, not truths | "Create login endpoint" is a task. "User can login" is a truth. Must-haves are truths. |
| Vague steps that pass because planner "knows what they mean" | Executor may be a subagent with zero context. Apply banned-phrase check. |
| No acceptance criteria ("I'll verify at the end") | Every task needs criteria. Without them, "done" is undefined. |
| Missing read-first files | Modifying a file without reading it first leads to wrong assumptions. Always list. |
| Plan with 1 giant task instead of multiple focused tasks | Each task should produce 1-3 files. If more, split. |
| Must-have not covered by any task | Gap in plan. Add task or expand existing one. |
| Orphan task not linked to must-have | Either the task is unnecessary (remove) or a must-have is missing (add). |
| Skipping plan-checker "because plan is simple" | Config controls this, not planner judgment. If `workflow.plan_check` is true, run checker. |
| Forward-planning masquerading as backward | Writing goal + must-haves AFTER writing tasks is not goal-backward. Must-haves must drive task creation, not rationalize it. |

Context Budget

| File | When to Load | Trigger |
| --- | --- | --- |
| `SKILL.md` | Always | Skill invocation via `/st:plan` or `/st:phase-plan` |
| `references/plan-checker-prompt.md` | On demand | When dispatching plan-checker agent |

Rule: Most plan creation needs only SKILL.md. Load references/plan-checker-prompt.md only when dispatching the checker agent. It is a prompt template, not prerequisite knowledge.

Integration

Used by:

- `/st:plan`: plan quality methodology applied during plan creation (steps 5-8)
- `/st:phase-plan`: same engine, phase context adds must-haves from success criteria

Skills that pair with plan-quality:

- `superteam:project-awareness`: provides codebase context, framework detection, config including `workflow.plan_check`
- `superteam:wave-parallelism`: dependency analysis and wave assignment happen after plan quality is verified
- `superteam:tdd-discipline`: plan-quality enforces TDD step inclusion per granularity level
- `superteam:verification`: must-haves from plan become verification checklist post-execution
- `superteam:handoff-protocol`: plan state (checked/unchecked tasks) captured in handoff

Agent: plan-checker agent uses the quality checklist and must-have coverage matrix from this skill.
