From upfront
Close the loop on a shipped feature — check spec predictions against production reality
How this skill is triggered — by the user, by Claude, or both
Slash command
/upfront:retroThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
You are running a structured retrospective that compares a feature spec's predictions against what actually happened in production. The goal is to improve future specs by learning from prediction accuracy.
You are running a structured retrospective that compares a feature spec's predictions against what actually happened in production. The goal is to improve future specs by learning from prediction accuracy.
The user provides a spec path as an argument: $ARGUMENTS
If no argument is provided or the path is empty:
specs/ directory (or similar spec locations in the project)Read the spec file and extract:
Present what you found:
"Here's what the spec predicted. Let me check these against reality."
List each prediction clearly so the user can respond to them.
For each success metric, ask:
"The spec predicted: [metric]. What actually happened? Do you have the numbers?"
For pre-mortem risks, ask:
"The pre-mortem identified these risks: [list risks]. Did any of these materialize?"
Then ask:
"Were there any unexpected outcomes — good or bad — that the spec didn't predict?"
Rules for evidence gathering:
/upfront:retro when you have [specific metric] data."For every prediction the spec made, assign one score:
| Score | Meaning |
|---|---|
| Hit | The predicted outcome happened as described |
| Partial | Directionally correct but magnitude was off |
| Miss | Didn't happen, or the opposite happened |
| Unknown | No data available to evaluate |
Present the scorecard clearly.
For each Miss and Partial, work through:
"Why was the prediction wrong?"
Guide the user through three possible causes:
Then ask:
"Should the spec have caught this? What question in
/upfront:featurewould have surfaced it?"
For unexpected outcomes (good or bad):
"Was this predictable? What signal did we miss?"
Challenge the user to generalize. Ask these questions one at a time, not all at once:
Push for specifics. "Be more careful" is not a lesson. "Add a load test to the pre-mortem checklist" is.
Append to specs/LEARNINGS.md (create if it doesn't exist):
## [today's date] — Retro: [feature name]
**Predicted:** [what the spec said would happen]
**Actual:** [what actually happened]
**Score:** [N hits, N partials, N misses, N unknown]
**Why predictions missed:** [analysis from step 4]
**Lessons for future specs:** [what to do differently]
**Process notes:** [did /upfront:feature ask the right questions? did /upfront:build catch the right things?]
Add a Retrospective section to the bottom of the spec file itself:
---
## Retrospective ([today's date])
**Outcome:** [hit/partial/miss for each prediction]
**Key learning:** [one sentence]
Review the retro findings for patterns that should change how /upfront:feature, /upfront:plan, or /upfront:build works. If you see one, say so explicitly. Examples:
/upfront:feature push harder on metric availability?"/upfront:build's red team should specifically test [pattern]."/upfront:feature add a question about [category]?"These are suggestions for evolving the system, not automatic changes. Present them as observations for the user to act on.
npx claudepluginhub thinkupfront/upfront --plugin upfrontActivate for: retrospective, retro, post-mortem, post launch review, feature review, what went well, what didn't go well, lessons learned, sprint retrospective, product review, launch review, did it work, measure impact, feature retrospective, post-ship review, outcome review, product retro, evaluate feature success. NOT for: metrics dashboards (use official /metrics-review), stakeholder updates (use official /stakeholder-update), sprint planning (use official /sprint-planning).
Runs structured retrospective after completing delivery diamonds or milestones, recording cycle data, ICE/effort calibration, DORA metrics, and learnings in canvas YAML and decision log.
Produces structured retrospectives after major releases or deprecations, comparing plan vs actual outcomes on blast radius, communication, migration, and support to identify process improvements.