Drafts an evaluation plan for a customer POC or POV — business decision, binary success criteria, scope guardrails, stakeholders, validation environment, and timeline — co-created with the project sponsor in a single 90-minute meeting, then pressure-tested against the four ways evals quietly fail. Use when scoping a POC, prepping an eval kickoff, drafting an evaluation brief, or sanity-checking a plan a sponsor sent over. Triggers on "draft an eval plan", "POV plan", "evaluation brief", "how should I scope this POC", "eval kickoff", "success criteria for this eval", or "POC scope".
How this skill is triggered — by the user, by Claude, or both
Slash command
/solutions-engineering:eval-planningThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
Produce a written evaluation plan for a customer POC or POV — the artifact the project sponsor signs onto before kickoff. The plan is six fields, co-created with the sponsor in a single 90-minute session, then pressure-tested against the four ways evals quietly fail. If the plan takes a second session to land, the planning failed. The deliverable here is something the SE can walk into the meeti...
Produce a written evaluation plan for a customer POC or POV — the artifact the project sponsor signs onto before kickoff. The plan is six fields, co-created with the sponsor in a single 90-minute session, then pressure-tested against the four ways evals quietly fail. If the plan takes a second session to land, the planning failed. The deliverable here is something the SE can walk into the meeting with.
Fire on requests like:
Do not fire for post-eval recaps, demo planning, or first-call discovery prep. Those are different skills.
Run these steps in order. Each can halt and ask the user a question; halting is the correct behavior when the upstream material isn't there.
Before drafting anything, confirm the user can name three things:
If any of the three is missing, halt. Tell the user: "The eval isn't ready to plan yet — the gap is upstream, in discovery. Pin the business decision with the sponsor and AE before drafting further." Offer to help frame the discovery question. Do not paper over the gap with a generic plan.
Fill each field against the user's situation. If material for a field is missing, mark it [NEEDS: <what's missing>] rather than inventing content.
If [observable outcome] is true at wrap-up, [named decider] will [specific deal outcome] by [date]. Every other field exists to make this sentence resolvable.TBD is not acceptable; flag the gap. The project sponsor stays on through wrap-up — confirmed verbally, not over email. The security or risk owner is named explicitly; their absence is the most common reason a technical win does not convert.Walk the draft against each detector in references/anti-patterns.md and surface any that fire. Each detector has a trigger condition, a warning to put in front of the user, and a concrete remediation — surface all three when a detector fires. Do not soften the warning text; the bluntness is the point. The detectors are:
The plan lands in the eval-management tool with the sponsor watching, inside a 90-minute working session with a specific shape. Pull the three-part agenda and the signal markers (what tells the SE the plan is going to land vs. die quietly) from references/planning-meeting.md and attach them to the plan output as a separate Planning meeting section. Don't paste the whole reference inline — summarize the agenda and pick the signal markers most likely to surface for this specific plan.
Use the Output Format below. After the plan, write the Planning meeting section. Then run the Quality Checklist before delivering.
Produce the plan as a markdown block the SE can paste into the eval-management tool. Sections in this order:
# Eval Plan — [Customer]
## Business decision
If [observable outcome] is true at wrap-up, [named decider] will [specific deal outcome] by [date].
## Binary success criteria
1. [Specific behavior, customer environment, measurable threshold.]
2. [...]
3. [...]
(Three to five total. More than five is a Two-Week Flame trigger.)
## Scope guardrails (out of scope)
- [Item] — [one-line reason it's excluded; where it goes instead].
- [Item] — [one-line reason].
## Stakeholders
| Role | Name | Stated interest | Committed to |
|---|---|---|---|
| Project sponsor | [Name] | [What they need to see] | Kickoff, all checkpoints, wrap-up |
| Security / risk owner | [Name] | [What they need to see] | Kickoff, wrap-up |
| Decider (exec sponsor) | [Name] | [Business decision above] | Wrap-up |
| [Other role] | [Name] | [Stated interest] | [Meetings] |
## Validation environment
- Integration surface: [named systems, categories, versions].
- Auth flow path: [direction, credentials, boundaries crossed].
- Test data: [real / test / anonymized / synthetic — named].
- Sandbox vs. production: sandbox first, prepared by the SE, mirroring the customer's stack. Production only after sandbox passes, in stages.
## Timeline
- Kickoff: [date, within one week of planning meeting].
- Checkpoint cadence: every 2–3 days.
- Wrap-up: [date, ≤3 weeks after kickoff], decider attending.
- Internal AE recap: same day as planning meeting.
## Detector check
- Unaimed Evaluation: [PASS / FAIL — note].
- Empty Chair: [PASS / FAIL — note].
- Sandboxed Proof: [PASS / FAIL — note].
- Two-Week Flame: [PASS / FAIL — note].
Then append:
## Planning meeting
- **Who's in the room:** project sponsor + SE, AE supporting. Not the broader customer team — that comes back at kickoff.
- **Length:** 90 minutes minimum. Shorter is an agenda review, not a planning session.
- **Three parts:** (1) confirm success criteria (~30 min), (2) edit the plan live in the eval-management tool with the sponsor watching (~40 min), (3) schedule kickoff, checkpoints, and wrap-up with invites going out before the meeting ends (~20 min).
- **Signals to watch for:** [pull 2–3 markers from references/planning-meeting.md that match this specific plan — typically a mix of one "landing" signal and one "dying" signal to watch].
- **Same-day follow-ups:** AE recap; internal note that planning is complete. Customer does not get access to the plan in the eval-management tool until kickoff.
Verify each item before delivering. If any fails, fix it or surface the gap to the user with a halt message — do not silently ship a plan with a hole.
references/anti-patterns.md — the four detectors with trigger, warning, and remediation. Read in full before running Workflow step 3.references/planning-meeting.md — the 90-minute meeting structure plus the signal markers that distinguish a plan that will land from one that will die quietly. Read when preparing the agenda in Workflow step 4.Provides UI/UX resources: 50+ styles, color palettes, font pairings, guidelines, charts for web/mobile across React, Next.js, Vue, Svelte, Tailwind, React Native, Flutter. Aids planning, building, reviewing interfaces.
Fetches up-to-date documentation from Context7 for libraries and frameworks like React, Next.js, Prisma. Use for setup questions, API references, and code examples.
npx claudepluginhub nmelo/solutions-engineering-plugins --plugin solutions-engineering