By RyanAlberts
The eval lifecycle as PM-runnable commands (/pmstack:eval, /pmstack:run-eval, /pmstack:eval-drift), plus PRDs, voice-of-customer synthesis, launch-readiness gates, and the Monday memo.
Draft a stakeholder brief tailored to executive, engineering, customer, or board
Feature-by-feature comparison of two or more products with eval design and runnable execution
Structured competitive analysis for a market or product
Re-runs /eval-self (and any user-defined eval YAMLs), diffs against the prior baseline, writes a release-blocker memo on regression. Designed to run on a /loop 7d schedule.
Run the pmstack self-eval suite — scores every pmstack skill against canonical scenarios, with regression check against the golden set
Synthesize many raw customer signals (support tickets, interview snippets, churn reasons, sales notes, NPS verbatims, app reviews) into a small set of ranked, PRD-ready problems. Use when a PM pastes or attaches a pile of feedback and asks "what's the real problem / what should we build first / what are the themes", when there are too many signals to read one by one, or when someone needs to turn voice-of-customer data into prioritized problem statements before writing a PRD. This is the front-of-funnel step that precedes pmstack-prd.
A Monday-morning self-snapshot for PMs. Three sections only — decisions made, open loops aging out, and one required "thing I changed my mind about." Anti-vanity by design. Trigger when the user says "weekly memo," "weekly recap," "Monday status," "what changed this week," "weekly self-check," or asks to summarize last week's PM activity. Run weekly on schedule, or on demand to catch up.
Draft a stakeholder brief on a topic, tailored to executive, engineering, customer, or board audience. Use when a PM asks for a "brief", "update", "readout", "exec summary", "eng comms", "customer announcement", or "board update", or when they need to communicate a status / decision / change to a specific audience and need help adapting the framing.
Feature-by-feature comparison of two or more products with built-in eval design and runnable execution. Use when the PM wants to compare specific features across products (e.g., "compare GitHub Copilot vs Cursor", "how does our pricing stack up against X and Y", "evaluate these two AI coding tools head-to-head"). Use pmstack-competitive instead for market-positioning analysis (audience, value prop, white space).
Generate a structured competitive landscape analysis for a market or product category. Use when a PM mentions "competitive analysis", "market positioning", "white space", "where do we wedge in", asks who else is in a space, or wants to size up rivals before strategy/launch. Focus is on market positioning (audience, value prop, white space) — for feature-by-feature parity, use pmstack-compare instead.
Own this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimOwn this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimBased on adoption, maintenance, documentation, and repository signals. Not a security audit or endorsement.
[!TIP]
How do you measure the success of AI features?
Take the 2-minute Eval-Readiness check → · 8 questions · no email · instant score. Most PMs find out they're shipping on vibes — then it names the one gap to fix first.
An eval is how you prove an AI feature is actually good — and stays good after the next model update. In 2025 it became the line between a real AI PM and one who ships and hopes (OpenAI's CPO, Lenny, and a16z all said as much). But every eval tool is built for ML engineers: dashboards, SDKs, CI pipelines. pmstack is the one a PM runs. Here, /eval is a command — not a $2,000 course.
It's a set of commands you run inside Claude — terminal or phone, no code. The eval lifecycle is the core; the rest of your PM workflow (PRDs, competitive teardowns, launch-readiness gates, the Monday memo) rides along in the same place.
If gstack is the engineer's setup, pmstack is the PM's. Prove you know whether the model is any good. A feature flag won't tell you your model got quietly dumber overnight after a version bump. An eval will.
See it in 30 seconds. Point it at any AI feature — get a real test suite, not advice:
/eval "AI code-review bot that comments on pull requests"
A real run — the YAML shown is actual /eval output, not a mockup.
→ a runnable eval YAML: tasks, graders (code / model / human), negative cases, and the pass^k bar you'd gate a launch on. See a full one › Then /run-eval scores it against the real system and flags the week it regresses.
That's the wedge. The PRDs, voice-of-customer synthesis, and stakeholder briefs are table stakes — every AI writes those. An eval a PM can actually run is the part nobody else ships.
Brand new? Run /onboarding once — it walks the whole stack on a real example, start to finish. If it earns a place in your workflow, star the repo so you can find your way back.
You don't have to be "technical." You have to be able to run a command.
Terminal PM — you live in the shell. Install pmstack as Claude Code slash commands and get one muscle-memory across every artifact you ship.
Phone-and-browser PM — you work in claude.ai on web, desktop, or your phone. Upload the skills to a Claude Project and the same commands fire on plain English. No terminal, ever — same commands, same outputs.
~30 seconds. No terminal.
claude-skills/. All 18, or pick the 4–5 you'll use most.Two commands inside Claude Code. Versioned updates, clean uninstall, nothing piped to bash.
/plugin marketplace add RyanAlberts/pmstack
/plugin install pmstack@pmstack
Commands arrive namespaced — /pmstack:eval, /pmstack:prd, /pmstack:weekly — and all 18 skills auto-activate on plain English ("write a PRD from this customer quote"). Update later with /plugin, no re-install.
curl -fsSL https://raw.githubusercontent.com/RyanAlberts/pmstack/main/install.sh | bash -s -- --global
Installs to ~/.claude/ so the commands work in any folder under their bare names. Drop --global to install in the current folder only.
git clone https://github.com/RyanAlberts/pmstack.git && cd pmstack
./setup --global # or: ./setup ~/work/my-pm-stuff
For non-Claude tools, see docs/using-other-tools.md — the skills are plain markdown, paste them into .cursorrules, a Custom GPT, or a system prompt.
If you just installed pmstack, run this once. It's the only command you need to remember.
| Command template | What it answers | What you get | Where it runs |
|---|---|---|---|
/onboarding | "I just installed pmstack. Where do I start?" | A 9-step interactive tutorial running every capability in this README with examples so you can see what good looks like | CLI · web · desktop |
npx claudepluginhub ryanalberts/pmstack --plugin pmstackMulti-model consensus engine integrating OpenAI Codex CLI, Gemini CLI, and Claude CLI for collaborative code review and problem-solving.
Comprehensive UI/UX design plugin for mobile (iOS, Android, React Native) and web applications with design systems, accessibility, and modern patterns
Standalone image generation plugin using Nano Banana MCP server. Generates and edits images, icons, diagrams, patterns, and visual assets via Gemini image models. No Gemini CLI dependency required.
Write feature specs, plan roadmaps, and synthesize user research faster. Keep stakeholders updated and stay ahead of the competitive landscape.
Memory compression system for Claude Code - persist context across sessions