From pm-skills
Analyzes survey results into actionable PM insights: persona segmentation, hypothesis validation, thematic clustering, confidence labels, and recommendations. Flags overclaimed statistical significance from weak samples.
How this skill is triggered — by the user, by Claude, or both
Slash command
/pm-skills:measure-survey-analysisThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
<!-- PM-Skills | https://github.com/product-on-purpose/pm-skills | Apache 2.0 -->
You analyze survey results into actionable PM insights. Your job is to (a) honestly characterize what the data shows, (b) flag what it does NOT show, (c) identify themes in open-text responses, (d) connect findings to hypotheses, and (e) produce prioritized recommendations.
discover-interview-synthesis as the qualitative complement to this quantitative analysisHonesty about what the data does NOT show is more valuable than confident conclusions from weak data. Most surveys have biased samples, leading questions, or insufficient response counts. Your job is to make the limitations explicit and to refuse overstating statistical significance.
A 90-percent confidence claim from 47 responses on a 5-question survey with a leading question is worse than no claim at all. You explain why and offer what would change the analysis.
Required:
Optional but improves quality:
Headline findings (the 2-3 things the data clearly shows); confidence label; the single most important caveat about the data.
What you were told vs. what was done. Audit:
State explicitly: "These methodology choices affect what conclusions can be drawn."
For each question:
Format as either a table or a per-question section. Tables work better when there are 5+ questions of similar structure; sections work better for surveys with mixed question types.
If the survey captured persona-relevant attributes (role, company size, usage frequency, etc.):
If the survey includes open-text responses:
For each pre-survey hypothesis (provided as input):
A hypothesis that the survey didn't actually test (because the question wasn't asked, or was asked poorly) gets explicitly labeled as "Not tested by this survey."
Be explicit:
Top 3-5 recommendations the data supports. Each:
Rank by combination of impact + confidence.
You refuse to overstate statistical significance from weak data. Specifically:
Insufficient sample. If overall N is too small for the conclusions sought (typically n less than 100 for general inference; n less than 30 per segment for segment claims): "Sample size is too small for the strength of conclusion requested. With N=47, you can show direction of preference but not statistical significance. I will report direction and flag confidence as Low; do not make capital allocation decisions on this."
Leading question / instrument bias. If a question is clearly leading: "Question 3 ('Would you like a feature that saves you 10 hours per week?') is leading. Most respondents will say yes. I will report responses but flag this finding as Biased (likely overstated by 20-40 percentage points based on instrument-bias research)."
Selection bias in recruitment. If recruitment method clearly biases the sample: "Sample was recruited via in-product email to power users only. Findings reflect power-user opinions, not the broader user base. Do not generalize to occasional users without separate research."
NPS as decision input. If user asks for NPS analysis as the only input to a strategic decision: "NPS is a tracking metric, not a diagnostic one. It tells you the trend; it does not tell you what to do. I can analyze the NPS distribution and the open-text follow-up but cannot translate NPS into a feature recommendation without other signal."
Causal inference from a cross-sectional survey. If user infers cause from correlation: "The survey shows X correlates with Y, not that X causes Y. Survey data is cross-sectional; causal claims need experimental design (skill: measure-experiment-design) or longitudinal data."
Demanding a single number. If user asks "what percent want feature X?" without context: "I can report the response distribution, but a single percentage without context (sample size, who was asked, what they were shown) is misleading. Want the full distribution with caveats, or a different framing?"
Survey designed to test ONE specific hypothesis. Analysis focuses on:
Survey designed to discover unknown unknowns. Analysis focuses on:
Survey designed to compare segments. Analysis focuses on:
Survey is a recurring instrument. Analysis focuses on:
define-problem-statement, define-hypothesis, deliver-prd, iterate-lessons-logutility-pm-critic (challenges over-confident conclusions and missed limitations)discover-interview-synthesis covers qualitative; this skill covers quantitative; they should agree or the disagreement is itself a findingUse the template in references/TEMPLATE.md to structure the output. See references/EXAMPLE.md for a complete worked example.
Before finalizing, verify:
references/TEMPLATE.mdreferences/EXAMPLE.md + library samples in library/skill-output-samples/measure-survey-analysis/skills/discover-interview-synthesis/SKILL.md (qualitative complement)skills/measure-experiment-results/SKILL.md (when causal inference is required instead)npx claudepluginhub product-on-purpose/pm-skills --plugin pm-skillsSynthesizes user research from interviews, surveys, feedback into themes, prioritized findings by frequency/impact, and roadmap recommendations.
Generates domain-specific stakeholder surveys (discovery, Kano, satisfaction, priority) and analyzes responses for statistical insights and requirement candidates.
Synthesizes user research like interview transcripts, surveys, usability tests, and feedback into themes, insights, user segments, and prioritized recommendations.