Santa's Little Helper (SLH) — mission-based web app evaluation for Claude Code
npx claudepluginhub glendigity/santas-little-helperMission-based web app evaluation. Explores apps as a target user and produces QA reports with classified findings.
"He's a good boy. Your code isn't."
A Claude Code plugin that evaluates web apps by exploring them as a real user. Define personas and missions, then SLH navigates your app, finds what's broken/confusing/rough, and writes a narrative QA report with suggested Playwright regression tests.
Fair warning: This was 100% vibe-coded with Claude Opus 4.6. It works, it's useful, but it's early and rough around the edges. Issues, PRs, and feedback very welcome.
Named after the Simpsons' reject greyhound — chaotic, chews everything, digs up things buried in the yard... but he's trying to help.
# If playwright-cli is already installed
playwright-cli install --skills
# Otherwise, use your package manager:
npx playwright-cli@latest install --skills # npm
bunx playwright-cli@latest install --skills # bun
pnpx playwright-cli@latest install --skills # pnpm
yarn dlx playwright-cli@latest install --skills # yarn
/plugin marketplace add glendigity/santas-little-helper
/plugin install slh
| Command | Description |
|---|---|
/slh-profile [url] | Define user personas and their mission lists |
/slh-test [url] [user] [pass] | Run a full evaluation session |
/slh-retest [url] [user] [pass] | Re-test previous findings to verify fixes |
| Category | Question |
|---|---|
| Broken | Does it work? |
| Confusing | Can I figure it out? |
| Inconsistent | Does it match itself? |
| Rough | Does it look/feel right? |
| Observation | Not sure — needs human judgment |
Each finding gets a severity: High (blocks a task), Medium (significant friction), or Low (noticeable but doesn't impede).
You should — SLH suggests Playwright tests for every finding it discovers. But:
SLH is also for when you're building fast. The UI changes constantly but your missions stay the same. Playwright tests break when you move a button. SLH just tries to accomplish the goal and tells you what got in the way.
SLH uses playwright-cli because it supports named sessions, which lets mission agents run in parallel with independent browser windows. If Claude in Chrome turns out to be better for this, happy to move to it.
/slh-profile — good missions are concrete goals ("Find out who reports to you"), not page names ("View the org chart")/slh-test with a subset of missions first to calibrate.slh/ to git — profiles, knowledge, and orientation are meant to be shared with your teamslh-reports/ and .playwright-cli/ to your .gitignore — those are generated output.slh/config.yml with reports_dir: ./your-pathInspired by Robbie Baskin's idea for using Claude to take on user missions as a QA tester.
MIT
Fetches bugs so you don't have to.
Claude Code marketplace entries for the plugin-safe Antigravity Awesome Skills library and its compatible editorial bundles.
Production-ready workflow orchestration with 84 marketplace plugins, 192 local specialized agents, and 156 local skills - optimized for granular installation and minimal token usage
Directory of popular Claude Code extensions including development tools, productivity plugins, and MCP integrations