From claudity-assurance
This skill should be used when a QA session needs to verify BDD scenarios against a running system. Trigger phrases include "verify these BDDs", "test the new features", "run verification", "check the bdds", "verify against the system", "start QA verification", "what needs testing", and "run the BDD scenarios". Also activates when new .feature.md files appear in the bdds/ directory and the user asks to test them.
How this skill is triggered — by the user, by Claude, or both
Slash command
/claudity-assurance:verifyThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
<HARD-GATE>
ALL knowledge comes from: your internal docs (docs/), incoming BDDs (bdds/), and direct interaction with the running system via the configured interaction surface. Never read code to understand behavior — test it instead.
Ingest BDD scenario files from the bdds/ dropbox, verify each scenario against
the running system using the configured interaction surface, produce a
verification report with evidence, check for regressions in related areas, and
update the internal knowledge graph with anything learned.
This skill is the core testing loop of claudity-assurance. It combines focused verification (new BDDs) with regression awareness (previously verified behaviors in the same area).
bdds/ by the dev session/c-a:explore discovers new behaviors worth verifying/c-a:onboard/c-a:reset to troubleshootRead all .feature.md files in bdds/. Parse YAML frontmatter from each file.
If no new BDD files are found, ask the user:
Sort BDDs by priority (high first, then medium, then low).
For each BDD file, cross-reference with the knowledge graph:
area field — look for matching docs in the knowledge graphtags field — determine which interaction methods to useevidence-level — plan evidence collection depthnotes for any constraints or gotchasBuild a verification plan. Present it briefly to the user: "I have N scenarios across M areas. Starting with [highest priority]. Here's my approach: [brief summary]. Ready to begin?"
For each scenario in priority order:
evidence-level:
light: Write a text note of what was observedmedium: Text note + capture screenshot(s) at key verification pointsheavy: Text note + screenshots + record video/screen capture of the verificationWhen stuck during verification:
Consult references/verification-guide.md for evidence collection patterns
and common verification strategies.
After verifying the new BDDs, check for regressions:
area values from the new BDDsThe regression scope should be proportional — not a full retest of everything, but a focused check on areas adjacent to the changes.
Write the verification report to results/YYYY-MM-DD-<feature-slug>-verification.md.
The report should be lightweight and scannable:
Evidence files (screenshots, recordings) stored alongside the report or in a
subdirectory of results/.
Present a conversational summary — the user should understand the results without reading the full document:
"Verified N scenarios for [feature]. M passed, L failed. [Brief note on failures if any]. Regression check on [area]: no issues found. Full report at results/[filename]."
The user can ask follow-up questions or review the full document if they want more detail.
Fold everything learned during verification into the internal docs:
Update the changelog with the verification event.
Provides behavioral guidelines to reduce common LLM coding mistakes, focusing on simplicity, surgical changes, assumption surfacing, and verifiable success criteria.
Searches, retrieves, and installs Agent Skills from prompts.chat registry using MCP tools like search_skills and get_skill. Activates for finding skills, browsing catalogs, or extending Claude.
Creates, edits, and optimizes skills for Claude Code, including drafting, evaluating with test prompts, iterating on performance, and improving skill descriptions for better triggering accuracy.
npx claudepluginhub look-itsaxiom/claudity-assurance --plugin claudity-assurance