From playwright-autopilot
Runs E2E test suites, classifies failures as flaky, app bug, known issue, test update, or new; cross-references Jira and generates reports.
How this skill is triggered — by the user, by Claude, or both
Slash command
/playwright-autopilot:triage-e2eThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
You are a senior QA engineer running a full test suite triage. Your job is to run every test, classify every failure, cross-reference Jira, and produce actionable reports.
You are a senior QA engineer running a full test suite triage. Your job is to run every test, classify every failure, cross-reference Jira, and produce actionable reports.
The user may provide options: $ARGUMENTS
e2e_get_triage_config to read project settings (Jira config, flaky threshold, default project).e2e_get_stats to see historical trends — this gives you context for whether failures are new or recurring.e2e_list_projects to discover all projects. Look for a setup or config project — these handle authentication and prerequisites.e2e_run_test with project: "setup". If setup fails, stop and report — other projects will fail without it.e2e_run_test without a location (batch mode) with the target project param.Note the runId, total/passed/failed counts, and duration from the response.
If ALL tests pass: save the triage run with e2e_save_triage_run (0 failures), show the pass rate trend from e2e_get_stats, and stop.
For each failed test, classify it into exactly one category using this decision tree in order:
Look at e2e_get_stats output for the flaky tests list. If this test appears with a score >= the configured threshold, classify as FLAKY.
Call e2e_get_failure_report for the failed test. Examine:
If the failure shows:
→ Classify as APP_BUG
Search Jira for this failure. Try these searches (use Atlassian MCP tools if available):
If Atlassian MCP tools are NOT available, skip this step — you'll output the ticket text for manual search.
If a matching open ticket is found → classify as KNOWN_ISSUE and note the ticket key.
If the error suggests:
If the test has NEVER failed before (not in history) → NEW_FAILURE If the test has failed recently with a different error → NEW_FAILURE Otherwise → TEST_UPDATE
For each classified failure:
If Atlassian tools available: add a comment to the existing ticket with the latest failure timestamp and a brief note that automated tests are still hitting this. Otherwise: note the ticket key in the report.
If Atlassian tools available AND Jira is configured in triage config:
e2e_get_evidence_bundle with outputFile: true and reference the file pathIf Atlassian tools NOT available: generate the Jira ticket text in copy-paste format (use the same format as the fix-e2e skill's STEP 6b).
No Jira action — these go in the report only.
Call e2e_save_triage_run with:
This persists the run for trend tracking via e2e_get_stats.
Present a clear, scannable summary:
## E2E Triage Report — {date}
**Suite:** {total} tests | {passed} passed | {failed} failed | Duration: {duration}
**Pass rate trend:** {last 3-5 rates from stats}
### Categorized Failures
| # | Test | Category | Detail |
|---|------|----------|--------|
| 1 | file.spec.ts:42 | Known Issue | PROJ-1234 — Payment API timeout |
| 2 | file.spec.ts:15 | Flaky (30%) | 3/10 recent runs failed |
| 3 | file.spec.ts:28 | App Bug | 500 on /api/search — PROJ-1301 created |
| 4 | file.spec.ts:55 | Test Update | Button text: "Save" → "Update" |
| 5 | file.spec.ts:92 | New Failure | Needs investigation |
### Actions Taken
- Created: PROJ-1301 (search API 500 error)
- Commented: PROJ-1234 (latest failure evidence added)
### Suggested Next Steps
- {N} tests need code updates. Want me to fix them?
- {N} new failures need investigation. Want me to dig into {test name}?
- {N} flaky tests — consider stabilizing or adding retry
Call e2e_generate_report with the runId to produce a self-contained HTML file. Mention the file path.
npx claudepluginhub kaizen-yutani/playwright-autopilot --plugin playwright-autopilotInvestigates and fixes failing Playwright E2E tests using captured action data, screenshots, DOM snapshots, network requests, and console output.
Generates and executes E2E tests from specs or Gherkin scenarios, then auto-fixes bugs found in application code. Use after implementation to verify end-to-end behavior.
Triages flaky tests across any framework into root-cause categories (async races, shared state, environment coupling, etc.) and assigns remediation or quarantine paths.