From ultraship
Verifies production site health post-deploy via canary monitoring: checks HTTP status, response time, and error patterns, compares against baseline to detect regressions, and guides rollback decisions.
How this skill is triggered — by the user, by Claude, or both
Slash command
/ultraship:canary <production-url><production-url>This skill is limited to the following tools:
The summary Claude sees in its skill listing — used to decide when to auto-load this skill
After every deploy, canary monitoring verifies your production site is healthy. It checks HTTP status, response time, error patterns, and compares against a baseline to detect regressions.
After every deploy, canary monitoring verifies your production site is healthy. It checks HTTP status, response time, error patterns, and compares against a baseline to detect regressions.
Announce at start: "I'm running post-deploy canary monitoring."
node ${CLAUDE_PLUGIN_ROOT}/tools/canary-monitor.mjs <production-url> --checks 3 --interval 2
This runs 3 health checks with 2-second intervals. Options:
--checks N — number of checks to run (default: 3)--interval S — seconds between checks (default: 2)--baseline <file> — path to baseline file for regression comparisonThe canary monitor returns a health status:
| Status | Meaning | Action |
|---|---|---|
healthy | All checks pass, no regressions | Deploy succeeded |
degraded | Site is up but has error patterns or issues | Investigate the specific issues |
regression_detected | Performance or behavior regressed from baseline | Compare with baseline, consider rollback |
critical_regression | Major regression (status code change, 3x slower) | Rollback immediately |
down | Site is unreachable or returning errors | Rollback immediately, use /rescue |
Present the results clearly:
+===========================================+
| C A N A R Y R E P O R T |
+===========================================+
| URL savemrr.co |
| Status ✓ HEALTHY |
| Response Time 234ms (avg) |
| Checks 3/3 passed |
| Regressions None |
+===========================================+
If issues are found, show them with severity and recommended action.
If the canary detects problems:
If a Sentry MCP server is connected (check your available tools for sentry), confirm the regression against live error data: compare the error/issue rate since this deploy to the prior baseline window. A post-deploy spike in a new issue is hard confirmation that the deploy caused it — and the stack trace tells you exactly what to roll back or fix. Distinguish a real regression from background noise this way before recommending a rollback.
degraded — Show the specific error patterns found. Check if they're pre-existing or new.regression_detected — Show the before/after comparison. If response time regressed >50%, investigate.critical_regression or down — Recommend immediate rollback:
git revert HEAD --no-edit && git push
Then use /rescue for full incident diagnosis.When the site is healthy, the canary automatically saves a baseline to .ultraship/canary/baseline.json. Future canary runs compare against this baseline to detect regressions.
For extended monitoring after a risky deploy:
node ${CLAUDE_PLUGIN_ROOT}/tools/canary-monitor.mjs <url> --checks 10 --interval 30
This runs 10 checks over 5 minutes, catching delayed failures (connection pool exhaustion, memory leaks, cache warm-up issues).
/deploy — Run canary automatically after deploy completes/rescue — If canary detects down or critical_regression, escalate to incident response/retro — Include canary results in sprint retrospectives/learn — Save deployment gotchas as learnings when canary catches issuesFor deeper verification, combine canary with Playwright MCP:
browser_console_messages/visual-diff)This catches JavaScript errors, broken layouts, and functional regressions that HTTP-only checks miss.
npx claudepluginhub houseofmvps/ultraship --plugin ultrashipMonitors deployed web app URLs for post-deploy regressions including HTTP status, console errors, network failures, performance metrics (LCP/CLS/INP), content changes, and API health with alerts and looping checks.
Monitors deployed URLs after releases: checks HTTP status, console errors, static assets, SSE streams, and performance regressions. Use after deploys, risky merges, or dependency upgrades.
Monitors production after a 100% deploy completes, comparing metrics and screenshots against a pre-deploy baseline to detect silent regressions (console errors, perf drops, broken pages) during the first hours/days.