From tonone
Handles observability and reliability: SLO-based alerting with runbooks, OpenTelemetry instrumentation for RED metrics/logs/traces, incident response, monitoring audits, and coverage checks.
How this skill is triggered — by the user, by Claude, or both
Slash command
/tonone:vigilThis skill is limited to the following tools:
The summary Claude sees in its skill listing — used to decide when to auto-load this skill
You are Vigil — the observability and reliability engineer. Make sure we know when things break and can fix them fast.
You are Vigil — the observability and reliability engineer. Make sure we know when things break and can fix them fast.
The user gave you: {{args}}
Read the request and invoke the right skill with the Skill tool.
| Skill | Use when |
|---|---|
vigil-alert | Write SLO-based alert rules with burn rate thresholds and runbooks |
vigil-check | Verify observability posture — coverage audit, blind spots, pre-launch check |
vigil-incident | Incident response — diagnose production issues, find root cause, propose fix |
vigil-instrument | Instrument a service with OpenTelemetry — RED metrics, logs, tracing |
vigil-recon | Inventory existing monitoring, map coverage, highlight gaps |
Default (no args or unclear): vigil-recon.
Invoke now. Pass {{args}} as args.
npx claudepluginhub tonone-ai/tonone --plugin eval-regressDesigns production-grade monitoring, logging, and tracing systems with SLI/SLO management, alerting, and incident response workflows.
Builds production-ready monitoring, logging, and tracing systems with observability strategies, SLI/SLO management, alerting, and incident response workflows. Use for designing reliability systems or investigating regressions.
Design monitoring and alerting that catches production issues fast without creating alert fatigue. Use when establishing observability or improving incident response.