From honeycomb
Guides interpreting Honeycomb SLO compliance, error budgets, burn rates, and trigger status. Detects misconfigured SLIs, advises deploy freezes vs on-call paging, designs burn alerts.
Slash command: /honeycomb:slos-and-triggers
Guidance for configuring and reasoning about reliability in Honeycomb. The get_slos
and get_triggers tools document their own parameters — this skill focuses on
designing effective SLOs, choosing between SLOs and triggers, and interpreting
what the numbers mean.
Availability: SLOs require a Pro or Enterprise plan. Triggers are available on all plans.
| Question | SLO | Trigger |
|---|---|---|
| "Are we meeting our reliability commitments?" | Yes | No |
| "Is something broken right now?" | No | Yes |
| "How fast are we burning our error budget?" | Yes (burn alerts) | No |
| "Did error count exceed a threshold?" | No | Yes |
| "Should we slow down deploys?" | Yes (budget remaining) | No |
Rule of thumb: SLOs measure reliability against commitments over time. Triggers catch immediate operational issues.
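To make "budget remaining" and "burn rate" from the table concrete, here is a minimal sketch of the usual event-based error budget arithmetic. It is generic SRE math, not Honeycomb's documented calculation; the function names and the 99.9% target are assumptions chosen for illustration, and the target is assumed to be below 100%.

```python
def error_budget(total_events: int, failed_events: int, target: float) -> dict:
    """Event-based error budget for one SLO window (generic arithmetic)."""
    # Failures the SLO tolerates over the window; assumes target < 1.0.
    allowed_failures = total_events * (1.0 - target)
    return {
        "compliance": (total_events - failed_events) / total_events,
        "allowed_failures": allowed_failures,
        "remaining_fraction": 1.0 - failed_events / allowed_failures,
    }


def hours_until_exhausted(remaining_fraction: float, burned_last_hour: float) -> float:
    """Project time to exhaustion if the last hour's burn continues unchanged."""
    if burned_last_hour <= 0:
        return float("inf")  # nothing is burning, so no projected exhaustion
    return max(remaining_fraction, 0.0) / burned_last_hour


# Hypothetical window: 1,000,000 events, 400 failures, 99.9% target.
budget = error_budget(total_events=1_000_000, failed_events=400, target=0.999)
print(round(budget["remaining_fraction"], 3))
# Assume 5% of the whole budget was consumed during the last hour.
print(round(hours_until_exhausted(budget["remaining_fraction"], 0.05), 1))
```

With these made-up numbers roughly 60% of the budget is left, and at the assumed burn it would last about 12 hours. A slow burn of this kind is a signal to reconsider deploys; a projection measured in minutes is closer to a page.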
An SLI is a per-event boolean: was this event successful? Implemented as a calculated field returning 1 (success) or 0 (failure).
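The per-event idea can be sketched in plain Python. This is only an illustration of the concept, not Honeycomb's calculated-field syntax: `latency_sli` and `compliance` are hypothetical helpers, and treating events without a `duration_ms` field as "not counted" is an assumption of this sketch.

```python
from typing import Callable, Iterable, Optional


def latency_sli(event: dict, threshold_ms: float = 500.0) -> Optional[int]:
    """Return 1 (success), 0 (failure), or None when the event does not qualify."""
    duration = event.get("duration_ms")
    if duration is None:
        return None  # assumption: events without a duration are excluded
    return 1 if duration <= threshold_ms else 0


def compliance(events: Iterable[dict], sli: Callable[[dict], Optional[int]]) -> Optional[float]:
    """Fraction of qualifying events for which the SLI returned 1."""
    results = [r for r in (sli(e) for e in events) if r is not None]
    if not results:
        return None  # no qualifying events, so compliance is undefined
    return sum(results) / len(results)


# Hypothetical events: two fast requests, one slow request, one non-request event.
events = [
    {"duration_ms": 120},
    {"duration_ms": 500},
    {"duration_ms": 900},
    {"name": "background job"},
]
print(compliance(events, latency_sli))
```

Here two of the three qualifying events succeed, so compliance is two thirds. Because the SLI is evaluated per event, compliance is a simple ratio rather than an aggregate such as a percentile.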
Example SLI expressions:

- LTE(duration_ms, 500): requests faster than 500ms
- LTE(http.status_code, 499): non-5xx responses
- EQUALS(checkout.status, "completed"): successful checkouts

At minimum, two alerts:
When reviewing SLOs with get_slos:
"50 requests slower than 2s" is more actionable than "P99 is 2100ms."
Use COUNT WHERE duration_ms > threshold instead of P99 triggers.
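A small sketch of why a count can say more than a percentile, using made-up latency data and a nearest-rank P99 (an assumption; query engines may compute percentiles differently). The helper names are hypothetical.

```python
def p99(durations: list) -> float:
    """Nearest-rank 99th percentile, computed with integer arithmetic."""
    ordered = sorted(durations)
    rank = (99 * len(ordered) + 99) // 100  # ceil(0.99 * n) without float error
    return ordered[rank - 1]


def count_slower_than(durations: list, threshold_ms: float) -> int:
    """Equivalent in spirit to COUNT WHERE duration_ms > threshold."""
    return sum(1 for d in durations if d > threshold_ms)


# Two hypothetical windows with the same P99 but different impact.
quiet_window = [100] * 395 + [2100] * 5
busy_window = [100] * 3950 + [2100] * 50

for window in (quiet_window, busy_window):
    print(p99(window), count_slower_than(window, 2000))
```

Both windows report a P99 of 2100 ms, yet one has 5 slow requests and the other 50. The count exposes how many requests were affected, which the percentile alone does not.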
Share a single error budget across up to 10 services.
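One consequence of a shared budget is that a single noisy service can consume most of it. The sketch below assumes events from all services are pooled into one budget; that pooling model, the service names, and the numbers are illustrative assumptions, not a description of Honeycomb's internals.

```python
def pooled_budget(services: dict, target: float) -> dict:
    """services maps name -> (total_events, failed_events); target is below 1.0."""
    total = sum(t for t, _ in services.values())
    failed = sum(f for _, f in services.values())
    allowed = total * (1.0 - target)  # failures tolerated across all services
    return {
        "consumed_fraction": failed / allowed,
        "share_by_service": {name: f / allowed for name, (_, f) in services.items()},
    }


# Hypothetical traffic for three services sharing one 99.9% SLO.
usage = pooled_budget(
    {
        "checkout": (100_000, 20),
        "search": (800_000, 50),
        "auth": (100_000, 600),
    },
    target=0.999,
)
print(round(usage["consumed_fraction"], 2))
print({name: round(share, 2) for name, share in usage["share_by_service"].items()})
```

In this made-up case about two thirds of the shared budget is gone, and almost all of that comes from the hypothetical auth service even though it carries only a tenth of the traffic. Breaking consumption down by service helps decide where to look first.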
- ${CLAUDE_PLUGIN_ROOT}/skills/slos-and-triggers/references/slo-design-guide.md: Detailed SLO design methodology, multi-service SLOs, error budget math
- ${CLAUDE_PLUGIN_ROOT}/skills/slos-and-triggers/references/trigger-examples.md: Complete trigger example library organized by use case
- ${CLAUDE_PLUGIN_ROOT}/skills/slos-and-triggers/references/alerting-strategy.md: How to combine SLO burn alerts and triggers into a cohesive alerting strategy

npx claudepluginhub honeycombio/agent-skill --plugin honeycomb