From role-devops
Configures monitoring stacks with Prometheus/Grafana, Datadog, CloudWatch, PagerDuty; designs USE/RED dashboards; sets up custom metrics, alerts, uptime checks, log aggregation.
Slash command
/role-devops:monitoring-setup
- Deploy Prometheus with the **kube-prometheus-stack** Helm chart for Kubernetes environments. It bundles Prometheus, Grafana, Alertmanager, and node-exporter.
- Use `ServiceMonitor` and `PodMonitor` CRDs for automatic scrape target discovery. Avoid editing scrape configs by hand.
- Define Datadog monitors as code with Terraform (the `datadog_monitor` resource) or the Datadog API. Version control all monitor definitions.
- Apply consistent tags (`env:production`, `service:api`, `team:platform`) across metrics, logs, and traces for unified correlation.
- Expose custom application metrics on a `/metrics` endpoint.
- Give metrics descriptive labels, for example `http_requests_total{method="GET", status="200", handler="/api/users"}`.
- Group related alerts in Alertmanager with `group_by` to prevent alert storms. Use `inhibit_rules` to suppress downstream alerts when a root cause is firing.
- Emit structured logs with a consistent set of fields: `timestamp`, `level`, `service`, `trace_id`, `message`.
- Link logs to traces through the `trace_id` field for seamless debugging.

npx claudepluginhub rnavarych/alpha-engineer --plugin role-devops

Deploys monitoring stacks like Prometheus, Grafana, and Datadog with exporters, scrape targets, alerting rules, and Grafana dashboards for Kubernetes or Docker.
Sets up monitoring and observability solutions including metrics collection, distributed tracing, log aggregation, and Grafana dashboards. Useful for implementing the three pillars of observability and establishing alerting strategies.
Provides standards for monitoring, metrics, alerting, and observability including golden signals, RED/USE methods, metric naming conventions, and alert severity levels.
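
The custom-metric and structured-logging guidance in the skill's list above can be sketched in a few lines of standard-library Python. This is only an illustration under stated assumptions: the helper names `render_counter` and `log_line`, the service name `api`, and the trace ID value are hypothetical, and a real service would normally rely on an official Prometheus client library and its logging framework rather than hand-rolled helpers.

```python
import json
from datetime import datetime, timezone


def render_counter(name: str, labels: dict[str, str], value: int) -> str:
    """Render one counter sample in the Prometheus text exposition format.

    Hypothetical helper for illustration; it shows what a /metrics
    endpoint would return for a single labeled counter.
    """
    def escape(label_value: str) -> str:
        # Label values escape backslash, double quote, and newline.
        return (
            label_value.replace("\\", "\\\\")
            .replace('"', '\\"')
            .replace("\n", "\\n")
        )

    pairs = ",".join(f'{key}="{escape(val)}"' for key, val in labels.items())
    return f"# TYPE {name} counter\n{name}{{{pairs}}} {value}\n"


def log_line(level: str, service: str, trace_id: str, message: str) -> str:
    """Build one structured log record as a JSON string.

    The field set mirrors the list above: timestamp, level, service,
    trace_id, message. Keeping trace_id on every record is what lets a
    log backend join log lines to the matching trace.
    """
    record = {
        "timestamp": datetime.now(timezone.utc).isoformat(),
        "level": level,
        "service": service,
        "trace_id": trace_id,
        "message": message,
    }
    return json.dumps(record)


if __name__ == "__main__":
    # Assumed example values; none of these come from a real system.
    print(render_counter(
        "http_requests_total",
        {"method": "GET", "status": "200", "handler": "/api/users"},
        1,
    ))
    print(log_line("INFO", "api", "abc123", "request handled"))
```

Emitting one JSON object per line keeps the records easy for most log shippers to parse, and using the same `service` value in both the metric labels and the log fields makes cross-signal correlation simpler.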