Expert guidance and automation for backend observability: APM, distributed tracing, metrics, and logging for Go, Python, Node.js, Java/Kotlin, Rust, and .NET.
Track cache hit rates, latency, and detect cache-related issues
Instrument database queries, connection pools, and detect N+1 queries
Capture errors with rich context for debugging and alerting
Implement liveness, readiness, and dependency health checks
Plan backend observability using RED + USE + 4 Golden Signals + JTBD
Own this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimOwn this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimBased on adoption, maintenance, documentation, and repository signals. Not a security audit or endorsement.
A comprehensive Claude Code plugin for implementing production-ready observability in backend services. Covers the full spectrum from basic logging to advanced distributed tracing and SLO-based alerting.
/instrument - Instrument backend services with observability/audit - Audit existing observability implementation| Skill | Description |
|---|---|
instrumentation-planning | Plan what to measure based on JTBD framework |
request-tracing | Implement distributed tracing with context propagation |
database-observability | Instrument database queries, pools, and transactions |
cache-observability | Monitor cache hit rates, latency, and memory |
queue-observability | Track message queues, consumer lag, and processing |
error-handling | Capture errors with context and classification |
health-checks | Implement liveness, readiness, and startup probes |
slo-alerting | Define SLIs/SLOs and burn-rate alerting |
| Agent | Purpose |
|---|---|
codebase-analyzer | Analyze codebase architecture and existing telemetry |
instrumentation-reviewer | Review code for observability anti-patterns |
Real-time anti-pattern detection for:
# Add to Claude Code plugins
claude plugins add backend-observability
Or clone directly:
git clone https://github.com/nexus-labs/backend-observability ~/.claude/plugins/backend-observability
/instrument --analyze
/instrumentation-planning
/request-tracing
/slo-alerting
Ready-to-use code for each platform (Go, Python, Node.js, Java, Rust, .NET):
The plugin detects common observability anti-patterns:
// BAD: User ID in metric label
counter.WithLabelValues(userID).Inc()
// GOOD: Use attributes in traces instead
span.SetAttributes(attribute.String("user.id", userID))
# BAD: Context not propagated
requests.get(url)
# GOOD: Inject trace context
otel.inject(headers)
requests.get(url, headers=headers)
// BAD: Full URL path as label
histogram.labels({ path: req.url }).observe(duration);
// GOOD: Use route pattern
histogram.labels({ route: req.route.path }).observe(duration);
backend-observability/
├── plugin.json # Plugin manifest
├── marketplace.json # Marketplace metadata
├── CLAUDE.md # Plugin instructions
├── commands/
│ ├── instrument.md # /instrument command
│ └── audit.md # /audit command
├── agents/
│ ├── codebase-analyzer/
│ └── instrumentation-reviewer/
├── skills/
│ ├── instrumentation-planning/
│ ├── request-tracing/
│ ├── database-observability/
│ ├── cache-observability/
│ ├── queue-observability/
│ ├── error-handling/
│ ├── health-checks/
│ └── slo-alerting/
├── hooks/
│ └── hooks.json # Anti-pattern detection
└── references/
├── anti-patterns.md # Common observability mistakes
├── methodology/ # RED, USE, 4GS, JTBD, SLI/SLO
├── platforms/ # Go, Python, Node.js, Java, Rust, .NET
├── vendors/ # OTel, Datadog, Grafana, etc.
├── patterns/ # Circuit breaker, retry, timeout, rate limiting
└── templates/ # Middleware, database, health checks per platform
npx claudepluginhub nexus-labs-automation/backend-observabilityExpert guidance and automation for mobile app observability: crash reporting, performance monitoring, session replay, and instrumentation for iOS, Android, React Native, and Flutter.
Expert guidance and automation for mobile app observability: crash reporting, performance monitoring, session replay, and instrumentation for iOS, Android, React Native, and Flutter.
Expert guidance for instrumenting AI agents in production. Covers LLM call tracing, multi-agent coordination, tool execution, token/cost tracking, and evaluation quality for frameworks like LangChain, Claude Agent SDK, and custom agent loops.
Expert guidance and automation for web application observability: Core Web Vitals, error tracking, performance monitoring, session replay, and instrumentation for React, Vue, Angular, Svelte, Next.js, Nuxt, Remix, SvelteKit, and Astro.
Editorial "Observability & Monitoring" bundle for Claude Code from Antigravity Awesome Skills.
Traceway skills: /tw:traceway debugs production issues through the traceway CLI, /tw:traceway-setup instruments a project to report to a Traceway instance.
OpenTelemetry skills and reference documentation for AI coding assistants
Observability engineering agents providing expertise in tracing, monitoring, and logging
Commands for setting up monitoring and observability
Set up distributed tracing for microservices