Design a logging and tracing strategy for a system or agent pipeline. Use this skill when asked to "add observability", "design a logging strategy", "plan how to trace this system", or "what should I log in this agent".
Slash command
/observability:trace-plan
Design a structured logging and distributed tracing strategy for the given system.
Read the system description and identify:
Select the appropriate logging approach based on system type:
| System type | Recommended strategy |
|---|---|
| Single-process agent | Structured JSON logs to stdout/file |
| Multi-service pipeline | Distributed tracing (OpenTelemetry spans) |
| Long-running daemon | Rotating file logs + health-check endpoint |
| Serverless / ephemeral | Centralized log aggregation (CloudWatch, Datadog) |
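The table above amounts to a lookup from system type to strategy. A minimal Python sketch of that lookup follows; the `SystemType` enum and `recommend_strategy` function are hypothetical names introduced for illustration and are not part of the skill itself.

```python
from enum import Enum


class SystemType(Enum):
    """System categories from the table (names are illustrative)."""
    SINGLE_PROCESS_AGENT = "single-process agent"
    MULTI_SERVICE_PIPELINE = "multi-service pipeline"
    LONG_RUNNING_DAEMON = "long-running daemon"
    SERVERLESS_EPHEMERAL = "serverless / ephemeral"


# Mirrors the table row for row.
RECOMMENDED_STRATEGY = {
    SystemType.SINGLE_PROCESS_AGENT: "Structured JSON logs to stdout/file",
    SystemType.MULTI_SERVICE_PIPELINE: "Distributed tracing (OpenTelemetry spans)",
    SystemType.LONG_RUNNING_DAEMON: "Rotating file logs + health-check endpoint",
    SystemType.SERVERLESS_EPHEMERAL: "Centralized log aggregation (CloudWatch, Datadog)",
}


def recommend_strategy(system_type: SystemType) -> str:
    """Return the recommended logging strategy for a system type."""
    return RECOMMENDED_STRATEGY[system_type]


print(recommend_strategy(SystemType.MULTI_SERVICE_PIPELINE))
```

A real system may fit more than one row (for example, a multi-service pipeline that also runs serverless functions), in which case the strategies would likely be combined rather than chosen exclusively.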
For each component identified in Step 1, define a trace span:
Span: <component name>
Parent: <parent span or "root">
Attributes to capture:
- <key>: <description of value>
- <key>: <description of value>
Events to record: <list of significant moments within this span>
Error conditions: <what constitutes a failed span>
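To make the span template concrete, here is a minimal standard-library sketch of a span that carries the same fields (name, parent, attributes, events, error condition). It is not the OpenTelemetry API; `Span`, `start_span`, and `FINISHED_SPANS` are hypothetical names, and the `retriever` component with its attributes is an invented example.

```python
import time
import uuid
from contextlib import contextmanager
from dataclasses import dataclass, field
from typing import Any, Iterator, Optional


@dataclass
class Span:
    name: str                      # "Span: <component name>"
    parent: Optional[str] = None   # "Parent: <parent span>"; None means root
    span_id: str = field(default_factory=lambda: uuid.uuid4().hex)
    attributes: dict[str, Any] = field(default_factory=dict)
    events: list[tuple[float, str]] = field(default_factory=list)
    status: str = "ok"
    error: Optional[str] = None

    def set_attribute(self, key: str, value: Any) -> None:
        self.attributes[key] = value

    def add_event(self, name: str) -> None:
        # Record a significant moment with a wall-clock timestamp.
        self.events.append((time.time(), name))


# In-memory sink for finished spans; a real system would export them.
FINISHED_SPANS: list[Span] = []


@contextmanager
def start_span(name: str, parent: Optional[str] = None) -> Iterator[Span]:
    span = Span(name=name, parent=parent)
    try:
        yield span
    except Exception as exc:
        # Assumed error condition: any uncaught exception fails the span.
        span.status = "error"
        span.error = repr(exc)
        raise
    finally:
        FINISHED_SPANS.append(span)


# Hypothetical component and attribute names, for illustration only.
with start_span("retriever", parent="root") as span:
    span.set_attribute("query_length", 42)
    span.add_event("cache_miss")
```

Treating every uncaught exception as a failed span is only one possible policy; the "Error conditions" line in the template is where a component-specific definition (timeouts, empty results, and so on) would go.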
Establish a consistent log-level policy:
Create a JSON schema for log entries to enable downstream filtering and aggregation:
{
  "timestamp": "<ISO 8601>",
  "level": "<DEBUG|INFO|WARNING|ERROR|CRITICAL>",
  "component": "<span/service name>",
  "trace_id": "<UUID shared across a full request>",
  "span_id": "<UUID for this component's span>",
  "event": "<short machine-readable event name>",
  "message": "<human-readable description>",
  "attributes": {}
}
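One way to emit entries in this shape is a custom formatter for Python's standard `logging` module. The sketch below is an assumption about how the schema might be implemented, not part of the skill: `JsonFormatter` and `make_logger` are hypothetical names, the logger name is mapped to `component`, and the trace fields are passed through `extra`. The `planner` component and `plan_created` event are invented examples.

```python
import io
import json
import logging
import uuid
from datetime import datetime, timezone


class JsonFormatter(logging.Formatter):
    """Render each record as one JSON object with the schema's fields."""

    def format(self, record: logging.LogRecord) -> str:
        entry = {
            "timestamp": datetime.fromtimestamp(
                record.created, tz=timezone.utc
            ).isoformat(),
            "level": record.levelname,
            "component": record.name,
            # Fields below arrive via `extra=`; they default to None/{}.
            "trace_id": getattr(record, "trace_id", None),
            "span_id": getattr(record, "span_id", None),
            "event": getattr(record, "event", None),
            "message": record.getMessage(),
            "attributes": getattr(record, "attributes", {}),
        }
        return json.dumps(entry)


def make_logger(component: str, stream) -> logging.Logger:
    """Build a logger that writes one JSON line per entry to `stream`."""
    logger = logging.getLogger(component)
    handler = logging.StreamHandler(stream)
    handler.setFormatter(JsonFormatter())
    logger.addHandler(handler)
    logger.setLevel(logging.DEBUG)
    logger.propagate = False  # avoid duplicate output via the root logger
    return logger


# Usage with a hypothetical "planner" component; an in-memory buffer
# stands in for stdout or a file.
buffer = io.StringIO()
log = make_logger("planner", buffer)
log.info(
    "plan created",
    extra={
        "trace_id": str(uuid.uuid4()),
        "span_id": str(uuid.uuid4()),
        "event": "plan_created",
        "attributes": {"steps": 3},
    },
)
entry = json.loads(buffer.getvalue())
print(entry["level"], entry["event"])
```

The level names in the schema match Python's built-in level names, so `record.levelname` can be used directly. If the logs are meant to correlate with OpenTelemetry traces, note that OpenTelemetry uses 16-byte trace IDs and 8-byte span IDs rendered as hex rather than UUID strings, so the ID format may need adjusting.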
Prioritize instrumentation by impact:
Produce a Markdown document listing:
npx claudepluginhub ats-kinoshita-iso/agent-workshop --plugin observability