From coreweave-pack
Monitors CoreWeave Kubernetes events, GPU utilization, and inference service health. Tracks pod lifecycles and sends alerts via kubectl and Python scripts.
How this skill is triggered — by the user, by Claude, or both
Slash command
/coreweave-pack:coreweave-webhooks-eventsThis skill is limited to the following tools:
The summary Claude sees in its skill listing — used to decide when to auto-load this skill
```bash
# Watch GPU pod events
kubectl get events --watch --field-selector=reason=Scheduled,reason=Pulled,reason=Failed
# Monitor GPU utilization via exec
kubectl exec -it deployment/inference -- nvidia-smi --query-gpu=utilization.gpu,memory.used --format=csv -l 5
# DCGM exporter for GPU metrics (pre-installed on CKS)
# Key metrics:
# DCGM_FI_DEV_GPU_UTIL - GPU utilization %
# DCGM_FI_DEV_FB_USED - GPU memory used
# DCGM_FI_DEV_POWER_USAGE - Power draw
import subprocess, json, requests
def check_inference_health(deployment: str, slack_url: str):
result = subprocess.run(
["kubectl", "get", "deployment", deployment, "-o", "json"],
capture_output=True, text=True,
)
deploy = json.loads(result.stdout)
ready = deploy["status"].get("readyReplicas", 0)
desired = deploy["spec"]["replicas"]
if ready < desired:
requests.post(slack_url, json={
"text": f"CoreWeave: {deployment} has {ready}/{desired} replicas ready"
})
For performance optimization, see coreweave-performance-tuning.
npx claudepluginhub jeremylongshore/claude-code-plugins-plus-skills --plugin coreweave-packSets up GPU monitoring for CoreWeave Kubernetes clusters using DCGM exporter metrics and Prometheus alerts for utilization, memory usage, temperature, and inference pod health.
Collects Vast.ai GPU instance metrics (utilization, costs, status) via CLI, logs to JSONL, and checks alerts for idle GPUs or high temps. Use for cost tracking and observability dashboards.
Creates, edits, and optimizes skills for Claude Code, including drafting, evaluating with test prompts, iterating on performance, and improving skill descriptions for better triggering accuracy.