Complete DevOps/SRE toolkit: incident response, observability, reliability engineering, on-call management, and automation. The most comprehensive open-source DevOps plugin available.
Pre and post deployment checklist with automated verification and rollback preparation
Start structured incident response with automatic triage, communication, and resolution tracking
Generate comprehensive on-call handoff reports with active issues, context, and watch items
Generate blameless postmortem documents with timeline, root cause analysis, and action items
Execute runbook procedures step-by-step with safety checks and verification
Implement safe deployment strategies including rolling, blue-green, canary, and feature flags. Use this skill when planning deployments, reducing deployment risk, or implementing progressive delivery. Activate when: deployment strategy, rolling update, blue-green, canary deployment, feature flags, progressive delivery, zero downtime deployment, rollback, deployment risk.
Diagnose and fix common Kubernetes issues with systematic debugging approaches. Use this skill when troubleshooting K8s clusters, pods not starting, deployments failing, or networking issues. Activate when: kubernetes, k8s, pod, deployment, kubectl, container, crashloopbackoff, imagepullbackoff, pending pods, kubernetes networking, service not working, ingress issues.
Guide incident response as an Incident Commander with structured communication and coordination. Use this skill when there's an active incident, outage, service degradation, or production issue. Activate when: incident, outage, service down, production issue, SEV1, SEV2, pages, alerts firing, something broke, users complaining, error spike, latency spike.
Systematic root cause analysis using 5 Whys, fishbone diagrams, and fault tree analysis. Use this skill when investigating why an incident happened, performing RCA, or writing postmortems. Activate when: root cause, why did this happen, 5 whys, incident analysis, postmortem investigation, how did this happen, what caused, failure analysis.
Design effective alerting strategies that catch real issues without causing alert fatigue. Use this skill when setting up alerts, reducing noise, or improving on-call experience. Activate when: alerting, alerts, pagerduty, on-call, alert fatigue, too many alerts, missed alerts, monitoring thresholds, alert tuning.
Access AWS CloudWatch logs, metrics, and alarms
Interact with Kubernetes clusters - pods, deployments, services
Access repositories, PRs, issues, and deployments
Post incident updates and communicate with team
Query Prometheus metrics and alerts
Requires secrets
Needs API keys or credentials to function
Own this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimOwn this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimBased on adoption, maintenance, documentation, and repository signals. Not a security audit or endorsement.
67 professional skills + 7 full-featured plugins for AI coding agents — organized by audience so you can find exactly what you need.
Works with Claude Code, Claude Cowork, Cursor, Codex, Windsurf, and 35+ other AI agents.
# Cross-platform skills (works with any AI agent)
npx skills add latestaiagents/agent-skills --all
# Claude Code / Cowork plugins (with MCP integrations)
/plugin marketplace add latestaiagents/agent-skills
/plugin install devops-sre@latestaiagents-agent-skills
Works with Claude Code, Cursor, Codex, Windsurf, Cline, Aider, and 35+ other AI agents.
npx skills add latestaiagents/agent-skills --all
Full plugins with MCP tool integrations and slash commands. Works with Claude Code CLI and Claude Cowork desktop app.
# Step 1: Add our marketplace
/plugin marketplace add latestaiagents/agent-skills
# Step 2: Install plugins you need
/plugin install devops-sre@latestaiagents-agent-skills
/plugin install qa-testing@latestaiagents-agent-skills
/plugin install hr-people-ops@latestaiagents-agent-skills
| You Are | Skills CLI | Claude Plugin |
|---|---|---|
| Everyone | npx skills add latestaiagents/agent-skills/skills/safety --all | /plugin install safety@latestaiagents-agent-skills |
| Developer | npx skills add latestaiagents/agent-skills/skills/developer --all | /plugin install developer-toolkit@latestaiagents-agent-skills |
| DevOps/SRE | npx skills add latestaiagents/agent-skills/skills/mlops --all | /plugin install devops-sre@latestaiagents-agent-skills |
| RAG Engineer | npx skills add latestaiagents/agent-skills/skills/rag-architect --all | /plugin install rag-plugin@latestaiagents-agent-skills |
| Security Engineer | npx skills add latestaiagents/agent-skills/skills/security --all | /plugin install security-guardian@latestaiagents-agent-skills |
| QA/Testing | — | /plugin install qa-testing@latestaiagents-agent-skills |
| HR/People Ops | — | /plugin install hr-people-ops@latestaiagents-agent-skills |
Skills are instructions that teach AI agents how to handle specific tasks. After installation, your AI assistant automatically knows:
No special commands needed — skills activate automatically based on what you're doing.
Safety skills that prevent accidental data loss. Recommended for all users.
npx skills add latestaiagents/agent-skills/skills/safety --all
| Skill | What It Does |
|---|---|
destructive-operation-guard | Core safety protocols for all destructive operations |
migration-safety | Safe database migrations with backup requirements |
database-safety | Prevent accidental DELETE, DROP, TRUNCATE |
file-operation-safety | Protection against rm -rf and bulk deletions |
git-safety | Guard against force push, reset --hard, history loss |
Git workflows, code intelligence, and debugging tools — 19 skills total.
npx skills add latestaiagents/agent-skills/skills/developer --all
| Skill | What It Does |
|---|---|
merge-conflict-surgeon | Step-by-step conflict resolution with context analysis |
commit-message-crafter | Conventional commits that tell a story |
branch-strategy-advisor | GitFlow vs trunk-based — choose what fits |
git-history-detective | Find exactly when and where bugs were introduced |
rebase-safely | Interactive rebase without losing work |
git-undo-wizard | Recover from reset, rebase, and force push disasters |
| Skill | What It Does |
|---|---|
codebase-context-builder | Create CLAUDE.md and optimal context for AI |
ai-code-reviewer | Systematic review of AI-generated code |
refactor-with-ai | Safe, incremental refactoring workflows |
test-generation-patterns | AI-driven test creation that actually works |
debug-with-ai | Structured debugging: hypothesize → verify → fix |
doc-sync-automation | Keep docs updated when code changes |
code-explanation-generator | Clear explanations for complex code |
npx claudepluginhub latestaiagents/agent-skills --plugin devops-sre39 professional skills + 10 slash commands for AI coding agents — Git workflows, debugging, multi-agent architecture, LLMOps, and safety guardrails.
Complete RAG (Retrieval-Augmented Generation) toolkit: chunking strategies, hybrid retrieval, GraphRAG, Agentic RAG, Corrective RAG, evaluation, and production deployment patterns.
Complete application security toolkit: OWASP Top 10 protection, secure code review, vulnerability detection, secrets scanning, and security best practices for modern applications.
Complete HR and People Operations toolkit: job descriptions, recruiting, performance reviews, onboarding, policy management, and employee engagement for modern workplaces.
Complete multi-agent systems toolkit: LangGraph workflows, agent orchestration patterns, supervisor hierarchies, human-in-the-loop, A2A protocols, memory systems, and production deployment.
🚀 Site Reliability Engineer — Site Reliability Engineer + Platform Automation Specialist
Site Reliability Engineering discipline agent for reliability, monitoring, and incident response
Multi-agent orchestrator for Claude Code. Track work with convoys, sling to polecats. The Cognition Engine for AI-powered software factories.
Enterprise-grade AI Agent Skills for software development, DevOps, SRE, security, and product teams.
Set of DevOps skills for Claude Code.
DevsForge site reliability engineering specialist for building resilient and scalable systems