By futuretea
FlashDuty assistant for incident management, diagnosis, team collaboration, and SRE best practices (Community)
深入诊断特定的 FlashDuty 事件。当你需要彻底调查单个事件,包括其时间线、关联告警和根因分析时,应使用此 Agent。
按照 SRE 最佳实践追踪和分析服务的错误预算消耗。计算可靠性指标、追踪 SLO 合规性或基于错误预算状态规划发布时使用此 Agent。
用于跨越多个事件或需要复杂过滤的查询的通用事件分析。此 Agent 处理不适合其他 Agent 特定范围的高级查询。
按照 Google SRE 最佳实践为事件生成无责备的事后分析报告。当事件需要记录、分析系统性改进或需要无责备回顾时,应使用此 Agent。
收集和分析特定时间范围、频道或维度的 FlashDuty 事件统计。当你需要收集事件的统计数据时,应使用此 Agent。
This skill should be used when the user asks to "统计故障", "MTTR", "MTTA", "故障趋势", "故障报告", "incident trend", "incident metrics", "SLO report", "运营指标", "可靠性分析", or discusses incident statistical analysis and operational metrics (事件统计分析和运营指标).
This skill should be used when the user asks to "diagnose incident", "root cause", "incident timeline", "check alerts", "诊断事件", "查找根因", "事件时间线", "what happened", "发生了什么", "problem diagnosis", or discusses incident analysis and problem diagnosis (事件分析和问题诊断).
This skill should be used when the user asks to "create incident", "acknowledge incident", "resolve incident", "close incident", "snooze incident", "query incidents", "list incidents", "创建事件", "确认事件", "关闭事件", "查询事件", or discusses incident lifecycle operations (事件生命周期管理).
This skill should be used when the user asks to "postmortem", "事后分析", "error budget", "错误预算", "SLO", "SLI", "toil", "琐事", "reliability", "可靠性", "blameless", "无责备", or discusses SRE best practices and service reliability (SRE 最佳实践和服务可靠性).
This skill should be used when the user asks to "assign incident", "who is on call", "oncall schedule", "escalation rules", "find team member", "指派", "值班", "团队成员", "升级规则", "协作空间", or discusses incident assignment and team coordination (事件指派和团队协作).
Own this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimOwn this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimBased on adoption, maintenance, documentation, and repository signals. Not a security audit or endorsement.
一套用于 Claude Code 的 FlashDuty 集成技能,采用 Sub-Agent + Skill 架构,支持并行处理和独立上下文。
┌─────────────────────────────────────────────────────────────┐
│ Skill Layer (触发器) │
│ - 识别用户意图 │
│ - 决定调用哪个 Sub-Agent │
│ - 汇总结果 │
└─────────────────────────────────────────────────────────────┘
│
┌───────────────────┼───────────────────┐
▼ ▼ ▼
┌──────────────┐ ┌──────────────┐ ┌──────────────┐
│ Sub-Agent 1 │ │ Sub-Agent 2 │ │ Sub-Agent N │
│ (独立上下文) │ │ (独立上下文) │ │ (独立上下文) │
└──────────────┘ └──────────────┘ └──────────────┘
/plugin marketplace add futuretea/flashduty-assistant
/plugin install flashduty-assistant@flashduty-assistant
/plugin list
安装成功后,你应该能看到 flashduty-assistant 在已安装插件列表中。
触发词: "create incident", "acknowledge incident", "close incident", "query incidents"
执行方式:
flashduty-incident-analyzer触发词: "diagnose incident", "investigate incident", "analyze incident", "find root cause"
执行方式:
flashduty-diagnosis-engine触发词: "assign incident", "find team member", "query teams", "who is on call"
执行方式:
flashduty-team-resolver触发词: "统计故障", "incident statistics", "故障分析", "MTTR", "MTTA", "故障趋势"
执行方式:
flashduty-stats-collector 或 flashduty-incident-analyzer触发词: "postmortem", "事后分析", "error budget", "错误预算", "SLO", "SLI", "toil", "琐事", "reliability", "可靠性"
执行方式:
flashduty-postmortem-generator 生成无责备分析报告(5 Whys 根因分析)flashduty-error-budget-tracker 追踪 SLO 合规性、计算燃烧率flashduty-toil-analyzer 识别自动化机会、生成路线图flashduty-assistant/
├── .claude/
│ └── settings.local.json # 工具权限配置
├── .claude-plugin/
│ ├── marketplace.json # 插件市场元数据
│ └── plugin.json # 插件元数据
├── agents/ # Sub-Agent 定义
│ ├── stats-collector.md # 统计收集 Agent
│ ├── diagnosis-engine.md # 诊断引擎 Agent
│ ├── team-resolver.md # 团队解析 Agent
│ ├── incident-analyzer.md # 通用分析 Agent
│ ├── postmortem-generator.md # 事后分析 Agent (SRE)
│ ├── error-budget-tracker.md # 错误预算 Agent (SRE)
│ └── toil-analyzer.md # 琐事分析 Agent (SRE)
├── skills/ # Skill 触发器
│ ├── incident-management/SKILL.md
│ ├── incident-diagnosis/SKILL.md
│ ├── incident-analytics/SKILL.md
│ ├── team-collaboration/SKILL.md
│ └── sre-practices/SKILL.md # SRE 最佳实践
├── .gitignore
├── CLAUDE.md
├── LICENSE
└── README.md
# 创建事件
"创建一个严重级别的事件,标题是数据库连接失败"
# 确认事件
"确认事件 FD123456"
# 关闭事件
"关闭事件 FD123456,根因是连接池耗尽"
# 单事件诊断
"诊断事件 FD123456"
→ 启动 diagnosis-engine
→ 并行获取详情/时间线/告警
→ 生成诊断报告
# 多事件对比诊断
"对比事件 A、B、C 的根因"
→ 并行启动 3 个 diagnosis-engine
→ 每引擎分析一个事件
→ 汇总对比结果
# 单维度统计
"统计本周故障"
→ 启动 stats-collector
→ 使用 time_range: "7d"
→ 返回统计数据
# 多维度并行分析
"全面分析本周故障"
→ 并行启动:
Agent 1: Severity 分布分析 (time_range: "7d")
Agent 2: Channel 分布分析 (time_range: "7d")
Agent 3: MTTR/MTTA 计算 (time_range: "7d")
Agent 4: 趋势分析 (time_range: "30d")
→ 汇总生成综合报告
# 多时间段对比
"过去3个月的趋势"
→ 并行启动:
Agent 1: 月份1统计 (time_range: "30d")
Agent 2: 月份2统计 (time_range: "30d")
Agent 3: 月份3统计 (time_range: "30d")
→ 合并趋势数据
# 查找成员
"查找名叫John的团队成员"
→ 启动 team-resolver
# 多团队值班查询
"哪些团队有人在值班?"
→ 并行查询各团队 on-call
# 事件指派建议
"事件 FD123456 应该指派给谁?"
→ 并行获取:事件详情 + 值班表
→ 推荐最佳指派对象
# 事后分析(无责备文化)
"生成事件 FD123456 的事后分析报告"
- 启动 flashduty-postmortem-generator
→ 包含:时间线、5 Whys 根因分析、行动项、经验教训
→ 确保无责备语言,聚焦系统性改进
# 错误预算追踪
"查看支付服务的错误预算"
→ 启动 flashduty-error-budget-tracker
→ 返回:已消耗 %、燃烧率、发布建议
→ "✅ 安全发布" 或 "⚠️ 建议暂停功能发布"
# 多服务 SLO 对比
"对比所有核心服务的可靠性"
→ 并行启动 flashduty-error-budget-tracker × N
→ 生成 SLO 合规性仪表板
→ 标识风险服务(预算消耗 > 50%)
# 琐事分析
"分析我们团队的琐事工作量"
→ 启动 flashduty-toil-analyzer
→ 识别:告警噪音、重复性问题、手动操作
→ 返回:琐事评分、自动化路线图、ROI 分析
npx claudepluginhub futuretea/flashduty-assistant --plugin flashduty-assistantSolutions Engineer daily workflow skills — open source analysis, case management, RFP response, discovery prep, and demo building. Designed for the OSS + customization delivery model.
Rancher assistant for multi-cluster Kubernetes management, troubleshooting, capacity analysis, and resource discovery (Community)
AlibabaCloud DevOps (Yunxiao) assistant for project management, code management, pipeline operations, and application delivery (Community)
Comprehensive skill pack with 66 specialized skills for full-stack developers: 12 language experts (Python, TypeScript, Go, Rust, C++, Swift, Kotlin, C#, PHP, Java, SQL, JavaScript), 10 backend frameworks, 6 frontend/mobile, plus infrastructure, DevOps, security, and testing. Features progressive disclosure architecture for 50% faster loading.
Upstash Context7 MCP server for up-to-date documentation lookup. Pull version-specific documentation and code examples directly from source repositories into your LLM context.
Comprehensive startup business analysis with market sizing (TAM/SAM/SOM), financial modeling, team planning, and strategic research
v9.44.1 — Patch release for Gemini environment/version detection and qwen auth gating. Run /octo:setup.
Complete creative writing suite with 10 specialized agents covering the full writing process: research gathering, character development, story architecture, world-building, dialogue coaching, editing/review, outlining, content strategy, believability auditing, and prose style/voice analysis. Includes genre-specific guides, templates, and quality checklists.
Comprehensive .NET development skills for modern C#, ASP.NET, MAUI, Blazor, Aspire, EF Core, Native AOT, testing, security, performance optimization, CI/CD, and cloud-native applications