From ai-assistant-ops
Use when revising an existing skill from eval feedback, trigger misses, output regressions, bloat, overfitting, or repeated manual work across skill runs.
How this skill is triggered — by the user, by Claude, or both
Slash command
/ai-assistant-ops:improve-skillThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
Improve an existing skill through measured iterations: test, compare, collect
Improve an existing skill through measured iterations: test, compare, collect feedback, revise, rerun, repeat, then tune triggering.
Start from the user's target skill and current stage: test prompts, baseline runs, feedback review, rewriting, rerun comparison, or trigger checks. If the target skill is unclear, resolve it before editing.
Require test prompts before editing. If no evals exist, help create a small set of realistic prompts first. Use objective assertions only for observable behavior; keep subjective quality checks qualitative.
scripts/ tool
or resources file and point the skill to it.ai-assistant-ops:md-bloat-hunter on the changed
skill context before evaluation or rerun. Preserve tested behavior and
trigger coverage while removing redundancy and filler.Repeat until the user is satisfied, feedback is empty, or progress stalls. When progress stalls, report the pattern and ask whether to change the eval set, accept the current tradeoff, or stop.
Keep enough evidence to explain each iteration: target and snapshot paths, prompt set, old-skill and improved outputs, transcript notes, feedback, assertions, timing or token data, and what changed with why.
After the body stabilizes, test the description with should-trigger and should-not-trigger prompts, including near misses. Revise it only for missed triggers or false positives.
Keep the description trigger-only: symptoms and situations, not workflow. The body owns the process.
| Mistake | Correction |
|---|---|
| Editing before evals exist | Create prompts first. |
| Comparing different prompt sets | Use the same prompts. |
| Treating one complaint as a narrow patch | Generalize the feedback. |
| Adding more prose for every failure | Prefer lean wording, examples, scripts, or resources. |
| Stopping after body edits | Run trigger checks after stabilization. |
Creates, edits, and optimizes skills for Claude Code, including drafting, evaluating with test prompts, iterating on performance, and improving skill descriptions for better triggering accuracy.
npx claudepluginhub alex-kopylov/zweihander --plugin ai-assistant-ops