From epic
Analyzes session observations to generate and improve evolved skills via subcommands: status, history, rollback, reset. Use for post-session review and skill optimization.
How this skill is triggered — by the user, by Claude, or both
Slash command
/epic:evolveThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
**CRITICAL**: Run `HARNESS_DIR=$(epic path)` first. NEVER use `.harness/` in the project directory.
CRITICAL: Run HARNESS_DIR=$(epic path) first. NEVER use .harness/ in the project directory.
You are the Evolution Engine — analyze past sessions to improve skills.
/evolve (default) — Run evolution now$HARNESS_DIR/obs/$HARNESS_DIR/evolved//evolve status — Show evolution dashboardRead $HARNESS_DIR/metrics.json and $HARNESS_DIR/evolution.jsonl, then display:
## Evolution Dashboard
### Overview
- Sessions analyzed: {total_sessions}
- Average success rate: {avg_success_rate}%
- Best score: {best_score} (session: {best_session})
- Trend: {trend} ({score_history.length} data points)
- Stagnation count: {stagnation_count} / 3 (rollback at 3)
### Score History (last 5 sessions)
| Session | Success Rate | Avg Score | Observations | Tool Success | Output Quality |
|---------|-------------|-----------|--------------|-------------|---------------|
### Evolved Skills
(list $HARNESS_DIR/evolved/*/SKILL.md with name and description from frontmatter)
### Last Session Analysis
(read last entry from evolution.jsonl)
- Error patterns: {error_patterns}
- Failure patterns: {failure_patterns[].pattern_type}
- Skills seeded: {skills_seeded}
- Skills rolled back: {skills_rolled_back}
- Analysis: {analysis_summary}
/evolve history — Long-term analysisRead $HARNESS_DIR/evolution.jsonl (full history), then display:
## Evolution History
### Trend Over Time
| Session # | Date | Success Rate | Avg Score | Skills | Patterns |
|-----------|------|-------------|-----------|--------|----------|
### Cumulative Pattern Frequency
| Pattern | Total Count | First Seen | Last Seen |
|---------|-------------|------------|-----------|
### Skill Effectiveness
| Skill | Sessions Active | Avg Score With | Avg Score Without | Delta |
|-------|----------------|----------------|-------------------|-------|
### Dispatch Analysis
| Skill | Times Invoked | Top Trigger Signals |
|-------|--------------|---------------------|
/evolve rollback — Undo last evolution$HARNESS_DIR/evolved_backup/ exists, restore it to $HARNESS_DIR/evolved/$HARNESS_DIR/evolution.jsonl for last entry, remove skills seeded in that entry/evolve reset — Clear all evolution data$HARNESS_DIR/evolved/, $HARNESS_DIR/evolved_backup/metrics.json and evolution.jsonlObserve (PostToolUse — multi-dimensional scoring)
↓ $HARNESS_DIR/obs/session_YYYYMMDD.jsonl
Analyze (Stop or /evolve)
↓ SessionAnalysis: per-tool, per-ext, score distribution
↓ Pattern detection: repeated_same_error, fix_then_break, long_debug_loop, thrashing
Seed (auto-generate targeted skills)
↓ 4 seeding paths: pattern / weak tool / weak file type / high-freq error
Gate (validate: format, dedup, cap of 10)
↓ Stagnation check: 3 sessions no improvement → rollback to best checkpoint
Reload (next session resume reports metrics + loads evolved skills)
0.5 × tool_success + 0.3 × output_quality + 0.2 × execution_costGuides creation, editing, and verification of skills for AI coding agents using test-driven development with subagent scenarios. Use when authoring or debugging skills.
npx claudepluginhub epicsagas/epic-harness --plugin epic-harness