By willer
Evolutionary algorithm search inside Claude Code. Skills: evolve (omnibus run loop), evolve-ideate (Opus high ideation), evolve-code (codex-first coding, Sonnet judge/fallback), evolve-score (Haiku evaluation). Self-contained: bundles the CSV/sandbox engine (stdlib-only, no npm/pip) and runs the AI steps as Anthropic subagents.
Evolve worker for claude-evolve. Claims pending candidates from evolution.csv, codes each one codex-first (judging codex's diff, coding it itself as fallback), scores it with the workspace evaluator, and returns a terse summary line. Launched in the background by the evolve skill's re-spawn pool.
Ideation strategist for claude-evolve. Proposes new algorithm variants for one assigned strategy (novel exploration, hill climbing, structural mutation, or crossover) and returns a JSON array of ideas. Launched in parallel by the evolve-ideate skill — one per strategy.
Write the code for one evolution candidate. Resolves the candidate's parent algorithm, copies it to evolution_<id>.py, then implements the candidate's description from evolution.csv — codex (GPT-5.5) first, with you (Sonnet) judging the result and coding it yourself if codex falls short. Use when the user says "code gen02-003", "write the algorithm for this candidate", "implement idea <id>", or when a parent skill dispatches coding.
Run one generation of ideation for a claude-evolve workspace. Reads the top performers, BRIEF, and accumulated notes, then launches parallel Opus subagents at high effort — one per ideation strategy (novel exploration, hill climbing, structural mutation, crossover) — to propose new algorithm variants, and appends them as pending rows in evolution.csv. Use when the user says "ideate", "generate new ideas", "make the next generation", or when the omnibus evolve loop drains its pending queue. Run only ONE ideation at a time per workspace.
Score one evolution candidate against the workspace evaluator. Runs the candidate's evolution_<id>.py through evaluator.py under the same sandbox the claude-evolve engine uses, parses the performance number, and writes status + performance back to evolution.csv. Use when the user says "score gen02-003", "evaluate this candidate", "run the evaluator on <id>", or when a parent skill dispatches scoring. Scoring is deterministic (no model reasoning needed) — this skill exists so the evaluator's output noise stays out of the main conversation.
Run the full claude-evolve loop for a workspace — the omnibus. Drives evolution.csv through its cycle (code pending candidates, score them, ideate the next generation when the queue drains, repeat) as a self-respawning pool of background worker subagents, so the main conversation stays a clean dashboard. Use when the user says "run evolution", "evolve", "start the evolution run", "process the pending candidates", or wants the whole pipeline driven end to end. Equivalent to `claude-evolve run`: codex (GPT-5.5) codes each candidate first with the Sonnet worker judging the result and falling back to coding it itself, the evaluator scores, Opus ideates.
Uses power tools
Uses Bash, Write, or Edit tools
Own this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimOwn this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimBased on adoption, maintenance, documentation, and repository signals. Not a security audit or endorsement.
Automated algorithm evolution using AI. Start with a base algorithm, let Claude evolve better variants autonomously.
# Install
npm install -g claude-evolve
# Set up project
claude-evolve setup
# Generate initial ideas
claude-evolve ideate
# Start evolution (runs forever until Ctrl+C)
claude-evolve run
evolution/BRIEF.mdevolution/algorithm.pyevolution/evaluator.pyEvolution runs indefinitely until you stop it. Perfect for overnight optimization.
claude-evolve # Interactive menu
claude-evolve setup # Initialize workspace
claude-evolve ideate # Generate new algorithm ideas
claude-evolve run # Start evolution loop (runs forever)
claude-evolve analyze # View results and progress with charts
claude-evolve status # Quick progress overview
claude-evolve autostatus # Live auto-updating status display
claude-evolve edit # Manage candidate statuses
claude-evolve config # View/edit configuration
claude-evolve cleanup # Clean up old lock files
claude-evolve clean-invalid # Remove invalid candidates
claude-evolve cleanup-duplicates # Remove duplicate entries
claude-evolve csv-fix # Fix CSV formatting issues
# Use different working directory
claude-evolve --working-dir=my-project run
claude-evolve --working-dir=experiments/trading ideate
your-project/
├── evolution/
│ ├── BRIEF.md # Problem description
│ ├── algorithm.py # Base algorithm
│ ├── evaluator.py # Performance measurement
│ ├── config.yaml # Settings
│ ├── evolution.csv # Progress tracking
│ └── evolution_*.py # Generated variants
Your evaluator.py must output a performance score to stdout. The system supports multiple output formats:
# Just print a single number
print(1.234)
# JSON with 'performance' field (recommended)
print('{"performance": 1.234, "accuracy": 0.95, "latency": 45.2}')
# OR JSON with 'score' field
print('{"score": 1.234, "precision": 0.88, "recall": 0.92}')
# For backward compatibility
print("SCORE: 1.234")
Important notes:
performance or score field is required for evolution decisionsEdit evolution/config.yaml:
# Files
algorithm_file: "algorithm.py"
evaluator_file: "evaluator.py"
evolution_csv: "evolution.csv"
# Evolution strategy
ideation_strategies:
total_ideas: 15
novel_exploration: 3 # Creative new approaches
hill_climbing: 5 # Parameter tuning
structural_mutation: 3 # Architecture changes
crossover_hybrid: 4 # Combine best features
# Auto-generate new ideas when queue empty
auto_ideate: true
# Parallel execution
parallel:
enabled: false
max_workers: 4
evolution.csv for performance trendsToo many failures? Check your evaluator handles edge cases and outputs valid scores.
Stuck in local optimum? Increase novel_exploration in config.yaml or add manual ideas.
Evaluator crashes? Make sure dependencies are installed and error handling is robust.
MIT
npx claudepluginhub willer/claude-evolve --plugin claude-evolveIntelligent draw.io diagramming plugin with AI-powered diagram generation, multi-platform embedding (GitHub, Confluence, Azure DevOps, Notion, Teams, Harness), conditional formatting, live data binding, and MCP server integration for programmatic diagram creation and management.
Comprehensive skill pack with 66 specialized skills for full-stack developers: 12 language experts (Python, TypeScript, Go, Rust, C++, Swift, Kotlin, C#, PHP, Java, SQL, JavaScript), 10 backend frameworks, 6 frontend/mobile, plus infrastructure, DevOps, security, and testing. Features progressive disclosure architecture for 50% faster loading.
Persistent file-based planning for AI coding agents. Crash-proof markdown plans (task_plan.md, findings.md, progress.md) that survive context loss and /clear, with an opt-in completion gate and multi-agent shared state. Manus-style. Works with Claude Code, Codex CLI, Cursor, Kiro, OpenCode and 60+ agents via the SKILL.md standard. Includes Arabic, German, Spanish, and Chinese (Simplified and Traditional).
Access thousands of AI prompts and skills directly in your AI coding assistant. Search prompts, discover skills, save your own, and improve prompts with AI.
Complete developer toolkit for Claude Code
Feature development with code-architect/explorer/reviewer agents, CLAUDE.md audit and session learnings, and Agent Skills creation with eval benchmarking from Anthropic.