Skill

evolve-code

Write the code for one evolution candidate. Resolves the candidate's parent algorithm, copies it to evolution_<id>.py, then implements the candidate's description from evolution.csv — codex (GPT-5.5) first, with you (Sonnet) judging the result and coding it yourself if codex falls short. Use when the user says "code gen02-003", "write the algorithm for this candidate", "implement idea <id>", or when a parent skill dispatches coding.

Popularity

Parent stars

Invocation

How this skill is triggered — by the user, by Claude, or both

Slash command

/claude-evolve:evolve-code <candidate-id> [--working-dir DIR]

User invocable

Model invocable

Inline context

Default effort

Argument hint<candidate-id> [--working-dir DIR]

Context Preview

The summary Claude sees in its skill listing — used to decide when to auto-load this skill

Implement one candidate's idea as working code. The candidate already exists in `evolution.csv` with a `description` (the idea) and a `basedOnId` (its parent). Your job: turn the description into a real, substantial code change on a copy of the parent algorithm.

SKILL.md

93 lines · ~1.5k tokens

Stats

LanguagePython

Parent stars1

MaintenanceGood

Last CommitJun 14, 2026

Actions

View Source View Plugin View on GitHub View README

Stats

Actions

evolve-code

Implement one candidate's idea as working code. The candidate already exists in evolution.csv with a description (the idea) and a basedOnId (its parent). Your job: turn the description into a real, substantial code change on a copy of the parent algorithm.

Coding is codex-first: codex (GPT-5.5) takes the first pass, and you (Sonnet) judge whether it did the job — coding the candidate yourself when it didn't. It needs genuine coding judgment either way. (The omnibus /evolve loop runs this same protocol via the plugin's claude-evolve:coder agent, which adds claim/score looping; this skill is the standalone one-candidate path.)

Resolve the plugin root

echo "PLUGIN_ROOT=$CLAUDE_PLUGIN_ROOT"

If a subagent invoked you without it, use the path the caller passed in the prompt.

Step 1 — Prepare the file (deterministic)

Run the prepare script. It resolves the parent, copies evolution_<parent>.py (or algorithm.py for a baseline-rooted idea) to evolution_<id>.py, and tells you what to implement:

python3 "$CLAUDE_PLUGIN_ROOT/scripts/prepare.py" --working-dir "<WORKING_DIR>" <CANDIDATE_ID>

It prints one JSON line:

{"id":"...","is_baseline":false,"description":"...","parent":"gen01-002",
 "target_path":"/abs/evolution_gen02-001.py","target_basename":"evolution_gen02-001.py",
 "already_exists":false}

Exit code 2 → the parent file could not be found. Do not invent a parent. Report <id>: BLOCKED — parent missing (<basedOnId>) and stop; the engine should mark it failed-parent-missing.
is_baseline: true → there is nothing to code (the baseline evaluates algorithm.py as-is). Report <id>: baseline — no code change needed and stop.
already_exists: true → the target file was already present. Do not re-copy. Treat the existing file as the starting point only if the user explicitly asked to redo it; otherwise report <id>: already coded and stop so it can go straight to scoring.

Step 2 — Implement the idea (your real work)

Code the candidate. Try codex (GPT-5.5) first; fall back to coding it yourself only if codex falls short.

Step 2a — codex first

python3 "$CLAUDE_PLUGIN_ROOT/scripts/code_with_codex.py" --working-dir "<WORKING_DIR>" <CANDIDATE_ID>

It runs codex on the prepared file (default workspace-write sandbox — codex reads anywhere but writes only inside the workspace; the NEVER-USE-GIT warning rides along in the prompt) and prints one JSON line:

{"ok":true,"changed":true,"compiles":true,"timed_out":false,"restored_parent":false,"summary":"...","diff":"..."}

Read summary and diff and judge for yourself whether codex actually implemented description, preserved the interface, and made a real behavioral change — not a no-op, rename, or off-description edit. The script reports only hard signals (ok = exit 0 + file changed + compiles); the semantic call is yours.

Good (ok:true and your judgment agrees): codex coded it. Record the model and skip to Step 3.

python3 "$CLAUDE_PLUGIN_ROOT/scripts/evolve_csv.py" --working-dir "<WORKING_DIR>" set-field <CANDIDATE_ID> run-LLM gpt-5.5

Not good (ok:false, codex left the file unchanged, or you reject the change): fall back to Step 2b. When ok:false the script has already restored the clean parent copy; when you're rejecting a compiling-but-wrong change, restore the parent yourself first (copy evolution_<parent>.py — or algorithm.py if baseline-rooted — over target_path) so you start clean.

Step 2b — code it yourself (fallback)

Open target_path and modify it to implement description. Requirements:

Make a substantial, on-description change. Don't just add comments or rename things. The change must actually do what the description says.
Preserve the interface. The evaluator calls the algorithm the same way for every candidate — keep the same entry points, function signatures, and I/O contract the parent had. Read the parent and, if needed, evaluator.py to confirm the contract before editing.
Read large files in chunks (offset/limit) to avoid context overload.
Refuse rather than guess. If the description is unclear, infeasible, or you don't know how to implement it correctly, do not make random changes. Leave the file as the unmodified parent copy and report <id>: REFUSED — <one-line reason>. A faithful refusal is better than a fabricated edit — the engine will record it and move on. (This honors claude-evolve's fail-loud principle: don't fake work.)

After editing, sanity-check the syntax:

python3 -m py_compile "<target_path>"

If it fails, fix it. If you can't, report <id>: REFUSED — could not produce valid syntax. Then record the model:

python3 "$CLAUDE_PLUGIN_ROOT/scripts/evolve_csv.py" --working-dir "<WORKING_DIR>" set-field <CANDIDATE_ID> run-LLM sonnet

Step 3 — Report

Return a single line:

coded → <id>: coded — <one-line summary of the change> (ready to score)
baseline / already-coded / blocked / refused → the matching line above.

Do not score the candidate here — that's evolve-score. Leave the candidate at status running (prepare/claim already set it); the parent skill or evolve-score takes it from here. If you were invoked standalone by the user and they want a score too, tell them to run evolve-score <id> (or run it yourself as a separate step).

evolve-code

Popularity

Invocation

Context Preview

SKILL.md

evolve-code

Popularity

Invocation

Context Preview

SKILL.md

evolve-code

Resolve the plugin root

Step 1 — Prepare the file (deterministic)

Step 2 — Implement the idea (your real work)

Step 2a — codex first

Step 2b — code it yourself (fallback)

Step 3 — Report

Similar Skills

evolve-code

Resolve the plugin root

Step 1 — Prepare the file (deterministic)

Step 2 — Implement the idea (your real work)

Step 2a — codex first

Step 2b — code it yourself (fallback)

Step 3 — Report

Similar Skills