By NTT123
Iterative hill-climbing skills: scaffold a project, run execute → verify cycles, brainstorm when stuck, autopilot the loop. State lives in a self-rendering state.html dashboard.
Refresh the idea pool of a hill-climbing project. Reads the run history and current ideas, detects whether progress has stalled, then generates new candidate directions, biased toward exploration when stuck, toward exploitation when a recent run improved things. Use this skill when the user is out of open ideas, the last several runs failed to improve `best`, the user explicitly asks for new ideas, or says "run the hillclimb-brainstorm skill", "I'm stuck", "what else could we try", "give me more directions".
Execute one iteration of a hill-climbing project: pick the next idea from the pool, plan it, do the work, log a run. Use this skill whenever the user has an active project with `.hillclimb/state.html` and wants to make progress. Trigger on phrases like "run the hillclimb-execute skill", "run the next iteration", "try the next idea", "do another pass", "execute one more", or simply "iterate" when a hill-climbing project is already set up. Does NOT run the verifier; that's the `hillclimb-verify` skill's job.
Run the hill-climbing loop autonomously: spawn subagents that invoke the `hillclimb-execute` → `hillclimb-verify` → (when stuck) `hillclimb-brainstorm` skills, iteration after iteration, until a stop condition fires. Uses git for per-iteration checkpoints with automatic rollback on failed verifications. Requires `.hillclimb/` to exist (run the `hillclimb-onboard` skill first); on a dirty git tree, asks how to handle the pending changes (commit / stash / abort) before starting.
Interactive onboarding for a hill-climbing optimization project. Use this skill whenever the user wants to start a long, iterative effort to optimize a measurable score, minimize a loss, maximize accuracy, or beat a perf target. Trigger on phrases like "set up a hill-climbing project", "run the hillclimb-onboard skill", "start an iterative project", "let's iterate on X with a dashboard", or when the user describes a problem whose solution requires many rounds of try-verify-revise. This skill walks the user through one question at a time, fills the gaps with a research subagent, and produces .hillclimb/state.html (a self-rendering dashboard) plus a verify.sh script template.
Verify the most recent execution run in a hill-climbing project. Reads `.hillclimb/state.html`, runs the project's verifier script, and updates the dashboard with the result. Use this skill whenever the user finishes an `hillclimb-execute` and wants to know if it improved things. Trigger on "run the hillclimb-verify skill", "verify the last run", "evaluate", "check the result", or after any `hillclimb-execute` completes. Updates the dashboard's `best`, marks the originating idea done/abandoned/open, and reports back.
Own this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimOwn this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimBased on adoption, maintenance, documentation, and repository signals. Not a security audit or endorsement.
Claude Code skills for problems that need many rounds of try → verify →
revise: optimizing a measurable score or chasing a perf target. State
lives in a state.html file you can open in any browser.
| Skill | What it does |
|---|---|
hillclimb skill | End-to-end: onboard if needed, then hand off to the autonomous loop. |
hillclimb-onboard skill | Interactive setup: objective, verifier, baseline, initial ideas. |
hillclimb-execute skill | Pick the top-priority idea, plan + do the work, log a run. |
hillclimb-verify skill | Run the verifier, update best, mark idea done/abandoned/open. |
hillclimb-brainstorm skill | Diagnose the search and add new ideas when stuck. |
hillclimb-loop skill | Autopilot: subagents run execute → verify → brainstorm, with git checkpoints. |
For a fresh project, run hillclimb skill. To drive the cadence by
hand, alternate hillclimb-execute skill and hillclimb-verify skill,
calling hillclimb-brainstorm skill when stuck.
This repo is a Claude Code plugin. Inside Claude Code:
/plugin marketplace add NTT123/hill-climbing-skills
/plugin install hillclimb@hill-climbing-skills
To update: /plugin update hillclimb@hill-climbing-skills. To remove:
/plugin uninstall hillclimb@hill-climbing-skills.
You already have kernel.py (a working fp32 causal self-attention
kernel running at 3.6 TFLOP/s) and check.py (times the kernel and
checks correctness against torch SDPA). You want to beat SDPA
(~31 TFLOP/s) on RTX 5090. In Claude Code:
Run the
hillclimbskill: set up hill-climbing to maximize TFLOP/s onkernel.py. Baseline 3.6 TFLOP/s; target beats torch SDPA on B=1 H=32 S=4096 D=128. Verify withcheck.py; it already prints a TFLOP/s number.
Onboarding asks a few questions (objective, direction, target, baseline,
≥ 3 initial ideas like FlashAttention-style tiling, Triton port, TF32
mma.sync), drops a verify.sh template you point at check.py, and
hands off to the autopilot. The dashboard tracks a trajectory like:
R-001 ✓ pass 14.12 TFLOP/s FlashAttention-2 tiled
R-002 ✓ pass 16.72 TFLOP/s TF32 mma.sync on QK^T
R-003 ✓ pass 24.35 TFLOP/s Triton port, num_warps=4
R-004 ✓ pass 33.02 TFLOP/s ▲ BLOCK_M=32 sweep, beats SDPA
Open .hillclimb/state.html for the full picture: the score chart,
the idea pool, and each run's plan and summary.
<your-project>/
└── .hillclimb/
├── state.html # the dashboard, open in any browser (gitignored)
├── state.py # JSON-island read/write CLI
├── run_with_timeout.py # per-command time wrapper (used by execute/verify)
├── verify.sh # the verifier, emits {status,score,notes} JSON
└── .gitignore
state.html is gitignored so git checkout / git reset can never
destroy run history; the dashboard lives only on disk.
verify.sh is a shell script that emits one JSON line on stdout:
{"status": "pass" | "fail" | "inconclusive", "score"?: <number>, "notes"?: "..."}
status is required; score and notes are optional. score is
required when status is pass for the chart and best to update.
status drives idea-status mapping (pass → done, fail → abandoned,
inconclusive → open).
Onboarding ships two starter templates at
skills/hillclimb-onboard/verify_templates/: metric.sh (compute one
number, optional threshold-gate) and custom.sh (empty starter).
hillclimb-loop skill records each pass commit's SHA on its run record. To
restore a run's code into the working tree (HEAD does not move):
python3 .hillclimb/state.py rollback-to .hillclimb/state.html R-005
# … inspect or re-run verify.sh …
git restore . # back to where you were
Works for every verified pass under both default greedy and lazy
(project.loop.greedy = false) modes; the pass commit is kept
reachable in either case.
Manual hillclimb-execute skill + hillclimb-verify skill runs (outside the
loop) don't auto-record commits, so rollback-to will say no commit recorded for <R-id>. To opt in: commit the working tree after each
verify and store the SHA via state.py set-commit.
To hack on the skills without going through the marketplace, clone the repo and add it as a local plugin source:
/plugin marketplace add /path/to/hill-climbing-skills
/plugin install hillclimb@hill-climbing-skills
Edits propagate after /plugin marketplace update hill-climbing-skills
(or restart Claude Code).
npx claudepluginhub ntt123/hill-climbing-skills --plugin hillclimbEditorial "Web Designer" bundle for Claude Code from Antigravity Awesome Skills.
Ultra-compressed communication mode. Cuts ~75% of tokens while keeping full technical accuracy by speaking like a caveman.
Frontend design skill for UI/UX implementation
Comprehensive UI/UX design plugin for mobile (iOS, Android, React Native) and web applications with design systems, accessibility, and modern patterns
Memory compression system for Claude Code - persist context across sessions
Marketing skills for AI agents — conversion optimization, copywriting, SEO, paid ads, ad creative, and growth