By asragab
Optimize any text artifact using gepa — prompts, code, configs, skills
Discover quality dimensions for an artifact and objective
Recommend an iteration budget based on artifact complexity
Side-by-side comparison of two artifacts using composed score calls
Show the optimization plan for given inputs
Normalize and validate an intake specification
Complete runnable evaluator templates for prompt, code, documentation, and agent-instructions artifacts
Create or write an evaluator script for scoring text artifacts, prompts, or configs during gepa optimization. Use when asked to build, scaffold, or generate an evaluator, scoring function, or judge for optimize-anything.
Guide for running, configuring, and interpreting `optimize-anything` and `gepa` optimization workflows. Use when asked how to optimize a prompt, artifact, config, or skill, or when troubleshooting evaluator feedback, budget, or score interpretation.
Own this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimOwn this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimBased on adoption, maintenance, documentation, and repository signals. Not a security audit or endorsement.
LLM-guided optimization for text artifacts using an iterative propose-evaluate-reflect loop with a bring-your-own evaluator.
# 1) Install
curl -fsSL https://raw.githubusercontent.com/ASRagab/optimize-anything/main/install.sh | bash
# 2) Create a seed artifact
echo "Write a concise support prompt" > seed.txt
# 3) Generate a starter evaluator (default: judge/python template)
optimize-anything generate-evaluator seed.txt \
--objective "Score clarity, actionability, and specificity" \
> eval.py
# 4) Optimize
optimize-anything optimize seed.txt \
--judge-model openai/gpt-4o-mini \
--objective "Improve clarity and specificity" \
--model openai/gpt-4o-mini \
--budget 20 \
--parallel --workers 4 \
--cache \
--run-dir runs \
--output result.txt
CLI stdout returns a JSON summary — see Result Contract for the full shape.
optimize-anything runs a GEPA (Guided Evolutionary Prompt Algorithm) loop: propose → evaluate → reflect, repeating until budget is exhausted or early stopping kicks in.
seed.txt ──► [Propose] ──► candidates
▲ │
│ [Evaluate]
[Reflect] ◄──── scores + diagnostics
--judge-model).The evaluator is the only thing you bring. Everything else — proposal strategy, reflection, early stopping, caching, parallelism — is handled by the optimizer.
Use --dataset for multi-task optimization (one evaluator call per example). Add --valset for generalization validation.
optimize-anything optimize prompt.txt \
--judge-model openai/gpt-4o-mini \
--objective "Generalize across customer request types" \
--dataset data/train.jsonl \
--valset data/val.jsonl \
--model openai/gpt-4o-mini \
--budget 120 --parallel --workers 6 --cache --run-dir runs
Cross-check one artifact with multiple judge providers:
optimize-anything validate result.txt \
--providers openai/gpt-4o-mini anthropic/claude-sonnet-4-5 google/gemini-2.0-flash \
--objective "Score clarity, constraints, and robustness" \
--intake-file intake.json
No seed file required; GEPA bootstraps from objective.
optimize-anything optimize --no-seed \
--objective "Draft a concise, testable API prompt" \
--model openai/gpt-4o-mini \
--judge-model openai/gpt-4o-mini
--no-seed requires both --objective and --model.
--budget > 30 (or force with --early-stop)--cache-from (requires --cache + --run-dir)optimize-anything optimize seed.txt \
--evaluator-command bash eval.sh \
--model openai/gpt-4o-mini \
--budget 150 \
--cache --cache-from runs/run-20260303-120000 \
--run-dir runs \
--early-stop --early-stop-window 12 --early-stop-threshold 0.003
For command/HTTP evaluators:
--score-range unit (default): enforce score in [0, 1]--score-range any: allow any finite floatoptimize-anything optimize seed.txt \
--evaluator-command bash eval.sh \
--model openai/gpt-4o-mini \
--score-range any
optimizegenerate-evaluatorintakeexplainbudgetscoreanalyzevalidateoptimize-anything is also a Claude Code plugin with guided slash commands and skills.
Use the regression harness when you want to verify that Claude can actually invoke the plugin correctly end-to-end, not merely that the CLI itself still works.
# Direct plugin regression run
uv run python scripts/plugin_regression.py
# Full repo validation including plugin regression
uv run python scripts/check.py --with-plugin
Requirements:
claude CLI installed and authenticatedOPENAI_API_KEY set in the shell that launches the commandANTHROPIC_API_KEY set in the shell that launches the commandThe harness runs three real scenarios (analyze, validate, quick), saves Claude JSON outputs plus stderr logs, and fails if Claude does not execute the expected workflow or the optimized artifact is not written.
npx claudepluginhub asragab/optimize-anythingAutomated skill/prompt/tool evaluation and improvement using session log analysis, signal classification, LLM-as-judge target identification, and autoresearch-style optimization loops
Cross-agent session search, context loading, token analytics, session export, remote sources, and learning synthesis powered by CASS (Coding Agent Session Search)
Iterative artifact refinement - hone any artifact or workspace over multiple rounds using criteria-driven judge feedback, runnable evaluators, and focused directional improvements
Analyze and optimize AI prompts for better results
Testany AI/LLM 工具集:Prompt 优化
Autonomous experiment loop that optimizes any file by a measurable metric. 5 slash commands, 8 evaluators, configurable loop intervals (10min to monthly).
Comprehensive UI/UX design plugin for mobile (iOS, Android, React Native) and web applications with design systems, accessibility, and modern patterns
Standalone image generation plugin using Nano Banana MCP server. Generates and edits images, icons, diagrams, patterns, and visual assets via Gemini image models. No Gemini CLI dependency required.