By AndonMitev
Self-evolving autonomous loops for AI coding agents (Claude Code & Codex). A tick is a pure function (state + new data) -> (action + new state + one record); the substrate is the loop's memory, the profile is the only thing that changes by type of work (research / experiment / build / maintenance).
AI-first authoring of a NEW agent skill, autonomously (no human review). Used by self-evolve when a manual step recurs across loop cycles — writes a correctly-structured SKILL.md with frozen, checkable, AI-first instructions, places it in the skills dir, validates it, and wires it into the flow so the loop invokes it next time. Use when the loop identifies a proven recurring pattern worth turning into a reusable capability.
Engineering standards every code-writing tick must follow so the loop produces GOOD code, not just working code. Invoked by build/maintenance ticks before and while editing code. Composes with (never overrides) the host project's CLAUDE.md / AGENTS.md.
Guides systematic root-cause debugging. Use when tests fail, builds break, behavior doesn't match expectations, or you encounter any unexpected error. Use when you need a systematic approach to finding and fixing the root cause rather than guessing.
AI-first investigative research using the orchestrator-worker pattern — decompose a question, fan out parallel subagent searches across multiple source angles, triangulate across independent sources, adversarially refute, synthesize a CITED answer with confidence, and loop until saturation. Use for open questions needing a well-supported answer. (For validating a hypothesis against data, use the experiment profile instead.)
Subjects every non-trivial decision to a fresh-context adversarial review before it stands. Use when correctness matters more than speed, when working in unfamiliar code, when stakes are high (production, security-sensitive logic, irreversible operations), or any time a confident output would be cheaper to verify now than to debug later.
Own this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimOwn this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimBased on adoption, maintenance, documentation, and repository signals. Not a security audit or endorsement.
Give an AI agent a one-line goal. It runs an autonomous loop that works toward it, verifies its own output, journals, and fires its own next tick — AI-first, token-light, self-evolving.
Self-evolving autonomous loops for AI coding agents (Claude Code & Codex). You set direction; the loop runs itself — observe → act → verify → journal → re-evolve its next directive, with no user in the inner loop. The engine is plain Python + JSON + markdown instructions: no API key, no build step, no infra — it rides whichever agent you already use. (~1.4k always-on tokens for all 11 skills; the heavy thinking is pay-per-use.)
A tick is a pure function:
(state.json + new data) → (action + new state + one log record)
Each tick is a stateless agent. The substrate (state.json + log.jsonl) is the loop's entire memory between
ticks, so token cost per tick is bounded no matter how long history grows. The loop "evolves" because every tick
rewrites state.next — the directive the next tick reads first.
Only one thing changes by type of work — a profile. The loop primitive and substrate are identical for all:
| Profile | For | Gate ("what good means") |
|---|---|---|
| research | investigate an open question | every claim triangulated across ≥2 independent sources, refuted, cited, confidence-scored |
| experiment | validate an idea vs a slow/noisy signal | pre-registered bar, frozen before peeking; honest null = success |
| build | construct from a milestone DAG | acceptance met and a verifier separate from the builder confirms |
| maintenance | keep an existing app healthy | full suite green before and after; no regression; surgical edits |
/plugin marketplace add AndonMitev/agent-loop
/plugin install agent-loop@agent-loop
Then in any project: /spawn-loop "<your goal>". On first run the helper is copied into that project's .loop/.
/plugin marketplace add AndonMitev/agent-loop
/plugin install agent-loop@agent-loop
/spawn-loop "keep this repo green and tidy"
/spawn-loop classifies the goal, seeds .loop/<id>/, runs the first tick, and schedules the next — you don't
click through iterations. You'll see a loop appear:
$ python3 .loop/loop.py list
repo-health [maintenance] event cyc= 1 preg=0 todo=0 next: sleep until a health signal fires …
Inspect it with python3 .loop/loop.py status repo-health; remove it with python3 .loop/loop.py rm repo-health.
# 1. Spawn. One command: classifies the goal, seeds .loop/, runs the first tick, schedules the next.
/spawn-loop "keep my repo green: tests pass and types check"
# → creates loop 'repo-green' [maintenance], runs tick 1, self-schedules
# 2. See all your loops.
python3 .loop/loop.py list
# repo-green [maintenance] event cyc=1 preg=0 todo=0 next: sleep until a health signal fires …
# 3. Inspect one loop — goal, gate, last reading, what it'll do next.
python3 .loop/loop.py status repo-green
# 4. Nudge it manually (it ticks itself; use this only to debug or push it along).
/loop-tick repo-green
# 5. Delete it when you're done.
python3 .loop/loop.py rm repo-green
More worked examples — one per profile (research / experiment / build / maintenance) — in EXAMPLES.md.
The loop does not rely on you to drive each step — you give a goal and approve hard gates; it does the rest. It is
built to behave like an autonomous engineer, not a script. Every tick lives by a Prime directive (the first
thing loop-tick reads, overriding everything else):
/deep-research, /grill-ai,
/plan-decompose, /test-driven-development, …) the loop calls the skill — judgment-gated (only when it
earns its tokens), never paraphrased or simulated./grill-ai adversarial self-critique before the call is committed. The loop argues with itself so you
don't have to.npx claudepluginhub andonmitev/agent-loop --plugin agent-loopComprehensive skill pack with 66 specialized skills for full-stack developers: 12 language experts (Python, TypeScript, Go, Rust, C++, Swift, Kotlin, C#, PHP, Java, SQL, JavaScript), 10 backend frameworks, 6 frontend/mobile, plus infrastructure, DevOps, security, and testing. Features progressive disclosure architecture for 50% faster loading.
Intelligent draw.io diagramming plugin with AI-powered diagram generation, multi-platform embedding (GitHub, Confluence, Azure DevOps, Notion, Teams, Harness), conditional formatting, live data binding, and MCP server integration for programmatic diagram creation and management.
Persistent file-based planning for AI coding agents. Crash-proof markdown plans (task_plan.md, findings.md, progress.md) that survive context loss and /clear, with an opt-in completion gate and multi-agent shared state. Manus-style. Works with Claude Code, Codex CLI, Cursor, Kiro, OpenCode and 60+ agents via the SKILL.md standard. Includes Arabic, German, Spanish, and Chinese (Simplified and Traditional).
Feature development with code-architect/explorer/reviewer agents, CLAUDE.md audit and session learnings, and Agent Skills creation with eval benchmarking from Anthropic.
Production-grade engineering skills for AI coding agents — covering the full software development lifecycle from spec to ship.
Permanent coding companion for Claude Code — survives any update. MCP-based terminal pet with ASCII art, stats, reactions, and personality.