Search everything...

Stats

Actions

Available In

agent-loop

Name: agent-loop
Author: andonmitev

By AndonMitev

Self-evolving autonomous loops for AI coding agents (Claude Code & Codex). A tick is a pure function (state + new data) -> (action + new state + one record); the substrate is the loop's memory, the profile is the only thing that changes by type of work (research / experiment / build / maintenance).

npx claudepluginhub andonmitev/agent-loop --plugin agent-loop

Popularity

Stars

Med: 0·Avg: 285

Installs

Med: 0·Avg: 1

What's Inside

Skills11

author-skill

/author-skill

AI-first authoring of a NEW agent skill, autonomously (no human review). Used by self-evolve when a manual step recurs across loop cycles — writes a correctly-structured SKILL.md with frozen, checkable, AI-first instructions, places it in the skills dir, validates it, and wires it into the flow so the loop invokes it next time. Use when the loop identifies a proven recurring pattern worth turning into a reusable capability.

code-standards

/code-standards

Engineering standards every code-writing tick must follow so the loop produces GOOD code, not just working code. Invoked by build/maintenance ticks before and while editing code. Composes with (never overrides) the host project's CLAUDE.md / AGENTS.md.

debugging-and-error-recovery

/debugging-and-error-recovery

Guides systematic root-cause debugging. Use when tests fail, builds break, behavior doesn't match expectations, or you encounter any unexpected error. Use when you need a systematic approach to finding and fixing the root cause rather than guessing.

deep-research

/deep-research

AI-first investigative research using the orchestrator-worker pattern — decompose a question, fan out parallel subagent searches across multiple source angles, triangulate across independent sources, adversarially refute, synthesize a CITED answer with confidence, and loop until saturation. Use for open questions needing a well-supported answer. (For validating a hypothesis against data, use the experiment profile instead.)

doubt-driven-development

/doubt-driven-development

Subjects every non-trivial decision to a fresh-context adversarial review before it stands. Use when correctness matters more than speed, when working in unfamiliar code, when stakes are high (production, security-sensitive logic, irreversible operations), or any time a confident output would be cheaper to verify now than to debug later.

Hooks1

Event Hooks

1 hook across 1 event

Stats

Version0.1.0

LanguagePython

Stars0

MaintenanceExcellent

LicenseMIT

Last CommitJun 14, 2026

AddedJun 15, 2026

Actions

View on GitHub View README Plugin Marketplace JSON

Own this plugin?

Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).

Available In

agent-loop

README

agent-loop

Give an AI agent a one-line goal. It runs an autonomous loop that works toward it, verifies its own output, journals, and fires its own next tick — AI-first, token-light, self-evolving.

Self-evolving autonomous loops for AI coding agents (Claude Code & Codex). You set direction; the loop runs itself — observe → act → verify → journal → re-evolve its next directive, with no user in the inner loop. The engine is plain Python + JSON + markdown instructions: no API key, no build step, no infra — it rides whichever agent you already use. (~1.4k always-on tokens for all 11 skills; the heavy thinking is pay-per-use.)

The one idea

A tick is a pure function:

(state.json + new data) → (action + new state + one log record)

Each tick is a stateless agent. The substrate (state.json + log.jsonl) is the loop's entire memory between ticks, so token cost per tick is bounded no matter how long history grows. The loop "evolves" because every tick rewrites state.next — the directive the next tick reads first.

Only one thing changes by type of work — a profile. The loop primitive and substrate are identical for all:

Profile	For	Gate ("what good means")
research	investigate an open question	every claim triangulated across ≥2 independent sources, refuted, cited, confidence-scored
experiment	validate an idea vs a slow/noisy signal	pre-registered bar, frozen before peeking; honest null = success
build	construct from a milestone DAG	acceptance met and a verifier separate from the builder confirms
maintenance	keep an existing app healthy	full suite green before and after; no regression; surgical edits

Install (Claude Code plugin)

/plugin marketplace add AndonMitev/agent-loop
/plugin install agent-loop@agent-loop

Then in any project: /spawn-loop "<your goal>". On first run the helper is copied into that project's .loop/.

Quickstart (install → first output)

/plugin marketplace add AndonMitev/agent-loop
/plugin install agent-loop@agent-loop
/spawn-loop "keep this repo green and tidy"

/spawn-loop classifies the goal, seeds .loop/<id>/, runs the first tick, and schedules the next — you don't click through iterations. You'll see a loop appear:

$ python3 .loop/loop.py list
repo-health  [maintenance] event    cyc=  1  preg=0 todo=0  next: sleep until a health signal fires …

Inspect it with python3 .loop/loop.py status repo-health; remove it with python3 .loop/loop.py rm repo-health.

Use — a full walkthrough (real commands, real output)

# 1. Spawn. One command: classifies the goal, seeds .loop/, runs the first tick, schedules the next.
/spawn-loop "keep my repo green: tests pass and types check"
#   → creates loop 'repo-green' [maintenance], runs tick 1, self-schedules

# 2. See all your loops.
python3 .loop/loop.py list
#   repo-green  [maintenance] event  cyc=1  preg=0 todo=0  next: sleep until a health signal fires …

# 3. Inspect one loop — goal, gate, last reading, what it'll do next.
python3 .loop/loop.py status repo-green

# 4. Nudge it manually (it ticks itself; use this only to debug or push it along).
/loop-tick repo-green

# 5. Delete it when you're done.
python3 .loop/loop.py rm repo-green

More worked examples — one per profile (research / experiment / build / maintenance) — in EXAMPLES.md.

Autonomy (AI-first)

The loop does not rely on you to drive each step — you give a goal and approve hard gates; it does the rest. It is built to behave like an autonomous engineer, not a script. Every tick lives by a Prime directive (the first thing loop-tick reads, overriding everything else):

Decide + act, never passive. A tick never ends on a wait or a status report — it changes state/code/config, records a verdict, or makes an explicit, justified no-op. It always moves toward the goal.
Invoke skills, don't describe them. When the work calls for a capability (/deep-research, /grill-ai, /plan-decompose, /test-driven-development, …) the loop calls the skill — judgment-gated (only when it earns its tokens), never paraphrased or simulated.
Grill itself at every real distinction. Any kill/keep, promote, tombstone, anomaly, or surprising result fires /grill-ai adversarial self-critique before the call is committed. The loop argues with itself so you don't have to.

View full README on GitHub

agent-loop

Popularity

What's Inside

Confidence

README

agent-loop

The one idea

Install (Claude Code plugin)

Quickstart (install → first output)

Use — a full walkthrough (real commands, real output)

Autonomy (AI-first)

Similar Plugins

fullstack-dev-skills

drawio-diagramming

planning-with-files

anthropic-essentials

agent-skills

claude-buddy

agent-loop

The one idea

Install (Claude Code plugin)

Quickstart (install → first output)

Use — a full walkthrough (real commands, real output)

Autonomy (AI-first)

Popularity

Health & Quality

Similar Plugins

fullstack-dev-skills

drawio-diagramming

planning-with-files

anthropic-essentials

agent-skills

claude-buddy