By lkv1988
Collie-style autonomous development agent harness: rubric review + budget guardrails + loop detection + memory
Pre-flight environment check SKILL for the collie autoiter command. Called by the main autoiter SKILL (collie:autoiter) in the §3.5 post-ExitPlanMode recovery path, after worktree creation, before Stage 1 iteration begins. Runs 5 checks: (1) trigger dry-run, (2) scalar extraction validation, (3) observability validation (Monitor/Read-tail + kill signal), (4) persistent directory writability. Outputs prepare-report.md with PASS/FAIL evidence. Returns failure signal to caller on any FAIL — does NOT fix issues. Supports skip_prepare bypass and is idempotent (skips all checks if prepare-report.md already exists). Do NOT invoke directly from user prompts; this is an internal skill invoked exclusively by collie:autoiter.
Main loop orchestrator for collie. Drives the 'run → observe → triage → deep-verify → fix → rerun' self-iteration pipeline. Called by commands/autoiter.md on every ralph-loop session restart. Implements a persistent state machine (§3.5) across ralph-loop session restarts: fresh-start → Stage 0 (Discovery planmode) → Stage 1 (kickoff) → Stage 2 (run trigger) → Stage 3 (observe + auto-recovery) → Stage 4a (Triage) → Stage 4b (Deep Verify) → Stage 5.0 (fix-plan) → Stage 5.1 (flow) → Stage 5.2 (G6 diff audit + rerun) → Stage 6 (rollback + stop check). Completion signal: <promise>Collie: AUTOITER DONE</promise> (emitted ONLY by the §3.5 terminal branch after ralph-loop restart, NEVER inline).
Post-planmode implementation workflow with quality gates. Use immediately after exiting planmode to execute an approved plan.
Boids-inspired memory management for AI agents. Evaluates conversation content against a fixed decision tree to decide what to remember, where to store it, and when to consolidate. Activate on Stop hook and PreCompact hook for automatic memory capture. Also activate when the agent detects information during conversation that might change future behavior — user corrections, preferences, role info, project constraints, or external system references. Do NOT activate for knowledge questions, in-session operations, or debug conclusions.
Collie-style unified rubric reviewer. Enforces 13 red lines + 6 questions + ELEPHANT anti-sycophancy + Reflexion grounding. Internally dispatches Agent(model=opus) for isolation. Use in three contexts: (1) Plan mode — Target is a plan doc matching *-plan.md or under plans/; called in parallel with plan-doc-reviewer at /auto step ③ before ExitPlanMode. (2) Code mode — Target is a worktree diff or branch; called as flow TodoList item [collie-final-review] (Step 5.7) before worktree cleanup. (3) Ad-hoc — any file, diff, design doc, or subagent output needing Collie-style review.
Modifies files
Hook triggers on file write and edit operations
Uses power tools
Uses Bash, Write, or Edit tools
Own this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimOwn this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimBased on adoption, maintenance, documentation, and repository signals. Not a security audit or endorsement.

Claude Code plugin that enforces a complete development workflow — brainstorm, plan, dual-review, gated implementation, rubric code review — with hooks that make every step non-optional.
Five capabilities working together:
| Capability | Entry point | What it does |
|---|---|---|
| auto | /collie:auto | End-to-end feature development: research → brainstorm → plan → review → implement → ship |
| flow | collie:flow | Post-planmode gated implementation with worktree isolation, TDD, code review, and pre-merge rubric gate |
| review | collie:review | 13 red-line rubric + 6 design questions + ELEPHANT anti-sycophancy check + Reflexion grounding |
| autoiter | /collie:autoiter | Self-iterating fix loop: run → observe → triage → deep-verify → fix → rerun, with overfit guards |
| memory | collie:memory | Boids-inspired memory system: decision tree evaluates what to remember, consolidation merges over time |
/plugin install superpowers@claude-plugins-official
/plugin install ralph-loop@claude-plugins-official
Verify: /plugin list should show both superpowers and ralph-loop.
claude plugin marketplace add lkv1988/collie-harness
claude plugin install collie@collie-marketplace
Add to ~/.claude/settings.json:
"permissions": { "defaultMode": "acceptEdits" }
Without this, the autonomous execution in Layer 0 won't work.
/collie:auto "add retry mechanism to the foo module"
Completion signal: <promise>Collie: SHIP IT</promise> — only emitted after collie:review (Mode=code) returns PASS.
/collie:autoiter "reduce p99 latency below 200ms" --max-iterations 5
Completion signal: <promise>Collie: AUTOITER DONE</promise> — worktree preserved for user review, not auto-merged.
/collie:auto/collie:auto "task"
-> Research & Reuse <- internal specs first, then web / registry / docs + deferred scope scan
-> superpowers:brainstorming
-> superpowers:writing-plans <- hook marks plan pending for dual review
-> PARALLEL:
collie:plan-doc-reviewer <- structural review
collie:review <- Collie rubric review
-> (both approved)
-> ExitPlanMode <- hook reminds to call collie:flow
-> collie:flow <- TDD + code review + [collie-final-review] pre-merge gate
-> <promise>Collie: SHIP IT</promise>
Every arrow is hook-enforced. Skip a step and the hook blocks you.
/collie:autoiter vs /collie:auto| Dimension | /auto | /autoiter |
|---|---|---|
| Structure | Single linear pass | N iterations (default 5) |
| Stop condition | SHIP IT | Quality threshold / iteration cap / convergence / budget / escalate |
| Typical use | New feature development | Long-running test hardening, metric optimization, batch bug fixes |
| Worktree | Auto-merge + cleanup | Preserved for user review |
The two workflows are independent and cannot be nested.
| Variable | Purpose | Default |
|---|---|---|
COLLIE_ESCALATE_CMD | Custom escalation handler (shell command) | Writes to ~/.collie/escalations.log |
COLLIE_AUTOITER_NOTIFY_CMD | Terminal-event notification for autoiter | stdout only |
COLLIE_HOME | Override state directory location | ~/.collie |
Autoiter notification payload env vars: COLLIE_AUTOITER_EVENT, COLLIE_AUTOITER_RUN_ID, COLLIE_AUTOITER_STATUS_FILE.
# Load as dev plugin (session-only, doesn't affect installed version)
claude --plugin-dir ~/git/collie-harness
# Run unit tests
node --test tests/*.test.js
# Run E2E smoke tests
./tests/e2e/smoke.sh
# Validate plugin structure
claude plugin validate ~/git/collie-harness
No build step. Pure Node.js, zero external dependencies.
MIT
npx claudepluginhub lkv1988/collie-harness --plugin collieLong-running agent harness with 5-layer memory architecture, GitHub integration, autonomous batch processing, Agent Teams with ATDD, 9 hooks (safety, quality gates, team coordination), and 6 Agent Skills
Hypo-Workflow for Claude Code. The plugin namespace is intentionally `hw`; plugin-root commands map /hw:* to existing workflow Skills.
HarnessFlow — From idea to shipped product: high-quality engineering workflows for AI agents. Spec-anchored SDD, gated TDD, evidence-based routing, independent reviews, and formal closeout.
Harness engineering for Claude Code — hook-enforced dual review, state-machine gates, and fail-closed safety where it counts.
Autonomous agent orchestrator for full development lifecycles with zero human input, session budget management, and crash recovery
Harness-native ECC operator layer - 67 agents, 271 skills, 92 legacy command shims, reusable hooks, rules, selective install profiles, and production-ready workflows for Claude Code, Codex, OpenCode, Cursor, and related agent harnesses