Skill

handle-it

End-to-end orchestrator that takes a Linear issue (or a freeform feature/project) and drives it through planning → implementation → draft PR → an in-house review⇄fix loop (rated to 5/5) → conflict-free CI + preview → an auto-tester → your manual review (PR stays a draft) → ready-for-review on your approval, with a live status table and Linear blocking awareness. Use when the user invokes /handle-it, or asks to take an issue/feature all the way to a review-ready PR.

Invocation

How this skill is triggered — by the user, by Claude, or both

Slash command

/swatkinson-toolkit:handle-it

User invocable

Model invocable

Inline context

Default effort

Context Preview

The summary Claude sees in its skill listing — used to decide when to auto-load this skill

You are the **orchestrator** (run as Opus, in the main conversation). You route the request to the right planning entrypoint, then drive each target issue through the full lifecycle — implement (Sonnet) → draft PR → review⇄fix loop (🐊 to 5/5) → resolve conflicts → CI + preview → test-and-tick → manual-review handoff (**PR stays a draft**) → mark ready on your approval → senior review → watch t...

Supporting Files

REFERENCE.md

SKILL.md

158 lines · ~4.6k tokens

Stats

Parent stars0

MaintenanceGood

Last CommitJun 4, 2026

Actions

View Source View Plugin View on GitHub View README

Stats

Actions

handle-it

You are the orchestrator (run as Opus, in the main conversation). You route the request to the right planning entrypoint, then drive each target issue through the full lifecycle — implement (Sonnet) → draft PR → review⇄fix loop (🐊 to 5/5) → resolve conflicts → CI + preview → test-and-tick → manual-review handoff (PR stays a draft) → mark ready on your approval → senior review → watch to merge — keeping a live status table and honoring Linear blocking relations.

Autonomy contract: Run autonomously from implementation through the review loop, conflict resolution, CI/preview, and the auto-tester — don't checkpoint between those. When waiting on external state (a blocker to merge), loop and wait yourself (ScheduleWakeup). The human gates are exactly: (a) /grill-with-docs during planning, (b) agentsystem-core:open-pr's draft-publish confirm, (c) workbench's DB-connection-string ask, (d) the manual-review gate — the PR stays a draft through review/CI/preview/test-and-tick; you hand it off for manual testing and WAIT; on the user's "looks good" you un-draft (mark ready for review) and tell them to request senior review, (e) a bail when you genuinely cannot proceed (PushNotification, update the table, stop).

Lifecycle mechanics, subagent prompt templates, Linear runtime resolution, and workbench specifics live in REFERENCE.md — load it before Phase 4. Invoke skills by their exact names — plugin skills need the agentsystem-core: prefix or Skill(...) errors Unknown skill (see REFERENCE → Skill invocation names).

Phase 0 — Preflight & resume detection

Detect the repo. CaivanOS (app.caivanos) is fully supported. Workbench is best-effort (see REFERENCE → Workbench). Anything else → tell the user it's not configured and stop.
Confirm Linear tools. Prefer the Linear MCP tools (mcp__linear-server__* in this setup). If absent, tell the user and switch to manual Linear mode: read issues from pasted text, emit issues you write/edit as one copyable markdown block (see REFERENCE → Linear-down fallback). The git/gh pipeline is unaffected.
Resolve runtime IDs once and reuse: my user id (get_user query=me), team id, the ai label id, the active cycle id, and the workflow status ids (Backlog/Todo/In Progress/In Review/Done). See REFERENCE → Linear runtime resolution.
Detect where to resume. /handle-it can be dropped on an issue at any stage — don't assume Phase 1. Derive the furthest-completed phase and jump straight there:
- Explicit user pick-up instruction wins. If the invocation says where to start ("already implemented on worktree X, pick up at review"), honor it — but first verify its preconditions from ground truth (worktree exists, PR exists, etc.); if one is missing, say so and fall back to detection.
- Else derive from ground truth (authoritative — the Linear status block is only a hint, and can be stale): worktree present (bun run worktree:ls)? PR for the branch (gh pr view)? draft or ready? is there a ## 🐊 Claudecodile Rating: comment and what's the score? CI state, mergeable, reviewDecision, state? Map these to the entry phase (full table in REFERENCE → Resume detection).
- Reconcile + announce. If ground truth and the status block disagree, trust ground truth, rewrite the block, and tell the user: "Resuming BE-#### at Phase N (…) — found ."
Skip this only when there's clearly nothing to resume (no worktree, no PR, issue still Backlog/Todo) → start at Phase 1.

Phase 1 — Route to a starting point

Parse the invocation argument and pick ONE entry point. Context-complete means: a new person or agent could pick this up from the title + description alone (or the docs it points to) with zero further questions.

Argument	Condition	Entry point	Action
Issue id(s) given	Context-complete	EP4	Skip planning → Phase 2
Issue id(s) given	Lacks context	EP1	`/grill-with-docs` → rewrite the issue's title + description in `/to-issues` template form (don't create new issues)
No issue	One coherent feature	EP2	`/grill-with-docs` → create the Linear issue(s) with the defaults below
No issue	A large/multi-feature project	EP3	`/grill-with-docs` → `/to-prd` → `/to-issues` (atomic, dependency-ordered) with the defaults below

If "one feature vs project" is unclear with no issue given: anything that would yield 3+ issues is EP3.

Issue-creation defaults (EP2/EP3, and any /to-issues slice): assignee = me; add the ai label (keep the planning skill's ready-for-agent/category labels); set the real Linear blockedBy/blocks relations (not just body text); attach to the active cycle. Leave other labels for the user. See REFERENCE → Creating issues.

After Phase 1 you hold one or more context-complete target issues. Order by dependency (blockers first) and run Phases 2–13 on each leaf. Most invocations are a single issue. On any (re-)entry (e.g. after a ScheduleWakeup), skip issues already In Review/Done and resume the unfinished ones — Linear status is your source of truth, so you never double-claim or re-PR.

Phase 2 — Blocking check (per issue)

get_issue (with includeRelations: true) → read relations.blockedBy. A blocker is cleared only at status Done (merged). In Review is NOT Done.

Blocked: the issue is already context-complete, so wait to implement. Table row → ⏳ waiting on BE-####, then ScheduleWakeup (~20–30 min) to re-check; still open → sleep again; Done → proceed. Do NOT claim until unblocked. (User may also wrap the whole thing in /loop.)
Unblocked / no blockers: → Phase 3.

Phase 3 — Claim + worktree (per issue)

Claim: save_issue status = In Progress, assignee = me.
Worktree (orchestrator owns the path; all subagents cd into THIS one — never a second worktree for the same branch):
- CaivanOS: bun run worktree:new <domain>/<be-id>/<short-kebab>. Add --db-branch ONLY for a DB migration / schema change (matches lead-worktree-team); for non-migration issues, omit it. Branch naming per AGENTS.md. Capture the absolute path.
- Workbench: set up a worktree, copy .env, then ask the user for the DB connection string and WAIT (this ask overrides autonomy).

Phase 4 — Implement (subagent)

Classify the brief, then spawn the matching pre-built agent — Agent(subagent_type: …) with the brief + the absolute worktree path:

Feature / enhancement, or a clear bug (cause given in the brief) → handle-it-shipper (Sonnet; runs agentsystem-core:ship).
Unclear bug (symptom / error log / perf regression, no root cause) → handle-it-investigator (Opus; runs diagnose). If it reports the fix is feature-sized, re-route to handle-it-shipper.

The agent verifies (bun run check + full bun run test) and reports back — it does NOT commit or push. Once it returns, the orchestrator commits (Refs: BE-####) + pushes from the foreground (subagents hang on git push and ignore "don't push" — findings #12/#13; see REFERENCE → Git ownership). Agent defs: .claude/agents/handle-it-shipper.md, handle-it-investigator.md. All agents inherit the hard rules.

Phase 5 — Open DRAFT PR (orchestrator)

cd into the worktree first (the branch is checked out there; from the primary checkout you're on main). Run agentsystem-core:open-pr at mode=balanced and as a draft (--draft): Phase 4 already verified check+test, so balanced is right; production would block on pre-existing unrelated failures. It writes the title + Summary/Test-plan body (reference the bare BE-####) and shows its confirm gate — let it fire. save_comment the PR URL on the Linear issue; keep status In Progress. The PR stays a draft for the entire automated pipeline and your manual testing — it is un-drafted only on your "looks good" (Phase 12). Capture the PR number.

Phase 6 — Review ⇄ fix loop (until 5/5)

Delegate the whole loop to Skill(claudecodile-review) — pass it the worktree path + PR number (and, on a resume, the existing RATING_COMMENT_ID + score history if you have them). That skill owns the iterate-until-🐊-5/5 review⇄fix loop: Opus claudecodile-reviewer posts P#-tagged inline comments with suggested fixes + maintains the one ## 🐊 Claudecodile Rating: N/5 comment, Sonnet claudecodile-fixer applies them, and it loops (round 1 full diff → incremental → final full pass) until 5/5 = no P0/P1 AND every in-scope P2/P3 fixed (only scope-bloating nits may remain, recorded as follow-ups). The PR stays a draft throughout.

Why a skill, not inline: it runs in your context (the Skill tool loads it into this conversation, not a fresh agent), so you still own every commit + push between rounds exactly as before — functionally identical to when this lived inline, just deduplicated so /claudecodile-review is reusable standalone. As the loop reports each round, mirror its latest rating into the Review status cell (⏳ 3/5 → ✅ 5/5).

Handle the skill's return outcome:

5/5 → proceed to Phase 7. Any returned P2/P3 are the scope-deferred ones (in-scope nits were fixed in-loop) — file each important one as a Linear follow-up comment, leave the rest as a note.
plateau-bail (no score improvement across 2 rounds) → surface via AskUserQuestion: "stuck at N/5 on — accept as-is / guide me / keep iterating."
handback-bail (a comment needs a product decision or a hard-rule file) → bail and surface.

Skill: claudecodile-review (~/.claude/skills/claudecodile-review/). Agents it uses: claudecodile-reviewer, claudecodile-fixer.

Phase 7 — No merge conflicts (gate)

A CONFLICTING PR has no merge-ref, so GitHub runs zero pull_request workflows — no CI, no preview (canary finding #21; this was the real cause of every "CI/Vercel didn't run" mystery). So resolve conflicts before CI/preview. gh pr view <N> --json mergeable,mergeStateStatus; if CONFLICTING (main moved):

Code / non-migration conflicts → agentsystem-core:resolve-conflict (in the worktree), then re-push.
Migration-index conflicts (diff confined to migrations/ + _journal.json/snapshot) → bun run db:rebase (#633) if present, else the resolve-migration-conflict skill. Fails closed on hand-authored SQL → bail + surface. Run the printed git push --force-with-lease (the one sanctioned force-push).

See REFERENCE → Merge conflicts. The clean-branch push from resolving is also what triggers Phase 8's CI + preview.

Phase 8 — CI/CD green + preview (still a draft)

A conflict-free PR — even a draft — runs CI and deploys a Vercel preview on each push (preview-deploy.yml has no draft guard; drafts deploy fine — finding #15 corrected; un-draft is NOT a trigger). Poll gh pr checks <N>:

Code/test failures → spawn handle-it-shipper (or agentsystem-core:fix-pr-tests for failing CI tests specifically) in the worktree → fix → orchestrator commits + pushes → re-check.
Infra/workflow failures (env, neon/electric) → can't fix in code → surface to the user, don't loop. If CI shows 0 runs, re-check mergeable first — a CONFLICTING PR (Phase 7) is the cause, not an outage.
Preview link: grab it from the Vercel bot's "Preview Deployment" PR comment (gh pr view <N> --comments). If the deploy genuinely failed, note it + fall back to local in the handoff.

Phase 9 — Test-and-tick

Spawn handle-it-test-runner (Haiku) in the worktree to run the automatable Test-plan items — bun run check, full bun run test, and any listed build/focused suite — and report pass/fail per item. The orchestrator then ticks those checkboxes on the PR (gh pr edit, editing only the test-plan lines — leave any bot-managed section, e.g. Macroscope, untouched), leaving the human-only / click-through items for the user. (The test-runner never edits the PR — the orchestrator owns every PR/git mutation.) Agent def: .claude/agents/handle-it-test-runner.md.

Phase 10 — Manual-review handoff (PR stays a DRAFT, then WAIT)

First re-verify the branch is current (gh pr view <N> --json mergeable): if main advanced and it's now CONFLICTING, rebase (Phase 7) and let CI/preview refresh — so the preview you hand over is built on current main (#3). Then, only when 🐊 5/5 + conflict-free + CI green with a working preview + the tester has ticked the bun-run items — hand the user, leaving the PR a draft:

The Vercel preview link → ✅ Preview ready: <branch> → <url> (or, if the deploy genuinely failed, the failed-job URL + local fallback).
The remaining manual test items (the click-through ones the tester couldn't auto-run).
The branch name + a cd command for local testing.
Tell them: "# is ready for your manual testing" (still a draft — not yet for seniors).

Then WAIT for their verdict.

Phase 11 — Manual-review interaction

When the user replies with questions/issues: answer directly. For each reported problem, treat it as a mini Phase 4 — classify and spawn handle-it-shipper (clear fix/feature) or handle-it-investigator (unclear bug) to fix it edit-only → orchestrator commits + pushes → then, if the change is non-trivial, re-run Skill(claudecodile-review) (Phase 6) to keep the rating honest at 5/5. Loop until they approve. The PR stays a draft.

Phase 12 — Mark ready for review (on approval)

When the user says "looks good" (or similar):

Resolve the inline review threads (batch-resolve via GraphQL; keep the ## 🐊 Claudecodile Rating comment — it's an issue comment, not a thread). See REFERENCE → Resolving review threads.
gh pr ready <N> — un-draft (now it's for seniors).
save_issue Linear → In Review; save_comment it's review-ready.
Tell the user: "# is ready — request review from your seniors."

Never mark Done — a human merges.

Phase 13 — Watch to merge + unblock

Keep the row live; poll at low cadence (ScheduleWakeup ~30 min; user can cancel):

Merge conflicts → resolve as in Phase 7 (code → agentsystem-core:resolve-conflict; migration → bun run db:rebase / resolve-migration-conflict).
Senior Review (humans only — bot approvals do NOT count) → do not trust reviewDecision alone; a bot review can satisfy it. Read the actual reviews and take the latest state per reviewer, excluding bots: gh pr view <N> --json latestReviews,reviews. Ignore any review whose author is a bot — explicitly greptile, greptileai, macroscopeapp[bot], plus any login ending in [bot] or with __typename/type Bot — and ignore the PR author's own reviews. Then: ❌ if any human reviewer's latest state is CHANGES_REQUESTED (→ agentsystem-core:address-pr-comments, then resume) · ✅ only when at least one human reviewer's latest state is APPROVED and no human has CHANGES_REQUESTED · ⏳ otherwise (no human verdict yet — a lone bot approval stays ⏳). See REFERENCE → Senior Review (human-only).
Merged → on gh pr view <N> --json state = MERGED, read relations.blocks and fire "BE-### and BE-### are now unblocked!" + PushNotification. Done; stop watching this issue.

Status table

Print every turn, updated in place. One row per issue. ✅ done · ⏳ in progress / waiting · ❌ blocked / bailed · — n/a. Once the PR exists, append its number: BE-1234 (#123). The Review cell shows the live 🐊 rating (⏳ 3/5 → ✅ 5/5). The PR is a draft until Senior Review (un-drafted on the user's "looks good").

Issue	Plan	Implement	Draft PR	Review	CI+Preview	Manual Test	Senior Review	Merged	Status
BE-1234 (#123)	✅	✅	✅	⏳ 3/5	—	—	—	—	review⇄fix round 2

How each column is filled: REFERENCE → Status columns.

Mirror it into Linear. On each status update, also write this table into a delimited block at the top of the issue's description — between  and  markers — via save_issue (read the current description, replace only that block, preserve the brief below it). At-a-glance progress then lives in Linear, not just the chat. See REFERENCE → Linear status block.

Hard rules

Inherited from drain-queue — non-negotiable:

Never edit src/server/auth.ts, src/lib/auth/permissions.ts, env handling, or deploy config to satisfy any step → bail.
Never push to main, merge your own PR, amend, or pass --no-verify / skip-flags. Never force-push — except the single git push --force-with-lease that bun run db:rebase prints after a migration-index rebase.
Never mark a Linear issue Done — a human merges.
Always work in a worktree; always run full bun run check + bun run test, both green, before commit.
Never generate DB migrations without bun run db:branch on the worktree first.
Verify every concrete file/path/symbol claim in a brief against the repo before relying on it; note discrepancies in the PR body.
Stay strictly in the issue's scope — adjacent improvements become a Linear follow-up comment.

Bail (don't grind)

Notify (PushNotification) + table row ❌ + post a Linear comment, then stop, when: the brief needs a product decision; a step requires a hard-rule file; check/test is still red after fixes (one fix attempt = read→edit→re-run; third red → bail); the review loop hits a comment it can't address without a product decision or hard-rule file; or an infra CI failure can't be cleared in code. A returned issue is recoverable; wrong autonomous code is not.

handle-it

Invocation

Context Preview

Supporting Files

SKILL.md

handle-it

Invocation

Context Preview

Supporting Files

SKILL.md

handle-it

Phase 0 — Preflight & resume detection

Phase 1 — Route to a starting point

Phase 2 — Blocking check (per issue)

Phase 3 — Claim + worktree (per issue)

Phase 4 — Implement (subagent)

Phase 5 — Open DRAFT PR (orchestrator)

Phase 6 — Review ⇄ fix loop (until 5/5)

Phase 7 — No merge conflicts (gate)

Phase 8 — CI/CD green + preview (still a draft)

Phase 9 — Test-and-tick

Phase 10 — Manual-review handoff (PR stays a DRAFT, then WAIT)

Phase 11 — Manual-review interaction

Phase 12 — Mark ready for review (on approval)

Phase 13 — Watch to merge + unblock

Status table

Hard rules

Bail (don't grind)

Similar Skills

handle-it

Phase 0 — Preflight & resume detection

Phase 1 — Route to a starting point

Phase 2 — Blocking check (per issue)

Phase 3 — Claim + worktree (per issue)

Phase 4 — Implement (subagent)

Phase 5 — Open DRAFT PR (orchestrator)

Phase 6 — Review ⇄ fix loop (until 5/5)

Phase 7 — No merge conflicts (gate)

Phase 8 — CI/CD green + preview (still a draft)

Phase 9 — Test-and-tick

Phase 10 — Manual-review handoff (PR stays a DRAFT, then WAIT)

Phase 11 — Manual-review interaction

Phase 12 — Mark ready for review (on approval)

Phase 13 — Watch to merge + unblock

Status table

Hard rules

Bail (don't grind)

Similar Skills