From craftwork-all
Builds shell scripts, CI jobs, cron tasks, batch processors, and pipelines using headless coding-agent CLIs (Claude Code, Codex, Gemini, Copilot) or cloud agents (gh agent-task).
How this skill is triggered — by the user, by Claude, or both
Slash command
/craftwork-all:automateThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
Run these nine steps in order for every new automation. The sections below are the toolbox; this list is the procedure.
Run these nine steps in order for every new automation. The sections below are the toolbox; this list is the procedure.
references/introspection.md.references/{codex,gemini,copilot,github-agent-task}.md is mandatory — examples in this file are Claude-syntax. See §Do not transliterate.--allowedTools + --output-format json + schema + success verification (§Non-negotiable flags).| CLI | Headless invocation | Reference |
|---|---|---|
| Claude Code | claude -p "<prompt>" | references/claude-code.md |
| Codex | codex exec "<prompt>" | references/codex.md |
| Gemini CLI | gemini -p "<prompt>" | references/gemini.md |
| GitHub Copilot CLI | copilot -p "<prompt>" --allow-all-tools --autopilot | references/copilot.md |
gh agent-task (cloud) | gh agent-task create "<description>" --base main | references/github-agent-task.md |
Combinable — gh agent-task opens the PR, a local CLI reviews it (see cookbook).
Examples throughout this file are Claude-syntax. Swapping claude→codex/gemini/copilot with the same flags hallucinates. Key divergences:
| Concern | Claude | Codex | Gemini | Copilot |
|---|---|---|---|---|
| Auto mode | --permission-mode acceptEdits | --sandbox <mode> / --full-auto | --approval-mode auto_edit | --autopilot + --allow-all-tools + --no-ask-user |
| Tool scope | --allowedTools "Read,Edit,..." | --sandbox read-only|workspace-write|danger-full-access | --policy <file> | --allow-tool "<name>" |
| Structured output | --json-schema '<inline JSON>' | --output-schema <file> | (policy / post-parse) | (post-parse) |
| JSON result field | .result / .structured_output | --output-last-message <file> | .response | JSONL events |
| Session resume | --resume <id> | codex exec resume --last | -r <id> / -r latest | (n/a) |
| Session ID path | .session_id | .thread_id (in thread.started) | .session_id | per-event session_id |
Local execution (claude / codex / gemini / copilot):
plan/default/interactive in a script; they pause for a human.--allowedTools narrow allowlist — the real safety boundary. Auto mode = "don't pause for approval"; the allowlist defines what the agent can do. Read-only: "Read,Grep,Glob". Dev: "Read,Edit,Bash(npm test*)".--output-format json when the next step is a shell pipeline. Never parse text with regex/grep/sed.--json-schema Claude, --output-schema Codex) when output feeds downstream code. Stops drift into prose/invalid enums so jq can't silently break.is_error, subtype, errors[], permission_denials[]), not just exit code. All four CLIs can exit 0 on recoverable failures.Delegated cloud execution (gh agent-task):
--base <branch>. A PR against develop when you meant main wastes a review cycle.create exit 0 = "dispatched," not "succeeded" and not "merged." The agent runs after the command returns. Pair with a completion watcher (gh pr list, pull_request.opened webhook, or --follow).--repo <owner/name> when targeting another repo. One dispatcher addresses any accessible repo without cloning.-F <file> / stdin, never inlined as argv. Avoids quoting bugs and arg-length limits.Shapes 1-4 are local execution; shape 5 is cloud delegation.
Classification, extraction, summarization, triage. Script gets a parseable structured answer.
result=$(claude -p "Classify the severity of this error: $(cat error.log)" \
--output-format json \
--json-schema '{"type":"object","properties":{"severity":{"type":"string","enum":["low","medium","high","critical"]},"reason":{"type":"string"}},"required":["severity","reason"]}' \
--permission-mode acceptEdits \
--allowedTools "Read")
severity=$(echo "$result" | jq -r '.structured_output.severity')
No edit tool in the allowlist ⇒ no writes happen, regardless of permission mode. That's how you scope "read-only."
"Review and fix," "refactor," "add tests." Model needs file access + command execution.
cd /path/to/repo
claude -p "Run the test suite and fix any failures. Report which tests you changed." \
--allowedTools "Bash,Read,Edit" \
--permission-mode acceptEdits \
--output-format json > run.json
acceptEdits auto-approves writes. dontAsk is stricter (deny-by-default) — better for locked-down CI. Real scope control is still the allowlist.
Discrete checkpointed steps — review, decide, act — each seeing prior context.
session_id=$(claude -p "Audit the auth module for security issues" \
--output-format json --permission-mode acceptEdits \
--allowedTools "Read,Grep,Glob" | jq -r '.session_id')
claude -p "Now prioritize the issues you found by severity" \
--resume "$session_id" --output-format json \
--permission-mode acceptEdits --allowedTools "Read,Grep,Glob" > prioritized.json
claude -p "Fix the critical issues only" \
--resume "$session_id" --permission-mode acceptEdits --allowedTools "Read,Edit"
First two steps are read-only via narrow tools. --resume <id> is explicit and parallel-safe; --continue (most-recent-in-cwd) races.
Same operation across N files/PRs/tickets. Fan out via xargs -P or background jobs; collect into a directory, then reduce.
mkdir -p out
find . -name '*.py' -print0 | xargs -0 -P 4 -I {} bash -c '
f="$1"
timeout 60s claude -p "Summarize $(cat "$f") in one sentence" \
--output-format json --permission-mode acceptEdits \
--allowedTools "Read" \
> "out/$(basename "$f").json"
' _ {}
jq -s 'map({file: .session_id, summary: .result})' out/*.json > summary.json
Bound parallelism (-P 4) to dodge rate limits. Per-worker timeout stops one runaway from stalling the batch.
Output = a PR, no synchronous control needed. Local CLI is a dispatcher.
# Dispatch into current repo
gh agent-task create "Implement the spec in docs/rfc-042.md" --base main
# Dispatch into ANOTHER repo from outside it — no clone
gh agent-task create -F ticket.md --repo my-org/payments-service --base main
# Fan out across a fleet
gh repo list my-org --topic services --json nameWithOwner --jq '.[].nameWithOwner' | \
while read -r repo; do
gh agent-task create -F upgrade.md --repo "$repo" --base main
sleep 2
done
# Dispatch and follow logs in the foreground
gh agent-task create -F ticket.md --base main --follow
--repo is the orchestration lever — one dispatcher addresses any accessible repo. Pair with a local-execution CLI for review-gating: agent opens the PR, Claude/Codex/Gemini/Copilot reviews before merge. See references/github-agent-task.md for preview-status, PR-number-ambiguity, and no-JSON-from-create pitfalls.
--allowedTools, Codex --sandbox, Gemini --policy, Copilot --allow-tool) says what the agent can do. Read-only = narrow allowlist, not a different mode.--allowedTools only for reproducibility or adversarial input.$, backticks, quotes, newlines. Heredocs (<<'EOF') or --append-system-prompt-file avoid shell-expansion bugs.-p. Prompt-injection surface. Wrap user-controlled content in delimited blocks (<user_input>...</user_input>) with instructions to treat as data.--continue in parallel jobs. Races itself — resolves to "most recent session in this cwd." Use explicit session IDs.references/introspection.md.references/cookbook.md.Every automation has to be re-runnable by someone who didn't write it and auditable after the fact.
Reusability:
set -euo pipefail at the top. Non-negotiable.MODEL="${MODEL:-sonnet}", TIMEOUT="${TIMEOUT:-600}", FILE="${1:?path required}". No hardcoded constants.: "${ANTHROPIC_API_KEY:?set it}".--dir or TARGET="${1:?}" and cd "$TARGET" inside the script. Don't rely on the caller's cwd.--force (or env FORCE=1).--append-system-prompt-file, not inlined — swap prompts without editing shell logic.Observability:
2>&1 into a pipe that feeds jq. Progress messages go >&2.echo "$session_id" > .session) — Claude/Gemini .session_id, Codex .thread_id (in thread.started when --json is on). --continue/--last/-r latest race under shared cwd or parallelism; always pass an explicit ID..total_cost_usd on every call; sum across a batch before you report done.--output-format stream-json --verbose --include-partial-messages | tee progress.log.0 success, 2 usage/input error, 3 agent reported failure (is_error:true or permission_denials[] non-empty), 124 timeout fired, 130 interrupted. Downstream tools branch on codes, not on stdout text.trap 'git stash -u || true; cp progress.log "progress.$(date +%s).log" 2>/dev/null || true' ERR.Every script opens with a header a stranger can run from:
#!/usr/bin/env bash
# <name>.sh — one-line purpose
# Usage: ./<name>.sh <args> [flags]
# Env: ANTHROPIC_API_KEY (required), MODEL (default: sonnet), TIMEOUT (default: 600)
# Posture: read-only | edits | shell (enforced by --allowedTools "<list>")
# Output: stdout JSON; exits 0/2/3/124 — see README for taxonomy
set -euo pipefail
Creates, edits, and optimizes skills for Claude Code, including drafting, evaluating with test prompts, iterating on performance, and improving skill descriptions for better triggering accuracy.
npx claudepluginhub andurilcode/craftwork --plugin craftwork-all