From engineering
Use for end-to-end software execution in an unfamiliar or complex repo: orienting the codebase, choosing an execution model, planning and verifying changes, reviewing architecture or PRs, improving agent-first harnesses, and coordinating work across the specialized `frontend` and `backend` engineering skills.
How this skill is triggered — by the user, by Claude, or both
Slash command
/engineering:agentic-development [task type or describe what you need: implement, review, debug, plan, harness, agent-native, orchestrate, release][task type or describe what you need: implement, review, debug, plan, harness, agent-native, orchestrate, release]The summary Claude sees in its skill listing — used to decide when to auto-load this skill
This skill is the orchestration layer for software engineering work. It owns repo orientation, execution-mode selection, planning, review discipline, verification gates, release flow, cross-domain coordination, and agent-native architecture design.
README.mdhooks/check-completion.shhooks/post-bash.shhooks/pre-tool.shhooks/session-end.shreferences/agent-native-architecture.mdreferences/agentic-system-design.mdreferences/agents/adversarial-reviewer.mdreferences/agents/agent-native-auditor.mdreferences/agents/agents-memory-updater.mdreferences/agents/architecture-strategist.mdreferences/agents/ci-watcher.yamlreferences/agents/compatibility-scanner.yamlreferences/agents/correctness-reviewer.mdreferences/agents/learnings-researcher.mdreferences/agents/openai.yamlreferences/agents/orchestrate.yamlreferences/agents/performance-oracle.mdreferences/architecture-analysis.mdreferences/architecture-patterns.mdThis skill is the orchestration layer for software engineering work. It owns repo orientation, execution-mode selection, planning, review discipline, verification gates, release flow, cross-domain coordination, and agent-native architecture design.
## What do you need?Pick a number or describe what you want. The routing table below maps your answer to the right references.
| Answer | Read | |--------|------| | 0, "brainstorm", "design", "before building", "what should we" | [brainstorming-design-gate.md](./references/brainstorming-design-gate.md) — explore context, one-at-a-time questions, 2-3 approach proposals, design approval gate | | 1, "implement", "execute", "feature" | Triage input → [subagents-and-parallelism.md](./references/subagents-and-parallelism.md); search learnings → [institutional-learnings.md](./references/institutional-learnings.md); orient → [repo-orientation.md](./references/repo-orientation.md) | | 2, "debug", "bug", "root cause", "error", "trace" | [debug-investigation.md](./references/debug-investigation.md), then [observability.md](./references/observability.md) for signal sources | | 3, "review", "PR", "code review" | [reviews-and-comments.md](./references/reviews-and-comments.md) — includes multi-agent persona selection and severity scale | | 4, "plan", "architect", "design", "system" | Search learnings first → [institutional-learnings.md](./references/institutional-learnings.md); then [prd-to-plan.md](./references/prd-to-plan.md), [system-design-workflows.md](./references/system-design-workflows.md), [architecture-analysis.md](./references/architecture-analysis.md), [codebase-architecture-language.md](./references/codebase-architecture-language.md) | | 5, "harness", "agent-readiness", "CI", "evals" | [harness-engineering.md](./references/harness-engineering.md) | | 6, "agent-native", "MCP", "tools", "first-class agents" | [agent-native-architecture.md](./references/agent-native-architecture.md) | | 7, "orchestrate", "cloud agents", "parallel agents" | [orchestrate-roles.md](./references/orchestrate-roles.md), then `/orchestrate` | | 8, "release", "merge", "deploy", "hotfix" | [land-and-deploy.md](./references/land-and-deploy.md), [hotfix-procedures.md](./references/hotfix-procedures.md) | | 9, "learn", "improve", "skill", "memory" | [institutional-learnings.md](./references/institutional-learnings.md) to capture findings; [self-improvement.md](./references/self-improvement.md), [skill-extraction.md](./references/skill-extraction.md) | | 10, "tech debt", "debt", "migrations", "dependency", "feature flags", "test coverage", "triage", "lint", "anti-pattern" | [tech-debt-cloud-agents.md](./references/tech-debt-cloud-agents.md) — 10 cloud-agent cadences for schedulable debt elimination, including lint-enforced anti-pattern elimination |../frontend/SKILL.md or ../backend/SKILL.md.BLOCKED or genuine ambiguity. See subagents-and-parallelism.md.DONE, DONE_WITH_CONCERNS, BLOCKED, or NEEDS_CONTEXT.../frontend/SKILL.md or ../backend/SKILL.md.quality-assurance before declaring done, code-documentation after implementing any public API surface, autoimprove or memory-management at session end. If the companion skill isn't installed, install it. If it is installed but hasn't run, run it now.| Scope / Complexity | Default execution model |
|---|---|
| Small, single proof path | Direct execution |
| Medium, separable concerns | Subagents-driven development |
| Large, cross-domain feature | Team of agents |
| Epic, multi-iteration work | Supervised harness loop |
| Agent-first repo audit or harness improvement | Harness engineering pass |
| Distributed, multi-scope or context-exhausting epic | Cloud orchestration (/orchestrate) |
Read subagents-and-parallelism.md for controller/worker patterns, Parallel Safety Check, worktree isolation, git index safety, post-batch merge flow, two-stage review (spec compliance → code quality), implementer status handling (DONE/DONE_WITH_CONCERNS/BLOCKED/NEEDS_CONTEXT), and model routing for sub-agents.
Read subagent-prompt-templates.md for concrete prompt templates: implementer, spec compliance reviewer, and code quality reviewer. Use when dispatching sub-agents to avoid vague prompts.
Read repo-orientation.md for startup discovery, repo-shape detection, and initial command selection.
Read instruction-file-design.md when structuring or auditing AGENTS.md, CLAUDE.md, or other instruction surfaces.
Read harness-engineering.md when improving agent-first repo readiness: context architecture, subsystem guidance, mechanical enforcement, CI feedback, evals, observability, persistent knowledge, deslop passes, CI watching, compatibility scans, or background quality agents.
Read harness-loops.md for repeated agent iterations, disk-backed specs/plans, builder/reviewer loops, restore points, and sequential review pipelines with auto-decisions.
Read institutional-learnings.md when capturing a solved problem, searching docs/solutions/ before starting work in a documented area, or deciding whether a finding belongs in structured solutions vs prose lessons.
Read scratch-space.md when a harness run or skill needs to write intermediate files, checkpoints, delegation prompts, or any state that is not a final deliverable. Use this to avoid polluting the repo and to make state discoverable by sibling agents.
Read architecture-analysis.md when explaining how the system works, tracing dependencies, or evaluating change impact.
Read codebase-architecture-language.md when producing an architecture audit, writing AGENTS.md guidance, or describing structural problems in a PR or report. This reference defines the shared vocabulary: god objects, shotgun surgery, blast radius, layer violations, cohesion, coupling, lint gates, and the 7-dimension scoring rubric. Use this language consistently so findings can be compared across sessions.
Read architecture-patterns.md when choosing or comparing software architecture patterns.
Read system-design-workflows.md when designing a system, migration, API surface, or scaling plan from scratch.
Read interface-design.md when the right module or API shape is unclear and should be explored before implementation.
Read tech-decision-guide.md when the user needs to choose a technology.
Read interviewer-pattern.md before any non-trivial creation task where key constraints are still ambiguous.
Read prd-to-plan.md for phased implementation plans and tracer-bullet slicing.
Read refactor-planning.md for safe refactor decomposition, commit slicing, and RFC/issue planning.
Read debug-investigation.md for the full debug workflow: Triage → Investigate → Root Cause → Fix → Handoff. Includes assumption audits, causal chain gates, smart escalation, root cause tracing (trace backward through the call chain — never fix the symptom), defense-in-depth validation (validate at every layer), and condition-based waiting (replace arbitrary sleep() with real condition checks).
Read observability.md before adding logs or instrumentation, and whenever debugging should start from existing signals.
Read reviews-and-comments.md for code review, PR feedback, multi-agent persona selection, P0–P3 severity scale, and adversarial review passes.
Read code-simplification.md before opening a PR or after a long dev-loop: resolves scope from the branch diff, runs 3 parallel review agents (reuse/quality/efficiency), applies fixes, and verifies behavior is preserved.
Read specs-plans-tests.md when the workflow needs explicit separation between behavior spec, execution plan, and proof.
Read verification-and-finish.md before declaring success, opening a PR, or cleaning up a branch/worktree.
Read tdd-advanced.md for spec-first TDD, property-based testing, mutation testing, and autonomy limits.
Read tdd-framework-guide.md when setting up or changing a test framework.
Read tdd-ci-integration.md when wiring coverage into CI.
Read github-actions-templates.md, gitlab-ci-templates.md, and deployment-gates.md for baseline CI pipeline generation and deployment gates.
Read agent-native-architecture.md when building systems where agents are first-class citizens: the five principles (Parity, Granularity, Composability, Emergent Capability, Improvement Over Time), MCP tool design, action parity, architecture checklist, anti-patterns, and success criteria.
Read agent-native-architecture.md before any new agent-facing API, MCP server, or AI-powered product feature. Run the agent-native-auditor agent to score an existing system against the eight principles.
Read agentic-system-design.md when designing agent architectures from scratch.
Read prompt-engineering-patterns.md when writing or improving prompts.
Read llm-evaluation-frameworks.md when evaluating LLM or RAG output quality.
Read decision-principles.md when running autonomous loops that need to auto-decide intermediate choices, or when surfacing structured decision briefs to users. Use the decision classification (mechanical/taste/user-challenge) to determine whether to auto-decide silently, queue for a final gate, or pause and ask.
Read model-routing.md when spawning sub-agents to route tasks to the right model tier (Haiku for routine file ops, Sonnet for implementation, Opus for security/architecture). The 80/15/5 split achieves ~10× cost reduction vs defaulting everything to a premium model.
Read orchestrate-roles.md when distributing a large task across parallel cloud agents: planner/worker/verifier role separation, plan.json state management, Andon escalation, and when to use /orchestrate vs. /dev-loop.
Read collaboration-and-git.md for branch and PR workflow.
Read conventional-commits.md for commit message standards and release tooling.
Read release-workflow-comparison.md when choosing a branching strategy.
Read hotfix-procedures.md for emergency release flow.
Read land-and-deploy.md when the user asks to merge, land, or deploy.
Read tech-debt-cloud-agents.md when planning or executing debt elimination at scale: large-scale migrations, dependency updates, feature-flag retirement, test coverage fills, Sentry triage, design-system drift, monolith splits, docs sync, and automated bug reproduction. All nine patterns are designed to run as autonomous cloud-agent cadences — scheduled, parallelized, and reviewed as PRs rather than consuming sprint capacity.
Read self-improvement.md at session end or after recurring mistakes.
Read autoimprove.md for autonomous reference optimization.
Read skill-extraction.md when session learnings should become a reusable skill.
Read continual-learning.md to update durable repo/user memory in instruction files.
Read learning-system.md and its companion references when operating the repo-local learning/ system.
<reference_index>
Orientation and planning
references/repo-orientation.md — startup discovery, repo-shape detectionreferences/instruction-file-design.md — AGENTS.md/CLAUDE.md structurereferences/brainstorming-design-gate.md — design-first gate: explore, one-question-at-a-time, 2-3 proposals, approval before code ★ newreferences/prd-to-plan.md — phased implementation plans and tracer-bullet slicingreferences/refactor-planning.md — safe refactor decompositionreferences/interviewer-pattern.md — ambiguity resolution before non-trivial tasksArchitecture and design
references/architecture-analysis.md — tracing dependencies, evaluating change impactreferences/codebase-architecture-language.md — shared vocabulary for architecture audits: anti-patterns, health properties, enforcement terms, 7-dimension scoring rubric ★ newreferences/architecture-patterns.md — comparing software architecture patternsreferences/system-design-workflows.md — designing systems from scratchreferences/interface-design.md — exploring module or API shape before implementationreferences/tech-decision-guide.md — technology selectionExecution and parallelism
references/subagents-and-parallelism.md — Parallel Safety Check, worktree isolation, two-stage review, implementer status handling, model routingreferences/subagent-prompt-templates.md — concrete implementer, spec-reviewer, and code-quality-reviewer prompt templates ★ newreferences/harness-loops.md — disk-backed loops, builder/reviewer loops, restore pointsreferences/orchestrate-roles.md — planner/worker/verifier for cloud agentsAgent-native and AI systems
references/agent-native-architecture.md — five principles, MCP tool design, checklist, anti-patterns ★ newreferences/agentic-system-design.md — agent architecture from scratchreferences/prompt-engineering-patterns.md — prompt writing and improvementreferences/llm-evaluation-frameworks.md — LLM/RAG eval qualityreferences/decision-principles.md — mechanical/taste/user-challenge classificationTech debt and quality cadences
references/tech-debt-cloud-agents.md — 10 cloud-agent patterns for schedulable debt elimination (migrations, deps, flags, coverage, triage, design drift, monolith splits, docs, bug reproduction, lint-enforced anti-pattern elimination)Harness improvement
references/harness-engineering.md — audit dimensions, improvement patterns, scratch space disciplinereferences/observability.md — logs, traces, agent-legible debugging signalsreferences/institutional-learnings.md — docs/solutions/ schema, capture workflow, grep-first search ★ newreferences/scratch-space.md — OS-temp vs .context/ vs durable; decision tree for intermediate state ★ newDebugging
references/debug-investigation.md — phased debug workflow: Triage, Investigate, Root Cause, Fix, Handoff; assumption audits, causal chain gates; root cause tracing (backward call-chain tracing); defense-in-depth validation; condition-based waiting for flaky tests; find-polluter.sh bisection scriptReview, testing, and CI
references/reviews-and-comments.md — multi-agent code review, P0–P3 severity scale, persona selection, adversarial passesreferences/code-simplification.md — 3-agent simplification pass (reuse/quality/efficiency) before PR; scope resolution, verificationreferences/specs-plans-tests.md — behavior spec, plan, proof separationreferences/verification-and-finish.md — success gates before declaring donereferences/tdd-advanced.md — spec-first TDD, property-based testingreferences/tdd-framework-guide.md — test framework setupreferences/tdd-ci-integration.md — coverage wired to CIreferences/github-actions-templates.md, references/gitlab-ci-templates.md — CI baselinesreferences/deployment-gates.md — deployment gate patternsGit, release, and landing
references/collaboration-and-git.md — branch and PR workflowreferences/conventional-commits.md — commit standardsreferences/release-workflow-comparison.md — branching strategy selectionreferences/hotfix-procedures.md — emergency release flowreferences/land-and-deploy.md — merge and deployLearning and self-improvement
references/self-improvement.md — session-end learningreferences/autoimprove.md — autonomous reference optimizationreferences/skill-extraction.md — session learnings → reusable skillreferences/continual-learning.md — durable repo/user memory updatesreferences/learning-system.md — learning/ folder system
</reference_index>This skill should reference the specialized engineering skills repeatedly and intentionally:
../frontend/SKILL.md.../backend/SKILL.md.../frontend/SKILL.md and ../backend/SKILL.md.../frontend/SKILL.md.../backend/SKILL.md.Review agents (dispatch in parallel for multi-agent code review):
references/agents/correctness-reviewer.md — logic errors, off-by-one, null propagation, race conditions, broken error propagation; always-on for any code review.references/agents/adversarial-reviewer.md — chaos engineering perspective; constructs failure scenarios from assumption violations, composition failures, cascades, and abuse cases; conditional (diff ≥50 lines or high-risk domain).references/agents/performance-oracle.md — algorithmic complexity, N+1 queries, memory, caching, network, frontend bundle; conditional (DB-touching or data-scale code).references/agents/architecture-strategist.md — layer violations, coupling, SOLID, structural decisions; conditional (new services, refactors, cross-cutting concerns).Harness and CI agents:
references/agents/ci-watcher.yaml — watches CI for the current branch's PR; classifies failures as FLAKY, REAL, or INFRA; feeds /fix-ci.references/agents/compatibility-scanner.yaml — orchestrates the four-phase compatibility pass (harness scan, docs reliability, startup review, validation efficiency); feeds /check-agent-compat.Memory and improvement agents:
references/agents/agents-memory-updater.md — mines transcript deltas and updates CLAUDE.md / AGENTS.md with durable preferences and workspace facts.references/agents/learnings-researcher.md — grep-first search of docs/solutions/ for applicable past learnings before planning or implementing in a documented area. ★ newOrchestration agents:
references/agents/orchestrate.yaml — acts as Dispatcher for /orchestrate; spawns a cloud Planner, monitors progress, handles Andon escalations, and reconciles the final handoff.references/agents/agent-native-auditor.md — runs 8 parallel sub-agents scoring an existing codebase against agent-native principles (Action Parity, Tool Primitives, Context Injection, Shared Workspace, CRUD Completeness, UI Integration, Capability Discovery, Prompt-Native Features); produces a scored report with prioritized recommendations.scripts/repo_scan.py for repo scanning and startup context.scripts/harness_audit.py for agent-first readiness audits across instructions, CI, evals, observability, enforcement, and persistent knowledge.scripts/find-polluter.sh for test isolation bisection — finds which test creates unwanted files or state by running tests one-by-one until the polluter is identified.scripts/project_architect.py for architecture detection and layer violations.scripts/architecture_diagram_generator.py for Mermaid, PlantUML, or ASCII diagrams.scripts/stack_detector.py and scripts/pipeline_generator.py for baseline CI generation.scripts/coverage_analyzer.py, scripts/fixture_generator.py, and scripts/tdd_workflow.py for verification workflows.scripts/prompt_optimizer.py, scripts/rag_evaluator.py, and scripts/agent_orchestrator.py for LLM/agent systems work.scripts/init-learning.sh, scripts/init-learning.py, scripts/capture-item.py, scripts/scan-learning.py, and scripts/refresh-learning.py for the learning system.../frontend/scripts/.../backend/scripts/.docs/solutions/ for prior decisions and pitfalls in the affected area. See institutional-learnings.md or dispatch the learnings-researcher agent.../frontend/SKILL.md, ../backend/SKILL.md, or both when domain-specific detail is needed.docs/solutions/. Surface follow-up risks, rollout notes, and cleanup for the next engineer.Three slash commands come with this skill. Wire them from engineering/commands/.
/dev-loopStarts an agentic development loop in the current session. The stop hook re-injects the original task prompt each iteration until a <promise> completion tag is output or --max-iterations is reached.
/dev-loop Fix the auth bug --completion-promise 'FIXED' --max-iterations 10
/dev-loop Add test coverage --verify-cmd 'npm test' --completion-promise 'DONE'
/dev-loop Improve API layer --spec-file TASK.md --max-iterations 20
Options: --max-iterations N, --completion-promise TEXT, --verify-cmd CMD, --spec-file PATH.
/cancel-dev-loopCancels an active dev loop by removing .claude/agentic-dev-loop.local.md.
/check-agent-compatRuns the four-phase compatibility scan before extending agent autonomy: harness score, docs reliability, startup path, and validation efficiency. Produces a P0/P1/P2 finding list. Start here before running a long harness-loop on an unfamiliar repo.
/check-agent-compat [repo-root]
/fix-ciWatches CI for the current branch, classifies failures (FLAKY/REAL/INFRA), and applies targeted fixes in a watch → identify → fix → re-watch loop. Uses the ci-watcher agent internally.
/fix-ci [--max-iterations N] [--check <check-name>]
/harness-loopStarts a harness engineering improvement loop. Each iteration runs harness_audit.py, picks the highest-priority P0/P1 finding, enforces it as a CI gate or structural test, and commits. Stops when a fresh audit shows no P0 or P1 items.
/harness-loop --max-iterations 20
/harness-loop path/to/repo --max-iterations 15
/orchestrateDecomposes a large goal into parallel cloud-agent tasks using the planner/worker/verifier model. The local IDE session acts as the Dispatcher; a Planner cloud agent owns scope decomposition; Worker agents execute sub-tasks in isolation. Andon signals pause the pipeline and escalate to the user.
/orchestrate "Migrate the legacy auth module to the new SDK, add test coverage, and update AGENTS.md"
Use when a task would exhaust a single agent's context budget, benefits from parallelism, or spans independently verifiable sub-scopes. Read orchestrate-roles.md before invoking.
This skill ships four hooks. Wire them in Claude Code settings, replacing /absolute/path/to/agentic-development with the actual path.
hooks/check-completion.sh operates in two modes:
/dev-loop or /harness-loop): checks .claude/agentic-dev-loop.local.md for active loop state, parses the JSONL transcript for the last assistant message, detects <promise> completion tags, and re-injects the original task prompt via reason each iteration.hooks/pre-tool.sh logs every Bash, Write, and Edit invocation to stderr.hooks/post-bash.sh captures non-zero exit codes into memory/working/last_error.json.hooks/session-end.sh writes memory/working/session_end.json so the self-improvement phase can detect a finished session.npx claudepluginhub alvarovillalbaa/plugins --plugin engineeringGuides creation, editing, and verification of skills for AI coding agents using test-driven development with subagent scenarios. Use when authoring or debugging skills.