From autoteam
Autonomous AI development team. Run /autoteam "<requirement>" to trigger the full 8-agent pipeline (Product Planner → Architecture → Implementation → QA × 3 → Documentation).
How this skill is triggered — by the user, by Claude, or both
Slash command
/autoteam:autoteamThis skill is limited to the following tools:
The summary Claude sees in its skill listing — used to decide when to auto-load this skill
When this skill is invoked, the current Claude Code session becomes the **Orchestration Agent**.
When this skill is invoked, the current Claude Code session becomes the Orchestration Agent.
Extract the requirement: Strip /autoteam from the triggering message. Trim whitespace. The remainder is <REQUIREMENT>.
If <REQUIREMENT> is empty or nonsensical, print:
❌ Usage: /autoteam "your requirement"
Example: /autoteam "build a REST API for task management"
Then stop.
All inter-agent communication happens through files in .autoteam/workspace/. No agent may write to a file it does not own.
Exception: docs/CODE-SUMMARY.md is a persistent, git-tracked file written by Orchestration (Step 2.5) and read by Product Planner and Architecture. It is NOT part of the inter-agent workspace protocol.
| File | Owner |
|---|---|
.autoteam/workspace/requirement-card.yaml | Product Planner |
.autoteam/workspace/adr.md | Architecture |
.autoteam/workspace/interface-contracts.yaml | Architecture |
.autoteam/workspace/discussion/round-N-arch.md | Architecture |
.autoteam/workspace/discussion/round-N-planner.md | Product Planner |
.autoteam/workspace/discussion/consensus.md | Orchestration |
.autoteam/workspace/qa-reports/security-report.md | QA Security |
.autoteam/workspace/qa-reports/quality-report.md | QA Quality |
.autoteam/workspace/qa-reports/test-report.md | QA Test |
.autoteam/workspace/qa-reports/aggregated-report.md | Orchestration |
.autoteam/workspace/fix-instructions.md | Orchestration |
.autoteam/workspace/sprint-contract.yaml | Orchestration |
.autoteam/workspace/phase-summary.md | Orchestration |
.autoteam/workspace/qa-reports/gate-report.md | Orchestration |
.autoteam/workspace/qa-reports/ratchet-baseline.txt | Orchestration |
.autoteam/workspace/chunk.md | Orchestration |
.autoteam/workspace/escalation.md | Implementation |
.autoteam/workspace/plan.md | Orchestration (write) + Human (approve) |
docs/CODE-SUMMARY.md | Orchestration (read by Documentation) |
docs/README.md | Documentation |
docs/ARCHITECTURE.md | Documentation |
docs/API.md | Documentation |
AGENTS.md | Documentation |
# TEMPLATE) are never deletedAt each phase boundary, Orchestration writes a compressed state to .autoteam/workspace/phase-summary.md (overwrite each time). This is the ONLY context carried forward — previous step details are NOT re-read unless specifically needed.
phase: <completed phase name>
requirement: <one-line summary>
tech_stack: <language + framework>
modules: [list of module IDs]
implementation_status: complete | partial
qa_round: <N>
critical_findings: <count>
resolved_findings: <count>
pending_fixes: [FIX-IDs]
next_action: <what happens next>
Write phase-summary.md after: Step 6 (Implementation), Step 8 (QA Aggregate), each QA fix loop iteration. Read phase-summary.md before: Step 7 (QA Pipeline), Step 9 (QA Loop Decision), Step 10 (Documentation).
Pipeline step reference (for internal Orchestration logic):
性质:强制 gate — 必须人类批准 plan.md 才能继续
.autoteam/workspace/plan.md exists with APPROVED: true:
git diff --stat --since="<last_review_at>"[Step 0/11] ✓ Plan approved (skip brainstorming) → Step 1[Step 0/11] ⚠️ Plan may be stale — quick reviewAPPROVED: false or no file: proceed to brainstorming轻量确认流程:
last_review_at: <ISO 8601>Orchestration 以 Socratic 方式展示 requirement,询问人类澄清性问题:
Orchestration 记录人类回答,生成 .autoteam/workspace/plan.md 草案:
# Plan: <requirement title>
## Goals
- [人类确认的高层目标]
## Scope
### In
- [确认要做的]
### Out
- [明确不做的]
## Features
### FEAT-001: <功能名称>
- scope: "<具体范围>"
- status: pending
- done_criteria:
- [ ] <DC-001>: <可验证的行为描述>
- [ ] <DC-002>: <可验证的行为描述>
### FEAT-002: <功能名称>
- scope: "<具体范围>"
- status: pending
- done_criteria:
- [ ] <DC-001>: <可验证的行为描述>
## Risks & Open Questions
- [未解决的疑问或风险]
## Verification
- [如何验证交付成功]
---
APPROVED: <true/false>
Approved-by: <人类确认>
Approved-at: <ISO 8601>
Last-review-at: <ISO 8601>
展示 plan.md 给人类:
人类反应:
APPROVED: true)→ 继续 Step 1最多无限追问(人类可以随时批准或继续修改)
打印:
[Step 0/11] ✓ Plan approved → plan.md[Brainstorm] 等待人类批准 plan.md....autoteam/workspace/plan.mdAPPROVED: false or file not exists: stop with [ERROR] Plan not approved<REQUIREMENT> is consistent with plan.md goals<REQUIREMENT>. If empty/whitespace/nonsensical: stop with [ERROR] Invalid requirement..autoteam/workspace/ exists and contains .yaml or .md files:
.autoteam/runs/<YYYYMMDD-HHMMSS>/ (copy, not move)[Archive] Previous run archived → .autoteam/runs/<timestamp>/.autoteam/workspace/, .autoteam/workspace/qa-reports/, .autoteam/workspace/discussion/.yaml, .md files in workspace (except templates starting with # TEMPLATE and except plan.md)[Step 2/11] ✓ Workspace initializeddocs/CODE-SUMMARY.md exists
last_commit_hash from header using regex ^Git Commit:\s*([a-f0-9]{40})$git log -1 --format=%H to get current commit hashgit status --porcelain to check if working tree is clean
working_tree_clean = (output is empty)last_commit_hash == current_commit_hash AND working_tree_clean:
[Step 2.5/11] ✓ Code summary fresh (skip)last_commit_hash != current_commit_hash OR NOT working_tree_clean):
[Step 2.5/11] ⚠️ Code summary stale — regenerating[Step 2.5/11] Generating code summaryGeneration steps (for both new and stale summaries):
docs/CODE-SUMMARY.md with this exact header:
**Generated:** <ISO 8601 timestamp>
**Git Commit:** <commit hash>
**Working Tree:** clean | dirty
[Step 2.5/11] ✓ Code summary (re)generated<REQUIREMENT> and the Product Planner definition (Section 5.1).autoteam/workspace/requirement-card.yaml[Step 3/11] ✓ Product Planner complete → requirement-card.yaml.autoteam/workspace/adr.md AND .autoteam/workspace/interface-contracts.yaml[Step 4/11] ✓ Architecture complete → adr.md + interface-contracts.yaml.autoteam/workspace/adr.md and .autoteam/workspace/requirement-card.yamlround-N-arch.md, Product Planner writes round-N-planner.mdAPPROVED appears, or after round 3 (Orchestration writes consensus.md with binding decision)[Step 5/11] Skip (no contradictions found)[Step 5/11] ✓ Architecture-Planner alignment verifiedBefore Implementation writes any code, Orchestration facilitates a contract between Implementation and QA Test:
.autoteam/workspace/sprint-contract.yaml:modules:
- id: MOD-001
name: "User Authentication"
done_criteria:
- id: DC-001
behavior: "POST /auth/login with valid credentials returns 200 + JWT token"
testable: true
- id: DC-002
behavior: "POST /auth/login with invalid password returns 401"
testable: true
not_included:
- "OAuth2 social login"
- "Password reset flow"
.autoteam/workspace/interface-contracts.yaml, and real code entrypoints/parameters before marking work done.autoteam/workspace/interface-contracts.yaml, and executable implementation evidence (not sprint-contract text alone)[Step 5.5/11] ✓ Sprint contract agreed → .autoteam/workspace/sprint-contract.yamlSkip conditions: Skip if: module_count == 1 AND total_acceptance_criteria <= 3 (too simple to need negotiation).
Process Features sequentially — each Feature: Implementation → QA verified → next Feature
features from requirement-card.yamlin_progress[Step 6/11] [FEAT-001] Implementation starting...done when done_criteria are met in code[Step 6/11] [FEAT-001] Implementation done → QA verifying...verified
[Step 6/11] [FEAT-001] ✓ QA verifiedverified:
[Step 6/11] ✓ All features verifiedBefore dispatching QA agents, run all available deterministic gates on generated code:
adr.md tech stackbaseline_violations count.| Gate | Name | Tool | Detection |
|---|---|---|---|
| A | Formatting + Lint | ruff/eslint/go-vet | Language detected |
| B | Import Boundaries | import-linter | pyproject.toml has [tool.importlinter] |
| C | Structural Rules | ast-grep | sgconfig.yml exists in project |
| D | Snapshot Testing | pytest --snapshot | __snapshots__/ directory exists |
| E | Golden Outputs | diff against committed goldens | tests/goldens/ directory exists |
| F | Numerical Equiv. | tolerance check (numpy allclose or float compare) | tests/numerical/ directory exists |
current_violations <= baseline_violations.autoteam/workspace/qa-reports/gate-report.md.autoteam/workspace/qa-reports/ratchet-baseline.txt:
baseline: <N>
current: <N>
delta: <+/- N>
status: PASS/FAIL
[Multi-Gate] ⚠️ No gate tools detected for {language}. Skipping deterministic gates. and proceed to Step 7[Step 6.5/11] Multi-Gate Check
Gate A (Lint): ✅ PASS | ⏭️ SKIPPED | ❌ FAIL
Gate B (Imports): ✅ PASS | ⏭️ SKIPPED (no import-linter config) | ❌ FAIL
Gate C (AST Rules): ✅ PASS | ⏭️ SKIPPED (no sgconfig.yml) | ❌ FAIL
Gate D (Snapshots): ✅ PASS | ⏭️ SKIPPED (no __snapshots__/) | ❌ FAIL
Gate E (Goldens): ✅ PASS | ⏭️ SKIPPED (no tests/goldens/) | ❌ FAIL
Gate F (Numerical): ✅ PASS | ⏭️ SKIPPED (no tests/numerical/) | ❌ FAIL
Ratchet: OFF | ON (baseline: N, current: N, delta: N)
Result: N/N active gates PASS
Note: QA Test already completed per-Feature in Step 6. This step runs overall Security and Quality checks.
Dispatch two QA subagents in sequence:
security-report.mdquality-report.md[Step 7/11] ✓ QA Pipeline complete → 2 reports written.autoteam/workspace/qa-reports/aggregated-report.mdALL_CLEAR: true only if zero CRITICAL findings AND overall quality score ≥ 3.0/5## Council Tally
QA Security: ACCEPT (HIGH) | QA Quality: ACCEPT (MEDIUM)
Result: 2/2 ACCEPT → PASS | <2 ACCEPT → FAIL
ALL_CLEAR: true only if: Council = 2/2 ACCEPT AND zero CRITICAL findings AND overall quality score ≥ 3.0/5## Quality Scores (Round N)
Security Posture: X/5 | Code Quality: X/5 | Design Coherence: X/5 | Test Coverage: X/5 (sourced from per-Feature QA in Step 6) | Functionality: X/5
Overall: X.X/5 (average)
[REGRESSION] in aggregated report.autoteam/workspace/fix-instructions.md listing every CRITICAL as structured fix task:fixes:
- id: SEC-001
file: src/auth.py
function: verify_token
lines: "45-67"
issue: "SQL injection via unsanitized input"
fix: "Use parameterized queries"
[Step 8/11] ✓ QA aggregated → aggregated-report.md + fix-instructions.mdALL_CLEAR=true (=2/2 council ACCEPT + zero CRITICAL + score ≥ 3.0/5) → go to Step 10
ALL_CLEAR=false →
escalation.md[FAILED]docs/README.md (minimum 10 lines), docs/ARCHITECTURE.md, and AGENTS.md (project root); if the project exposes API endpoints, also wait for docs/API.mddocs/README.md has <10 lines: retry once with model sonnet[Step 10/11] ✓ Documentation complete → docs/ writtenAfter all code and docs are written:
autoteam/<YYYYMMDD>-<slug> (slug = first 3 words of requirement, kebab-case)
1.5. Generate work chunk evidence — write .autoteam/workspace/chunk.md:
# Work Chunk: <requirement title from requirement-card.yaml>
## Intent
- <one-line description of what behavior/structure changed>
## Preconditions
- Branch base: <base branch name>
- Harness status before: <PASS/FAIL/N/A (from pre-existing gate check)>
## Features Completed
| Feature | Status | Verified-by | Verified-at |
|---------|--------|-------------|-------------|
| FEAT-001 | verified | QA Test | <ISO 8601> |
| FEAT-002 | verified | QA Test | <ISO 8601> |
## Evidence
- Multi-Gate: <N/N active gates PASS / FAIL at gate X / all SKIPPED>
- QA Council: <2/2 ACCEPT — scores: security X.X, quality X.X>
- QA Rounds: <N round(s) to pass>
- Files created: <count>
- Files modified: <count>
- Test files: <count>
## Rollback
git revert <commit-sha> # fill in after commit
requirement-card.yaml (features status), qa-reports/aggregated-report.md (QA results).autoteam/workspace/chunk.md (exclude rest of .autoteam/workspace/, .autoteam/runs/)feat: <one-line requirement summary>
AutoTeam pipeline — {N} features verified, QA passed
Features: FEAT-001, FEAT-002, ...
gh pr create --draft --title "feat: <title>" --body "AutoTeam pipeline — see chunk.md for evidence"
[Step 10.5/11] ✓ PR created locally on branch autoteam/<branch-name>🔀 Run 'git push' to push branch and submit PRSkip conditions: git not available, not a git repo, or user requirement says "don't commit"
Print success or failure (see Section 6).
When dispatching any subagent, provide ALL context inline:
## Your Role
<paste the full agent definition from Section 5.X>
## Your Task
<specific task description>
## Input Files
Read these files for your inputs:
- <list .autoteam/workspace/ file paths>
## Required Output
Write to: <exact file path(s)>
Format: <expected schema>
| Agent | Model |
|---|---|
| Product Planner | sonnet |
| Architecture | opus |
| Implementation | sonnet |
| QA Security | sonnet |
| QA Quality | sonnet |
| QA Test | sonnet |
| Documentation | haiku |
Parallel dispatch: When pipeline allows it (independent Implementation modules, QA agents if desired), dispatch multiple subagents simultaneously.
After each subagent: Verify expected output files exist and are non-empty. Missing → retry once. Second failure → go to failure output.
Role: Transform approved plan.md into structured requirement-card.yaml.
Input: .autoteam/workspace/plan.md (APPROVED: true), docs/CODE-SUMMARY.md (if exists)
Output: .autoteam/workspace/requirement-card.yaml
Input Files:
.autoteam/workspace/plan.md (must have APPROVED: true — this is the human-approved plan)docs/CODE-SUMMARY.md (existing codebase context, if it exists — skip if not present)Process:
plan.md — extract Goals, Scope (In/Out), Features (with done_criteria), and Verificationdocs/CODE-SUMMARY.md if it exists to understand existing project structure (brownfield scenarios)acceptance_criteria entries — each must be independently testable, specific, behavioral (observable outcomes, not implementation details)out_of_scope in requirement-card.yamltech_constraints: []features in requirement-card.yaml — these are the execution unitsrequirement-card.yaml:requirement: |
[faithful paraphrase of user requirement]
acceptance_criteria:
- id: AC-001
description: "[testable criterion]"
testable: true
out_of_scope:
- "[not required item]"
tech_constraints:
- "[user-stated constraint]"
features: # Derived from plan.md Features
- id: FEAT-001
name: "<feature name>"
scope: "<scope description>"
status: pending
done_criteria:
- id: DC-001
description: "<behavior description>"
testable: true
- id: DC-002
description: "<behavior description>"
testable: true
- id: FEAT-002
name: "<feature name>"
scope: "<scope description>"
status: pending
done_criteria:
- id: DC-001
description: "<behavior description>"
testable: true
modules: [] # Architecture fills this in
Rules:
Discussion Node 1 (review mode):
round-N-arch.md, re-read acceptance criteriaAPPROVED on its own line.autoteam/workspace/discussion/round-N-planner.mdRole: Design tech architecture, select stack, define interface contracts.
Input: .autoteam/workspace/requirement-card.yaml, docs/CODE-SUMMARY.md (if exists)
Output: .autoteam/workspace/adr.md, .autoteam/workspace/interface-contracts.yaml, updated modules in requirement-card.yaml
Input Files:
.autoteam/workspace/requirement-card.yamldocs/CODE-SUMMARY.md (if exists; existing codebase context — for understanding current project state in brownfield scenarios)Process:
id, description, depends_on, output_filesadr.md (Context, Tech Stack table, Module Breakdown, Key Decisions with rationale, Risks, Out of Scope)interface-contracts.yaml:api_endpoints:
- id: EP-001
method: POST
path: /auth/login
description: "Authenticate user, return JWT"
authenticated: false
request:
content_type: application/json
body:
username: {type: string, required: true, max_length: 64}
password: {type: string, required: true, min_length: 8}
response:
success: {status: 200, body: {token: {type: string}, expires_at: {type: string, format: ISO 8601}}}
errors:
- {status: 401, condition: "Invalid credentials", body: {error: "Invalid credentials"}}
data_models:
- id: DM-001
name: User
fields:
- {name: id, type: integer, primary_key: true, auto_increment: true}
- {name: username, type: string, max_length: 64, unique: true, nullable: false}
cli_commands: []
functions: []
modules sectionPrinciples: YAGNI, Testability (every interface testable in isolation), Security by default, No premature optimization
Discussion Node 1 (discussion mode):
round-N-planner.md.autoteam/workspace/discussion/round-N-arch.mdRole: Write production code implementing the architecture exactly. No design decisions.
Input: .autoteam/workspace/adr.md, .autoteam/workspace/interface-contracts.yaml, .autoteam/workspace/requirement-card.yaml; in FIX MODE also .autoteam/workspace/fix-instructions.md
Output: Project source code files at paths from module output_files
.autoteam/workspace/requirement-card.yaml — list acceptance criteria IDs.autoteam/workspace/adr.md — confirm tech stack and module list.autoteam/workspace/interface-contracts.yaml — list all endpoints/commands.autoteam/workspace/sprint-contract.yaml — list done_criteria IDs for assigned modulefix-instructions.md and list assigned fix IDsMode: [NORMAL|FIX] | Module: [name] | Criteria: [N] | Done-Criteria: [N] | Fixes: [IDs or none]output_files list in requirement-card.yaml exist and no silently added feature extends beyond the agreed scope.autoteam/workspace/sprint-contract.yaml, verify the code satisfies the stated behavior through the actual contract/entrypoint it refers to; if a DC cannot be mapped cleanly, write escalation.mdescalation.md, do NOT add it silentlyRead fix-instructions.md completely before touching any code.
Fixed FIX-001: [one-line description]escalation.md insteadWrite .autoteam/workspace/escalation.md ONLY when:
Format:
ESCALATION: [FIX-ID]
Root cause: architectural issue in [document section]
Proposed change: [what needs to change]
Reason scope is insufficient: [explanation]
Role: Scan all generated code for security vulnerabilities. Report only — do not fix.
Input: All project source files (excluding .autoteam/)
Output: .autoteam/workspace/qa-reports/security-report.md
Vulnerability Categories:
NOT in scope: Code quality, test coverage, performance, formatting
Severity:
Report Format:
# Security QA Report — Round {N}
**Scanned files:** [list]
**Total findings:** CRITICAL: N | WARNING: N | INFO: N
## CRITICAL
| ID | File | Location | Lines | Issue | Fix |
|----|------|----------|-------|-------|-----|
| SEC-001 | src/auth.py | verify_token | 45-52 | JWT verify disabled | Fix: Remove options={"verify_signature": False}, use default verification |
## WARNING
[same table format]
## INFO
[same table format]
## ALL_CLEAR: [true only if zero CRITICAL]
## Council Vote
vote: ACCEPT | REJECT
rationale: <one sentence summarizing security posture>
confidence: HIGH | MEDIUM | LOW
Scores: security_posture: X/5 (1=critical exploits, 5=defense in depth) with 1-2 sentence rationale.
Role: Review code quality. Report only — do not fix.
Input: All project source files (excluding .autoteam/)
Output: .autoteam/workspace/qa-reports/quality-report.md
Golden Rules (always CRITICAL — mechanical check, no judgment needed):
from module import *)Quality Categories:
NOT in scope: Security, test coverage, performance (unless obvious O(n²) vs O(n)), formatting style
Report Format: Same table structure as Security, with Fix column. ALL_CLEAR: true only if zero CRITICAL.
Scores: Include at end of report: code_quality: X/5 (1=unmaintainable, 5=exemplary), design_coherence: X/5 (1=random patterns, 5=unified architecture) with 1-2 sentence rationale per score.
Council Vote: Append to report:
## Council Vote
vote: ACCEPT | REJECT
rationale: <one sentence summarizing code quality posture>
confidence: HIGH | MEDIUM | LOW
Role: Verify test coverage maps to acceptance criteria. Run tests. Report gaps.
Input: All project files + .autoteam/workspace/requirement-card.yaml + .autoteam/workspace/interface-contracts.yaml + .autoteam/workspace/sprint-contract.yaml
Output: .autoteam/workspace/qa-reports/test-report.md
Process:
.autoteam/workspace/requirement-card.yaml.autoteam/workspace/interface-contracts.yaml — identify the real endpoint/command/function/field contracts for the assigned Feature.autoteam/workspace/sprint-contract.yaml — load done_criteria per module as additional test targets.autoteam/workspace/interface-contracts.yaml or executable implementation evidence, report contract drift or ambiguityReport Format:
# Test QA Report — Round {N}
**Acceptance criteria checked:** N | **Covered:** N | **Uncovered:** N
## Test Run Results
Command: `pytest tests/ -v`
Exit code: 0/1
Passing: N | Failing: N
## CRITICAL
| ID | File | Location | Lines | Issue | Fix |
[table with Fix column]
## Acceptance Criteria Coverage Map
| Criterion | Description | Status | Test(s) |
| AC-001 | ... | COVERED/UNCOVERED/FAILING | test_name |
## Sprint Contract Verification
| Criterion | Behavior | Contract / Entrypoint | Status | Evidence |
| DC-001 | POST /auth/login returns 200 + JWT | POST /auth/login | PASS/FAIL/DRIFT (`DRIFT` = contract text cannot be cleanly mapped to `interface-contracts.yaml` or implementation evidence) | test_login_success |
## Scores
test_coverage: X/5
functionality: X/5
Rationale: [1-2 sentences per score]
## ALL_CLEAR: [true only if zero CRITICAL]
## Council Vote
vote: ACCEPT | REJECT
rationale: <one sentence summarizing test coverage and functionality>
confidence: HIGH | MEDIUM | LOW
If the project is a web application (has api_endpoints or serves HTML):
npm run dev, python -m flask run, uvicorn, go run ., etc.)## Interactive Evaluation
| ID | Page/Route | Action | Expected | Actual | Status |
| INT-001 | /login | Submit valid credentials | Redirect to /dashboard | Redirected correctly | PASS |
| INT-002 | /users | Click delete button | Confirmation dialog | User deleted without dialog | FAIL |
Skip conditions: Not a web app, no dev server command detectable, or Playwright/browser tools not available (which playwright or npx playwright --version fails). Print: [QA Test] ⚠️ Interactive evaluation skipped: {reason}
Role: Write clear, accurate documentation for the delivered project.
Input: All project code + .autoteam/workspace/requirement-card.yaml + .autoteam/workspace/adr.md + .autoteam/workspace/interface-contracts.yaml
Output: docs/README.md, docs/ARCHITECTURE.md, docs/API.md (if API endpoints exist), AGENTS.md (project root)
docs/README.md (required sections):
docs/API.md (if api_endpoints exist):
docs/ARCHITECTURE.md:
Rules:
AGENTS.md (always generated, project root):
just check, npm test, pytest, make test — infer from project files), lint command, test commandadr.md (architectural decisions, import boundaries)Harness command detection order: justfile → just check | package.json with test script → npm test | pyproject.toml → pytest | Makefile → make check | fallback → echo "Configure your check command"
[Step 11/11] ✓ AutoTeam pipeline complete
📋 Requirement: <title from requirement-card.yaml>
📦 Features: FEAT-001 (verified) | FEAT-002 (verified) | ...
📐 Architecture: <tech stack summary — one line>
📁 Output:
- [list every file created or modified]
📊 QA: 2/2 ACCEPT (Security + Quality)
🔀 Branch: autoteam/<name> (PR created locally — run 'git push' to submit)
Status: ✅ SUCCESS
[AutoTeam] ❌ Pipeline Failed at: <stage name>
Reason: <specific error>
Partial output: <list of files created before failure, or "none">
Stop. Do not attempt further stages.
Adapted from Anthropic's harness design principle: "find the simplest solution possible, and only increase complexity when needed." Every component encodes an assumption about what the model can't do on its own — stress test those assumptions.
| Component | Assumption it encodes | When to consider dropping |
|---|---|---|
| Discussion Node 1 | Architecture may miss acceptance criteria | Skip rate >90% across runs |
| Sprint Contract | Implementation may build the wrong thing | QA round-1 pass rate >90% without contract |
| Multi-Gate Check | LLM misses mechanical violations | Never — deterministic checks are always cheaper |
| Ratchet mode | Pre-existing code has violations | Drop when project reaches zero baseline violations |
| 3 separate QA agents | Specialized focus catches more | If one agent consistently finds zero issues |
| FIX MODE minimal-change rule | Implementation over-refactors during fixes | If Implementation shows discipline without constraint |
When a new model version is available:
npx claudepluginhub nangenx/autoteam --plugin autoteamCoordinates specialist agents through a complete development cycle: requirements, planning, implementation, refactoring, QA, and documentation. Use for systematic feature development with quality checks.
Orchestrates an adversarial plan-implement-review pipeline by spawning agents with separate context windows. Use after intake skills produce a starting document.
Auto-loop execution workflow with quality gates. Use when starting any non-trivial implementation task. Provides automatic task decomposition, code implementation, testing (L1-L4), and iterative quality gates until completion. Invoke with /autoworker.