design-review-cycle | llm-gateway

Design Review Cycle

Submit design documents, implementation plans, or specifications for peer review through the LLM gateway. Track review iterations until approval.

Dispatch Defaults

Apply these on every dispatch unless the caller has explicitly overridden a rule in the current turn:

Omit model — let the gateway use its configured default per CLI.
approvalStrategy:"mcp_managed" is the skill dispatch default (the gateway schema default is "legacy"). For Codex, also pass fullAuto:true when it must read files or run commands.
Set no wall-clock timeout; poll every 60 s. idleTimeoutMs is a separate safeguard that triggers only when a job produces no output.
Iterate until unconditional APPROVED (review dispatches only). Every review prompt must end with "End with APPROVED or NOT APPROVED with findings." Loop: dispatch → parse verdict → on NOT APPROVED or conditional approval, revise and re-submit → repeat. Escalate to the user after 3 rounds. This rule does not apply to pure implementation or non-review analysis dispatches.
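
The iterate-until-approved rule can be sketched as a small loop. This is a minimal sketch, not the gateway's implementation: runReviewCycle, Dispatch, Revise, and CycleResult are hypothetical names, and the dispatch callback stands in for a real gateway call plus verdict parsing. It is synchronous here only to keep the control flow readable.

```typescript
type Verdict = "approved" | "not_approved" | "conditional";

// Hypothetical callbacks. A real dispatch would call the gateway (for example
// codex_request) and parse the reply; a real revise would update the document.
type Dispatch = (round: number) => Verdict;
type Revise = (round: number) => void;

interface CycleResult {
  approved: boolean;
  rounds: number;    // rounds actually dispatched
  escalate: boolean; // true when the round limit was hit without approval
}

function runReviewCycle(dispatch: Dispatch, revise: Revise, maxRounds: number = 3): CycleResult {
  for (let round = 1; round <= maxRounds; round++) {
    const verdict = dispatch(round);
    // Only an unconditional APPROVED ends the loop.
    if (verdict === "approved") {
      return { approved: true, rounds: round, escalate: false };
    }
    // NOT APPROVED and conditional approval are treated alike: revise, then re-submit.
    if (round < maxRounds) {
      revise(round);
    }
  }
  // Round limit reached without approval: hand the decision back to the user.
  return { approved: false, rounds: maxRounds, escalate: true };
}
```

Treating a conditional approval the same as NOT APPROVED keeps the exit condition to a single case, which matches the "unconditional APPROVED" wording of the rule.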

When to Use

Before implementing a complex feature (review the plan first)
After writing a specification or design document
When a development process requires peer review before code
For architecture decisions that affect multiple components

Review Workflow

Step 1: Prepare the Review Request

Write a clear review request that includes:

What document to review (file path)
What kind of review (design, spec, plan, architecture)
Specific concerns or focus areas
Iteration number (for tracking)
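
The four items above map naturally onto a small request object. The sketch below assumes a hypothetical ReviewRequest shape and buildReviewPrompt helper; neither is part of the gateway, and the prompt wording is only one reasonable choice.

```typescript
// Hypothetical shape for the four items a review request should carry.
interface ReviewRequest {
  path: string;                                      // file path of the document under review
  kind: "design" | "spec" | "plan" | "architecture"; // what kind of review
  concerns: string[];                                // specific concerns or focus areas
  iteration: number;                                 // review round, starting at 1
}

function buildReviewPrompt(req: ReviewRequest): string {
  const concerns = req.concerns.map((c) => `- ${c}`).join("\n");
  return [
    `Review the ${req.kind} document at ${req.path} (round ${req.iteration}).`,
    "",
    "Focus on:",
    concerns,
    "",
    // Every review prompt must close with the verdict instruction.
    "End with APPROVED or NOT APPROVED with findings.",
  ].join("\n");
}
```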

Step 2: Submit to Reviewer(s)

Single reviewer (Codex — most common):

codex_request({
  prompt: "Review the design document at [path]. This is a [spec/plan/design] for [feature].\n\nReview for:\n- Completeness (are all requirements addressed?)\n- Correctness (will this approach work?)\n- Feasibility (can this be implemented as described?)\n- Risks (what could go wrong?)\n- Missing considerations\n\nEnd with APPROVED or NOT APPROVED with specific, actionable findings.",
  fullAuto: true,
  approvalStrategy: "mcp_managed",
  correlationId: "design-review-r1"
})

Dual reviewer (Codex + Gemini for security-sensitive designs):

codex_request_async({
  prompt: "Review the design document at [path]... End with APPROVED or NOT APPROVED with findings.",
  fullAuto: true,
  approvalStrategy: "mcp_managed",
  correlationId: "design-review-r1-codex"
})
gemini_request_async({
  prompt: "Review the design document at [path]. Focus on security implications, attack surfaces, data flow risks, and failure modes... End with APPROVED or NOT APPROVED with findings.",
  approvalStrategy: "mcp_managed",
  correlationId: "design-review-r1-gemini"
})

Triple reviewer (add Grok for independent diversity on high-stakes designs):

grok_request_async({
  prompt: "Independent review of the design document at [path]. Flag completeness gaps, feasibility concerns, and assumptions the other reviewers may accept too easily. End with APPROVED or NOT APPROVED with findings.",
  approvalStrategy: "mcp_managed",
  correlationId: "design-review-r1-grok"
})
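
When two or three reviewers run in parallel, their verdicts have to be merged into one decision for the round. The source does not say how, so the sketch below makes an explicit assumption: the round passes only when every reviewer returns an unconditional APPROVED. combineVerdicts is a hypothetical helper, not a gateway call.

```typescript
type Verdict = "approved" | "not_approved" | "conditional";

// Assumption (not specified by the gateway): a round passes only when every
// reviewer approves unconditionally; any other verdict blocks the round.
function combineVerdicts(verdicts: Record<string, Verdict>): { approved: boolean; blocking: string[] } {
  const reviewers = Object.keys(verdicts);
  const blocking = reviewers.filter((name) => verdicts[name] !== "approved");
  // An empty verdict set is never treated as approval.
  return { approved: reviewers.length > 0 && blocking.length === 0, blocking };
}
```

The blocking list names the reviewers whose findings still need to be addressed before the next round.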

Step 3: Process Feedback

Parse the review response for:

APPROVED → Proceed to implementation
NOT APPROVED with findings → Address each finding
Conditional approval ("approved if X is addressed") → Address X, re-submit
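
The three outcomes can be told apart with a small text classifier. This is a heuristic sketch under stated assumptions: parseVerdict is a hypothetical helper, it assumes the reviewer put its verdict in the last few lines as the prompt requested, and the conditional-approval patterns are illustrative rather than exhaustive.

```typescript
type Verdict = "approved" | "not_approved" | "conditional";

function parseVerdict(review: string): Verdict {
  // Assumption: the verdict sits near the end of the reply, so inspect only the tail.
  const tail = review.trim().split("\n").slice(-5).join(" ").toUpperCase();
  // Check NOT APPROVED first, because it also contains the word APPROVED.
  if (/\bNOT\s+APPROVED\b/.test(tail)) return "not_approved";
  // Heuristic patterns for "approved if X is addressed" style replies.
  if (/\bAPPROVED\b\s*[,:-]?\s*(IF|ONCE|PROVIDED|PENDING|WITH\s+CONDITIONS?)\b/.test(tail)) return "conditional";
  if (/\bCONDITIONAL(LY)?\s+APPROV/.test(tail)) return "conditional";
  if (/\bAPPROVED\b/.test(tail)) return "approved";
  // No recognizable verdict: treat as not approved rather than guess.
  return "not_approved";
}
```

Falling back to not_approved when no verdict is found is a deliberately conservative choice: an unparseable reply triggers another round instead of an accidental approval.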

For each finding:

Update the design document to address it
Or explicitly document why the finding doesn't apply (with justification)

Step 4: Re-submit

codex_request({
  prompt: "Re-review the design at [path] after addressing round 1 feedback:\n\n1. [Finding] — Addressed by: [what changed]\n2. [Finding] — Addressed by: [what changed]\n3. [Finding] — Not applicable because: [justification]\n\nReview the updated document. End with APPROVED or NOT APPROVED with findings.",
  fullAuto: true,
  approvalStrategy: "mcp_managed",
  correlationId: "design-review-r2"
})
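
The re-review prompt can be generated from a record of how each finding was handled, which also guards against re-submitting with a finding left unresolved. Resolution and buildReReviewPrompt are hypothetical names used for illustration only.

```typescript
// Hypothetical record of how one finding from the previous round was handled.
interface Resolution {
  finding: string;
  addressedBy?: string;   // what changed in the document
  notApplicable?: string; // justification when the finding does not apply
}

function buildReReviewPrompt(path: string, previousRound: number, resolutions: Resolution[]): string {
  const lines = resolutions.map((r, i) => {
    if (r.addressedBy) return `${i + 1}. ${r.finding} - Addressed by: ${r.addressedBy}`;
    if (r.notApplicable) return `${i + 1}. ${r.finding} - Not applicable because: ${r.notApplicable}`;
    // Every finding needs either a change or a justification before re-submitting.
    throw new Error(`Finding ${i + 1} has no resolution: ${r.finding}`);
  });
  const header = [`Re-review the design at ${path} after addressing round ${previousRound} feedback:`, ""];
  const footer = ["", "Review the updated document. End with APPROVED or NOT APPROVED with findings."];
  return header.concat(lines, footer).join("\n");
}
```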

Step 5: Track Iterations

Use correlation IDs to track rounds:

design-review-r1 → Initial review
design-review-r2 → After first round of fixes
design-review-r3 → After second round of fixes (rare)

If the design is still not approved after 3 rounds, it likely needs a fundamental rethink; escalate to the user.
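
The ID scheme and the escalation rule are simple enough to encode directly. In this sketch correlationId and shouldEscalate are hypothetical helpers; the optional reviewer suffix follows the multi-reviewer IDs used in Step 2.

```typescript
const MAX_ROUNDS = 3;

// Build the correlation ID for a round. The reviewer suffix is optional and
// only needed when several reviewers run in the same round.
function correlationId(round: number, reviewer?: "codex" | "gemini" | "grok"): string {
  if (round < 1 || Math.floor(round) !== round) {
    throw new Error(`round must be a positive integer, got ${round}`);
  }
  const base = `design-review-r${round}`;
  return reviewer ? `${base}-${reviewer}` : base;
}

// Escalate to the user once the round limit is reached without approval.
function shouldEscalate(round: number, approved: boolean): boolean {
  return !approved && round >= MAX_ROUNDS;
}
```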

Document Types and Review Focus

Document Type	Reviewer(s)	Focus Areas
Implementation plan	Codex	Feasibility, task ordering, test strategy
API specification	Codex + Gemini	Completeness, security, error handling
Architecture decision	Codex + Grok	Trade-offs, scalability, maintenance burden, independent perspective
Security design	Gemini	Attack surfaces, threat model, mitigations
Data model	Codex	Normalization, query patterns, migration path
High-stakes / hard-to-reverse design	Codex + Gemini + Grok	Use all three when the design locks in a contract or migration path
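
If reviewer selection is automated, the table can be expressed as a lookup. The key names and the reviewersFor helper below are hypothetical; only the reviewer assignments come from the table.

```typescript
type Reviewer = "codex" | "gemini" | "grok";

// Hypothetical keys, one per row of the table.
type DocumentType =
  | "implementation-plan"
  | "api-specification"
  | "architecture-decision"
  | "security-design"
  | "data-model"
  | "high-stakes";

const REVIEWERS: Record<DocumentType, Reviewer[]> = {
  "implementation-plan": ["codex"],
  "api-specification": ["codex", "gemini"],
  "architecture-decision": ["codex", "grok"],
  "security-design": ["gemini"],
  "data-model": ["codex"],
  "high-stakes": ["codex", "gemini", "grok"],
};

// Return a copy so callers cannot mutate the shared table.
function reviewersFor(type: DocumentType): Reviewer[] {
  return REVIEWERS[type].slice();
}
```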

Review Request Templates

For Implementation Plans

Review the implementation plan at [path].

Context: This plan implements [feature] for [project].

Check:
- Are tasks ordered correctly (dependencies respected)?
- Is the test strategy adequate?
- Are there missing tasks or gaps?
- Is the scope appropriate (not too broad, not too narrow)?
- Are file paths and function names consistent throughout?

End with APPROVED or NOT APPROVED with findings.

For API Specifications

Review the API specification at [path].

Context: This API serves [purpose] for [consumers].

Check:
- Are all endpoints documented with request/response schemas?
- Are error cases handled (400, 401, 403, 404, 500)?
- Is authentication/authorization specified?
- Are there rate limiting or pagination considerations?
- Is the naming consistent and RESTful?

End with APPROVED or NOT APPROVED with findings.

For Architecture Decisions

Review the architecture decision at [path].

Context: This decision affects [components] in [project].

Check:
- Are alternatives considered and trade-offs documented?
- Is the chosen approach justified with concrete reasons?
- What are the long-term maintenance implications?
- Are there migration or rollback considerations?
- Does this align with existing system architecture?

End with APPROVED or NOT APPROVED with findings.

Integration with sqry Review Process

Many sqry review documents follow this naming convention:

01_SPEC.md → Specification
02_DESIGN.md → Design document
02_DESIGN_review_r1_request.md → Review request (round 1)
02_DESIGN_review_r2_request.md → Review request (round 2)

The skill works with any document structure — the naming convention is optional.
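
For projects that do follow the convention, the request file name can be derived from the document name and the round. reviewRequestFileName and parseReviewRound are hypothetical helpers written against the four example names above, not part of sqry or the gateway.

```typescript
// Derive the review request file name for a document and round.
function reviewRequestFileName(documentFile: string, round: number): string {
  const stem = documentFile.replace(/\.md$/, "");
  return `${stem}_review_r${round}_request.md`;
}

// Recover the round number from a request file name, or null if it does not match.
function parseReviewRound(fileName: string): number | null {
  const match = /_review_r(\d+)_request\.md$/.exec(fileName);
  return match ? parseInt(match[1], 10) : null;
}
```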

Tips

Always provide file paths, not inline content. Let the reviewer read the document.
Include context about the project and feature — reviewers don't have your conversation history.
Use correlationId for every review round to enable tracing.
For large documents, tell the reviewer which sections changed between rounds.
Always pass fullAuto: true and approvalStrategy: "mcp_managed" for Codex reviews — fullAuto:true gives Codex sandboxed file/shell access, while mcp_managed records and gates the request. If Codex still can't access something specific, paste the relevant sections inline.
Design reviews are cheaper than code reviews — catch issues before writing code.
For multi-round reviews of the same design, pass resumeLatest:true to Codex on round 2+ to carry the reviewer's prior context (or sessionId:<UUID> for a specific Codex session). Note: --full-auto is dropped on resume — the original session's approval policy is inherited.
Deferred review jobs are durable (default 30-day retention, LLM_GATEWAY_JOB_RETENTION_DAYS). If polling times out mid-round, re-issue the same call (auto-dedup reattaches to the running job) or fetch by jobId later. Use forceRefresh:true only when the design document has actually been updated.
For multi-round design reviews where the design document itself doesn't move much between rounds, switch from prompt to the structured promptParts field: put the stable review brief in system, the design content (or path-to-design) in context, and the round-specific question in task. prompt and promptParts are mutually exclusive. Stable system/context across rounds keeps the prefix-hash identical, raising implicit cache hit rate at the provider. After a round, read cache-state://prefix/{hash} to confirm reviewers actually shared the prefix.
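
The promptParts tip can be illustrated with a helper that keeps system and context fixed and varies only task. The field names promptParts, system, context, and task come from the tip above; the surrounding request shape, the roundDispatch helper, and the brief wording are assumptions made for this sketch.

```typescript
interface PromptParts {
  system: string;  // stable review brief
  context: string; // design content or path to the design
  task: string;    // round-specific question
}

// Assumed request shape; prompt is omitted because prompt and promptParts are mutually exclusive.
interface ReviewDispatch {
  promptParts: PromptParts;
  fullAuto: boolean;
  approvalStrategy: "mcp_managed";
  correlationId: string;
}

// Hypothetical brief, kept byte-identical across rounds so the prefix stays stable.
const STABLE_BRIEF =
  "You are reviewing a design document. End with APPROVED or NOT APPROVED with findings.";

function roundDispatch(designPath: string, round: number, task: string): ReviewDispatch {
  return {
    promptParts: {
      system: STABLE_BRIEF,                      // identical every round
      context: `Design document: ${designPath}`, // identical while the path is unchanged
      task,                                      // the only part that varies per round
    },
    fullAuto: true,
    approvalStrategy: "mcp_managed",
    correlationId: `design-review-r${round}`,
  };
}
```

Any edit to system or context, even whitespace, would change the prefix, so round-specific detail belongs in task only.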