From edify
Review production artifacts after plan execution. Triggers on "review deliverables", "deliverable review", "artifact review", or after completing plan execution. Two-layer review (delegated per-file + interactive cross-project) producing a severity-classified report grounded in ISO 25010 / IEEE 1012.
Slash command: /edify:deliverable-review
Review production artifacts against a design specification to identify conformance gaps, correctness issues, coverage holes, and excess artifacts.
Before starting, gather:
- The design specification (plans/<plan>/design.md)
- The plan's reports directory (plans/<plan>/reports/)
- The deliverable inventory: run plugin/bin/deliverable-inventory.py (no arguments, no pipes, no redirect). It outputs markdown tables: per-file diff stats and a summary by type.

| Type | Pattern | Review axes |
|---|---|---|
| Code | *.py source | Universal + robustness, modularity, testability, idempotency, error signaling |
| Test | test_*.py | Universal + specificity, coverage, independence |
| Agentic prose | SKILL.md, agent defs | Universal + actionability, constraint precision, determinism, scope boundaries |
| Human docs | Fragments, references | Universal + accuracy, consistency, completeness, usability |
| Configuration | Justfile, pyproject | Universal only |
Universal axes (all types): conformance, functional correctness, functional completeness, vacuity, excess.
Full axis definitions: agents/decisions/deliverable-review.md
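The type table can be read as an ordered set of filename rules. The following is a minimal sketch of that reading, not the skill's actual classifier: the `classify` function, the `pyproject.toml` and lowercase `justfile` names, and the choice to treat every unmatched file as human docs are all assumptions made for illustration. Agent definitions have no stated filename pattern in the table, so this sketch does not detect them.

```python
from fnmatch import fnmatchcase
from pathlib import PurePosixPath

# Ordered rules: the first match wins, so test_*.py is checked before *.py.
# The configuration file names below are assumed; the table only says
# "Justfile, pyproject".
RULES = [
    ("Test", ["test_*.py"]),
    ("Code", ["*.py"]),
    ("Agentic prose", ["SKILL.md"]),
    ("Configuration", ["Justfile", "justfile", "pyproject.toml"]),
]

# Assumption: anything unmatched is treated as human documentation.
FALLBACK = "Human docs"


def classify(path: str) -> str:
    """Return the deliverable type for a path, based on its file name only."""
    name = PurePosixPath(path).name
    for kind, patterns in RULES:
        if any(fnmatchcase(name, pattern) for pattern in patterns):
            return kind
    return FALLBACK


if __name__ == "__main__":
    for example in ["tests/test_inventory.py", "plugin/bin/deliverable-inventory.py"]:
        print(example, "->", classify(example))
```

Rule order matters here: a test file also matches `*.py`, so the more specific pattern has to come first.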
Compare inventory against design In-scope items:
Two-layer review. Layer 1 is optional (scales to large deliverable sets). Layer 2 is mandatory (catches what delegation cannot).
Gate on total deliverable lines from Phase 1 inventory:
| Total lines | Strategy |
|---|---|
| < 500 | Skip Layer 1 — Layer 2 handles full review |
| 500–2000 | Two opus agents: code+test, prose+config |
| > 2000 | Three opus agents: code, test, prose+config |
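As a rough illustration of the gate above, the thresholds can be written as a small function. The name `layer1_strategy` and the partition labels are hypothetical; only the line thresholds and groupings come from the table. Note that exactly 2000 lines falls in the middle band.

```python
def layer1_strategy(total_lines: int) -> list[str]:
    """Return the Layer 1 agent partitions for a given deliverable line count.

    An empty list means Layer 1 is skipped and Layer 2 does the full review.
    """
    if total_lines < 500:
        return []
    if total_lines <= 2000:
        return ["code+test", "prose+config"]
    return ["code", "test", "prose+config"]


if __name__ == "__main__":
    for lines in (120, 500, 2000, 2001):
        print(lines, layer1_strategy(lines))
```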
When Layer 1 applies, partition deliverables by type and launch parallel review agents. Each agent receives:
- plans/<plan>/reports/deliverable-review-<type>.md

Use run_in_background=true for parallel agents.
Always runs in main session with full cross-project context.
Scope depends on whether Layer 1 ran:
| Layer 1 | Layer 2 scope |
|---|---|
| Skipped (< 500 lines) | Full review: per-file axes AND cross-cutting checks |
| Ran | Cross-cutting focus: issues delegation cannot catch |
Cross-cutting checks (always):
Per-file review (when Layer 1 skipped):
Why interactive is mandatory: Delegated agents lack cross-project context — fragment conventions, other skills' configurations, inter-file consistency. Layer 2 reads deliverables directly against the design spec, not delegation reports. The two layers are independent.
Write consolidated report to plans/<plan>/reports/deliverable-review.md.
Report structure:
# Deliverable Review: <plan-name>
**Date:** <date>
**Methodology:** agents/decisions/deliverable-review.md
## Inventory
[Table: type, file, lines]
[Design conformance summary]
## Critical Findings
[Numbered, with file:line, design requirement, impact]
## Major Findings
[Same format]
## Minor Findings
[Grouped by category, brief]
## Gap Analysis
[Table: design requirement, status (covered/missing), reference]
## Summary
[Severity counts, assessment]
Severity classification:
Next steps:
Report severity counts only. No merge-readiness language — the user reads severity counts and decides what to act on. The report is the deliverable; the skill does not queue follow-up work or record lifecycle state.
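One way to produce the severity counts for the Summary section is sketched below. It assumes findings are held as dictionaries with a `severity` key, which is a hypothetical representation; the skill itself does not prescribe a data structure. The three severity names are taken from the report headings.

```python
from collections import Counter

# Severity levels in report order, matching the report section headings.
SEVERITIES = ("Critical", "Major", "Minor")


def severity_summary(findings: list[dict]) -> str:
    """Format severity counts, listing every level even when its count is zero."""
    counts = Counter(finding["severity"] for finding in findings)
    return ", ".join(f"{level}: {counts[level]}" for level in SEVERITIES)


if __name__ == "__main__":
    sample = [
        {"severity": "Minor"},
        {"severity": "Critical"},
        {"severity": "Minor"},
    ]
    print(severity_summary(sample))  # Critical: 1, Major: 0, Minor: 2
```

The output is counts only, in keeping with the rule that the report carries no merge-readiness language.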
- agents/decisions/deliverable-review.md
- references/example-report.md
npx claudepluginhub ddaanet/edify-plugin