Skill

audit

Detect documentation/implementation drift and auto-generate Issues (`/audit drift`), and detect structural fragility (`/audit fragility`). AI detects semantic drift between Steering Documents + Project Documents and codebase implementation, and auto-generates Issues for code-side fixes. Where `/doc sync` proposes documentation-side fixes, `/audit` is the complementary skill that creates Issues for code-side fixes. Running without arguments executes both drift and fragility perspectives in an integrated run. `/audit stats` aggregates Issue metadata across the project and generates a project health diagnostic report (throughput / composition / First-try success / Backlog Health, etc.), providing a third lens for project health alongside drift and fragility detection. `/audit stats --retention` adds phase/verify and Icebox dwell metrics (median/p95/30-day threshold violations, verify-type breakdown, Icebox dwell, trigger candidates) with escalation-based retire-proposal comment posting (30/60/90 days for verify, 90/180 days for Icebox). `/audit recoveries` reads the cross-Issue orchestration recovery log (`docs/reports/orchestration-recoveries.md`) and files Issues for recurring patterns that exceed a frequency threshold. `/audit progress <XL-parent-issue-number>` displays a sub-issue progress snapshot (status breakdown, phase distribution, time estimate, 24h activity) for the specified XL parent issue. `/audit auto-session <session-id>` generates the data layer of a /auto session retrospective report from `.tmp/auto-events.jsonl` filtered by session_id (SESSION_ID generated by /auto at startup as PID-timestamp). `/audit auto-session --full <session-id>` additionally generates LLM drafts for all 4 narrative sections (What worked / Limits and gaps / Improvement candidates surfaced / Conclusion) with `[LLM draft — human review required]` markers; no issues are auto-filed (human gate preserved). 
Both modes also generate a Japanese-translated sibling file at `{report-path-without-ext}-ja.md` by default; pass `--no-ja` to skip.

Invocation

How this skill is triggered — by the user, by Claude, or both

Slash command

/wholework:audit

User invocable

Model invocable

Inline context

Default effort

Tool Access

This skill is limited to the following tools:

Read, Write, Edit, Glob, Grep, Bash(gh issue create:*, gh issue list:*, gh issue view:*, gh issue edit:*, gh label create:*, ls:*, mkdir:*, rm:*, ${CLAUDE_PLUGIN_ROOT}/scripts/gh-issue-edit.sh:*, ${CLAUDE_PLUGIN_ROOT}/scripts/gh-graphql.sh:*, ${CLAUDE_PLUGIN_ROOT}/scripts/gh-issue-comment.sh:*, ${CLAUDE_PLUGIN_ROOT}/scripts/get-issue-size.sh:*, ${CLAUDE_PLUGIN_ROOT}/scripts/get-issue-type.sh:*, ${CLAUDE_PLUGIN_ROOT}/scripts/get-issue-priority.sh:*, ${CLAUDE_PLUGIN_ROOT}/scripts/collect-recovery-candidates.sh:*, ${CLAUDE_PLUGIN_ROOT}/scripts/check-eager-load-capability.sh:*, ${CLAUDE_PLUGIN_ROOT}/scripts/compute-escalation-level.sh:*, ${CLAUDE_PLUGIN_ROOT}/scripts/get-sub-issue-progress.sh:*, ${CLAUDE_PLUGIN_ROOT}/scripts/get-auto-session-report.sh:*)

Context Preview

The summary Claude sees in its skill listing — used to decide when to auto-load this skill

Parse ARGUMENTS and route to the appropriate subcommand.

Supporting Files

auto-session-narrative-prompts.md

SKILL.md

1242 lines · ~14.3k tokens (exceeds 5k compaction limit)

Stats

Language: Shell

Stars: 0

Maintenance: Excellent

Last Commit: Jun 18, 2026

audit: Documentation × Implementation Drift Detection

Parse ARGUMENTS and route to the appropriate subcommand.

If ARGUMENTS contains --help, Read ${CLAUDE_PLUGIN_ROOT}/modules/skill-help.md and follow the "Processing Steps" section to output help, then stop.

Command Routing

If ARGUMENTS is drift or starts with drift (including options like --dry-run, --limit N): execute the "drift subcommand" section and exit.

If ARGUMENTS is fragility or starts with fragility (including options like --dry-run, --limit N): execute the "fragility subcommand" section and exit.

If ARGUMENTS is stats or starts with stats (including options like --since DATE, --limit N, --no-save, --retention): execute the "stats Subcommand" section and exit.

If ARGUMENTS is recoveries or starts with recoveries (including options like --dry-run, --limit N, --threshold K): execute the "recoveries subcommand" section and exit.

If ARGUMENTS is progress or starts with progress (e.g., progress <issue-number>): execute the "progress Subcommand" section and exit.

If ARGUMENTS is auto-session or starts with auto-session (e.g., auto-session <session-id>, auto-session --full <session-id>, auto-session --since 24h, auto-session <id> --output <path>, auto-session <id> --no-ja): execute the "auto-session Subcommand" section and exit.

If ARGUMENTS is empty (no arguments), --dry-run, or starts with --limit: execute the "Integrated Execution (drift + fragility)" section and exit.

For any other ARGUMENTS: display "Usage: /audit [drift|fragility|stats|recoveries|progress <issue-number>|auto-session <session-id>] [--dry-run] [--limit N] [--since DATE] [--no-save] [--retention] [--threshold K] [--no-ja] (running /audit without arguments executes drift + fragility integrated run)" and exit.
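
The routing rules above can be read as an ordered dispatch. The following is a minimal sketch, not the skill's actual implementation: the function name `route` and the returned section keys are hypothetical, and "starts with" is interpreted as "the subcommand name followed by a space", which is an assumption.

```python
# Subcommand names in the order the routing rules list them.
SUBCOMMANDS = ["drift", "fragility", "stats", "recoveries", "progress", "auto-session"]


def route(arguments: str) -> str:
    """Return a key naming the section that should handle ARGUMENTS (hypothetical keys)."""
    args = arguments.strip()

    # --help is checked before any routing rule.
    if "--help" in args.split():
        return "help"

    # "is X or starts with X": the bare name, or the name followed by options.
    for name in SUBCOMMANDS:
        if args == name or args.startswith(name + " "):
            return name

    # No subcommand: empty, a bare --dry-run, or anything starting with --limit.
    if args == "" or args == "--dry-run" or args.startswith("--limit"):
        return "integrated"

    # Anything else falls through to the usage message.
    return "usage"
```

Checking subcommands before the bare-option rule keeps `drift --dry-run` from being mistaken for an integrated run.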

drift Subcommand

Detect semantic drift between Steering Documents + Project Documents and codebase implementation, and generate Issues for code-side fixes.

Note: Run /doc sync --deep first to normalize document-side drift, then run /audit drift to detect remaining semantic drift that requires code-side fixes. This keeps /audit drift focused on drift that cannot be resolved by updating documentation alone.

Option Parsing

Parse the following options from ARGUMENTS:

--dry-run: display the drift report only without generating Issues
--limit N: limit Issue generation to N items (in descending severity order)

Step 1: Context Collection

Read ${CLAUDE_PLUGIN_ROOT}/modules/codebase-analysis.md and follow the "Processing Steps" section to execute cross-codebase analysis.

Then collect documents using the following procedure:

Load Steering Documents:

Read ${CLAUDE_PLUGIN_ROOT}/modules/detect-config-markers.md and follow the "Processing Steps" section. Retain SPEC_PATH and STEERING_DOCS_PATH for use in subsequent steps.

Search for $STEERING_DOCS_PATH/product.md, $STEERING_DOCS_PATH/tech.md, $STEERING_DOCS_PATH/structure.md with Glob and Read any that exist. If none exist, display "Steering Documents not found. Run /doc init." and exit with error.

Load Project Documents:

Following the document traversal pattern from /doc, dynamically detect type: project documents using this procedure:

Search the entire repository with Grep for the type: project pattern limited to *.md files, getting a list of candidate file paths
Skip files matching these exclusion patterns:
- Paths starting with $SPEC_PATH/
- Paths containing node_modules/
- Paths starting with .git/
- Paths starting with .tmp/
Read each candidate file and collect its contents
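
As an illustration of the exclusion patterns above, here is a small sketch that filters Grep candidates. It assumes repository-relative paths without a leading `./`; the function name `filter_candidates` is hypothetical.

```python
def filter_candidates(paths: list[str], spec_path: str) -> list[str]:
    """Drop candidate paths that match the documented exclusion patterns."""
    spec_prefix = spec_path.rstrip("/") + "/"
    kept = []
    for path in paths:
        if path.startswith(spec_prefix):
            continue  # Spec files under $SPEC_PATH/
        if "node_modules/" in path:
            continue  # vendored dependencies at any depth
        if path.startswith(".git/") or path.startswith(".tmp/"):
            continue  # repository internals and scratch files
        kept.append(path)
    return kept
```

Note that `.github/` is not excluded, because the pattern is the prefix `.git/` including the slash.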

Fetch existing open Issues (for duplicate check):

gh issue list --state open --json number,title,body --limit 100

The retrieved issue list is used for duplicate checking in Step 3 (after drift detection).

Step 2: Drift Detection

Cross-reference the Steering Documents, Project Documents, and codebase analysis results collected in Step 1 to detect semantic drift.

Steering Documents categories (examples):

| Category | Detection method |
|----------|------------------|
| tech.md Architecture Decisions vs actual code | Compare with Read + Grep pattern comparison (inconsistencies between documented architecture and actual code) |
| tech.md Key Dependencies vs actual dependencies | Extract actual deps from `package.json`/`go.mod` etc. with Grep → compare with tech.md table |
| tech.md Coding Conventions vs actual code | Detect naming convention violations / Forbidden Expressions usage with Grep |
| structure.md Directory Layout vs actual directory | Get actual directory listing with `ls` + Glob → diff against structure.md entries |
| structure.md Key Files vs actual files | Detect absent listed files and unlisted important files with Glob |
| product.md Non-Goals vs implementation | AI judgment to detect implemented features that violate Non-Goals |
| product.md Terms vs code terminology | Detect usages of different notation from defined terms with Grep |

Project Documents categories (examples):

| Category | Detection method |
|----------|------------------|
| workflow.md skill list vs actual skills | Match Glob results of `skills/*/SKILL.md` against skill names/subcommands listed in workflow.md |
| workflow.md phase descriptions vs SKILL.md implementation | Compare phase role descriptions (routing, options, etc.) with actual behavior in SKILL.md via Read |
| workflow.md path references vs actual files | Extract path references (like `skills/<name>/SKILL.md`) with Grep → verify file existence with Glob |
| `docs/environment-adaptation.md` Layer 3 Domain Files table vs bundled Domain file frontmatter | (1) Glob `${CLAUDE_PLUGIN_ROOT}/skills/*/.md` and `${CLAUDE_PLUGIN_ROOT}/modules/*/.md`; for each file Read its frontmatter and collect files with `type: domain` → "actual Domain files". (2) Read `docs/environment-adaptation.md` → extract all rows from the "Domain Files (exhaustive)" table under Layer 3. (3) Report three drift sub-types: table-missing (a file has `type: domain` frontmatter but is not listed in the table); file-or-frontmatter-missing (a table row's file does not exist or lacks `type: domain` frontmatter); load_when-mismatch (the `load_when` column value in the table differs from the `load_when:` block in the file's frontmatter) |
| Capability guidance leaking into eager-loaded common modules | Run `${CLAUDE_PLUGIN_ROOT}/scripts/check-eager-load-capability.sh` and include its output in the drift report. The script: (1) Globs `modules/{name}-adapter.md` to enumerate capability names; (2) detects capability names that appear in section headings in the body of `modules/verify-patterns.md` and `modules/verify-executor.md` (table rows excluded); (3) checks whether the corresponding Domain file `skills/*/{name}-guidance.md` exists; (4) records an Issue candidate when the Domain file does not exist |

Severity scoring (AI judgment):

Assign severity to each detection result using these guidelines (not strict rule-based, AI judgment):

high: Code doesn't work, security issues, complete contradiction between documentation and implementation
medium: Minor functional impact, documentation description is outdated
low: Notation inconsistency, style mismatch, minor irregularities

Step 3: Duplicate Check

Semantically compare the detected drift against existing open Issues retrieved in Step 1.

Reference titles and bodies; if the content is similar to an existing Issue (pointing out the same drift), judge as duplicate and skip. Duplicate check is AI-judgment-based.

Display duplicates as "duplicate (existing Issue #N)" in the results report.

Step 4: Results Output

Display drift detection results in table format:

| No | Category | Severity | Description | Affected Files | Duplicate |
|----|---------|----------|-------------|---------------|-----------|
| 1  | tech.md Coding Conventions | high | ... | skills/foo/SKILL.md | - |
| 2  | workflow.md skill list | medium | ... | docs/workflow.md | - |
| 3  | structure.md Key Files | low | ... | docs/structure.md | existing #123 |

In --dry-run mode: display the table and exit (do not generate Issues).

In normal mode:

If --limit N is specified, select N items in descending severity order. Exclude duplicates ("existing #N") from the count.

Ask the user with AskUserQuestion (non-interactive mode: auto-resolve — automatically select "Generate all" for non-duplicate items up to --limit N; record the decision in an issue comment):

"Generate all": generate Issues for all non-duplicate drift items
"Select": enter item numbers to generate separated by commas (e.g., 1,3,5)
"Cancel": exit without generating Issues

If "Cancel": display "Issue generation cancelled." and exit.

Step 5: Issue Generation

Generate Issues in /issue standard format for approved drift items.

Table row addition verify commands:

When the detected drift involves adding a row to a documentation table (e.g., adding a new entry to a | Name | Path | Role | table in docs/structure.md), generate a grep + section_contains pair from the start — do not generate grep alone:

<!-- verify: grep "{row-keyword}" "{target-file}" -->
<!-- verify: section_contains "{target-file}" "{section-heading}" "{row-keyword}" -->

{row-keyword}: a keyword that uniquely identifies the new table row (e.g., the script name or module name being added)
{section-heading}: the section heading that contains the table. Selection rule:
- If the target table is under an existing named section (e.g., ### Scripts), use that heading
- If the table has no named section of its own, use the parent section heading (e.g., ## Key Files（Required）)

Rationale: grep alone cannot verify that the keyword appears in the correct section. The section_contains command ensures the entry is placed in the expected table, not elsewhere in the file. This matches the guidance in modules/verify-patterns.md §5.
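
For illustration, the grep + section_contains pair can be produced from the three placeholders. This is a sketch only: the helper name `table_row_verify_pair` is hypothetical, and it assumes none of the values contain double quotes.

```python
def table_row_verify_pair(row_keyword: str, target_file: str, section_heading: str) -> list[str]:
    """Build the two verify comments for a table-row addition."""
    return [
        # Confirms the keyword exists somewhere in the file.
        f'<!-- verify: grep "{row_keyword}" "{target_file}" -->',
        # Confirms the keyword sits under the expected section heading.
        f'<!-- verify: section_contains "{target_file}" "{section_heading}" "{row_keyword}" -->',
    ]
```

For example, adding a script row under `### Scripts` in `docs/structure.md` would yield one grep comment and one section_contains comment that both use the script name as the keyword.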

Each Issue body:

## Background

{Context where the drift was found, quoting the relevant Steering/Project Document section}

## Purpose

{Problem resolved by the fix}

## Acceptance Conditions

### Pre-merge (automated verification)

- [ ] <!-- verify: {verify command} --> {condition 1}
- [ ] {condition 2}

### Post-merge

- [ ] {verification items}

Verify command validity check (before creating Issues):

Before calling gh issue create, validate each `<!-- verify: ... -->` comment in the generated Issue body:

Known command type: The command name must be a known type defined in modules/verify-executor.md (e.g., file_exists, file_contains, section_contains, grep, command, github_check, rubric, etc.). If the command type is unknown, replace it with a valid known type or remove the verify comment.
Non-empty arguments: No argument may be an empty string (e.g., file_not_contains "path" "" — the empty second argument is invalid). If an empty argument is found, fix the command with a meaningful value or remove the verify comment.

Fix any invalid verify commands before proceeding to Issue creation.
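
The two validity checks could be sketched as follows. The `KNOWN_TYPES` set is an assumption built from the command names mentioned in this section; the authoritative list lives in modules/verify-executor.md and may contain more. Parsing arguments with `shlex` is also an assumption about the quoting rules.

```python
import re
import shlex

# Assumed subset of known command types; modules/verify-executor.md is authoritative.
KNOWN_TYPES = {
    "file_exists", "file_contains", "file_not_contains", "section_contains",
    "grep", "command", "github_check", "rubric",
}

VERIFY_COMMENT = re.compile(r"<!--\s*verify:\s*(.*?)\s*-->")


def find_invalid_verify_commands(issue_body: str) -> list[str]:
    """Return one problem description per invalid verify comment."""
    problems = []
    for match in VERIFY_COMMENT.finditer(issue_body):
        command = match.group(1)
        try:
            tokens = shlex.split(command)  # keeps "" as an empty-string token
        except ValueError:
            problems.append(f"unparseable: {command}")
            continue
        if not tokens or tokens[0] not in KNOWN_TYPES:
            problems.append(f"unknown command type: {command}")
        elif any(token == "" for token in tokens[1:]):
            problems.append(f"empty argument: {command}")
    return problems
```

An empty result means every verify comment passed both checks; otherwise each entry names the command that needs fixing or removal.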

Label assignment:

After Issue generation, assign the following label:

audit/drift: tracking label indicating the drift was detected by the audit skill

Do not assign the triaged label when creating Issues. The triaged label is assigned by the /triage skill after triage is actually executed; pre-assigning it causes the Issue to be skipped by the triage pipeline, leaving Type/Size/Priority/Value unset.

Type/Size assignment:

Set Type and Size from AI estimation of drift scope (update project fields via ${CLAUDE_PLUGIN_ROOT}/scripts/gh-graphql.sh).

After generation:

Display the list of generated Issue numbers and titles.

Then read ${CLAUDE_PLUGIN_ROOT}/modules/steering-hint.md and follow the "Processing Steps" section.

stats Subcommand

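Aggregate Issue metadata across the project and generate a project health diagnostic report. By default this is a read-only tool: it only writes docs/stats/YYYY-MM-DD.md (overwriting a report already saved for the same date) and does not edit other existing files or create Issues. With --retention, it additionally posts retire-proposal comments and may add the stale-verify label.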

Option Parsing

Parse the following options from ARGUMENTS:

--since DATE: aggregation start date (default: 90 days before today; format: YYYY-MM-DD)
--limit N: maximum number of Issues to fetch (default: 500)
--no-save: skip saving to docs/stats/; output to stdout only
--retention: enable retention analysis — compute phase/verify and Icebox dwell metrics, and post escalation-based retire-proposal comments

Step 1: Data Collection

Fetch Issue list:

gh issue list --state all --json number,title,body,labels,createdAt,closedAt,state --limit {N}

Filter to Issues created on or after --since DATE. If --since is not specified, use 90 days before today as the default.

Fetch timeline items for each Issue (for reopen and phase label transition analysis):

For each Issue in the filtered list:

${CLAUDE_PLUGIN_ROOT}/scripts/gh-graphql.sh --query get-issue-timeline -F num={number} \
    --jq '.data.repository.issue'

Extract the following from timelineItems:

Reopen events: ReopenedEvent entries → mark the Issue as having reopen history
Phase label transition history: LabeledEvent and UnlabeledEvent entries for phase/* labels → record the sequence of phase transitions in chronological order

Spec file existence check (for retrospective presence):

Use Glob to check whether $SPEC_PATH/issue-{number}-*.md exists for each Issue. Record existence as a boolean (do not read Spec content).

Step 2: Computation

Success/Failure Definitions (3 levels, displayed simultaneously)

First-try success (strictest): Issue reached phase/done AND has no reopen history
Completed: Issue reached phase/done (reopen history does not affect this)
Rework: number of times the phase sequence went from phase/verify back to phase/code
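
A minimal sketch of the three definitions, assuming each Issue's phase history has already been reduced to a chronological list of applied `phase/*` labels plus a reopen flag (both hypothetical inputs derived from the timeline items):

```python
def count_rework(phase_sequence: list[str]) -> int:
    """Count transitions that go from phase/verify straight back to phase/code."""
    return sum(
        1
        for previous, current in zip(phase_sequence, phase_sequence[1:])
        if previous == "phase/verify" and current == "phase/code"
    )


def classify_outcome(phase_sequence: list[str], reopened: bool) -> dict:
    """Evaluate all three levels at once, since they are displayed together."""
    completed = "phase/done" in phase_sequence
    return {
        "first_try_success": completed and not reopened,
        "completed": completed,
        "rework": count_rework(phase_sequence),
    }
```

Under these definitions an Issue can count as a First-try success and still have a non-zero Rework count, because First-try success only looks at reopen history.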

Composition (Type / Size / Priority)

For each Issue in the filtered list, resolve Type, Size, and Priority from GitHub Projects fields (with label fallback) by calling the helper scripts:

${CLAUDE_PLUGIN_ROOT}/scripts/get-issue-type.sh {number}      # -> Bug / Feature / Task (empty if unset)
${CLAUDE_PLUGIN_ROOT}/scripts/get-issue-size.sh {number}      # -> XS / S / M / L / XL (exit 1 if unset)
${CLAUDE_PLUGIN_ROOT}/scripts/get-issue-priority.sh {number}  # -> urgent / high / medium / low (exit 1 if unset)

Classify as "unset" when the script exits with a non-zero status or outputs an empty string. The gh-graphql.sh --cache flag used internally in each script deduplicates GraphQL requests for the same Issue.

Content Segment Classification (MVP: keyword-based)

Classify each Issue by checking whether its title or body contains any of the following keywords (case-sensitive partial match). Assign the first matching segment in order; if none match, assign "other".

| Segment | Keywords |
|---------|----------|
| ui/design | `UI`, `デザイン`, `画面`, `レイアウト`, `Figma`, `design` |
| backend | `API`, `サーバー`, `サーバ`, `DB`, `データベース`, `backend` |
| infra | `CI`, `CD`, `Docker`, `環境`, `deploy`, `インフラ`, `runner` |
| docs | `ドキュメント`, `doc`, `README`, `CLAUDE.md`, `文書` |
| test | `テスト`, `test`, `bats`, `spec` |
| other | (none of the above) |

This section is structured as an independent subsection to allow future replacement with LLM-based classification.
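
The keyword-based classification can be sketched directly from the table. The function name `classify_segment` is hypothetical; the keyword lists are copied from the table and matched as case-sensitive substrings in table order.

```python
# Segments in priority order; the first segment with any matching keyword wins.
SEGMENTS = [
    ("ui/design", ["UI", "デザイン", "画面", "レイアウト", "Figma", "design"]),
    ("backend", ["API", "サーバー", "サーバ", "DB", "データベース", "backend"]),
    ("infra", ["CI", "CD", "Docker", "環境", "deploy", "インフラ", "runner"]),
    ("docs", ["ドキュメント", "doc", "README", "CLAUDE.md", "文書"]),
    ("test", ["テスト", "test", "bats", "spec"]),
]


def classify_segment(title: str, body: str) -> str:
    """Assign the first segment whose keyword appears in the title or body."""
    text = f"{title}\n{body}"
    for segment, keywords in SEGMENTS:
        if any(keyword in text for keyword in keywords):
            return segment
    return "other"
```

Because the match is a plain substring test, short keywords such as `doc` or `spec` also match inside longer words, and an Issue mentioning both `API` and `README` lands in backend since that segment comes first.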

Work Origin Classification

Classify each Issue based on its labels:

audit/drift label present → audit (drift)
audit/fragility label present → audit (fragility)
retro/verify label present → retrospective
None of the above → manual

Note: retro/verify label may not yet exist in the repository. When the label is absent or no Issue has it, the "retrospective" category will show 0 — this is expected behavior. Once the companion Issue adding retro/verify label assignment to /verify Step 13 is merged, retrospective-derived Issues will be separated automatically.

Outcome Exclusion Filter

Filter definition for excluding retro/verify-labeled Issues from all sub-item aggregations in Section 5 (Outcome).

Compute the following from filtered_issues:

retro_verify_count: number of Issues in filtered_issues that have the retro/verify label
outcome_population: filtered_issues minus Issues that have the retro/verify label

retro/verify Issues represent wholework infrastructure improvement proposals surfaced by the /verify phase and closed as "not planned" after upstream migration — they are not implementation failures and must not affect First-try success rate, Completed rate, Rework, or Phase regression calculations.

Note: Section 4 (Work Origin) uses the full filtered_issues as its population and is not affected by this filter. retro/verify Issues are already classified as the "retrospective" category in Section 4.
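
To make the two populations concrete, here is a sketch that derives the Work Origin category and the Outcome population from the same filtered list. Issues are represented as dicts with a `labels` list of label names, which is an assumed shape.

```python
def work_origin(labels: list[str]) -> str:
    """Classify an Issue by label, checking categories in the documented order."""
    if "audit/drift" in labels:
        return "audit (drift)"
    if "audit/fragility" in labels:
        return "audit (fragility)"
    if "retro/verify" in labels:
        return "retrospective"
    return "manual"


def split_outcome_population(filtered_issues: list[dict]) -> tuple[int, list[dict]]:
    """Return (retro_verify_count, outcome_population)."""
    population = [i for i in filtered_issues if "retro/verify" not in i["labels"]]
    retro_verify_count = len(filtered_issues) - len(population)
    return retro_verify_count, population
```

Work Origin is computed over every Issue, while the Outcome metrics only see the second element of the returned tuple.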

Trend Analysis (30-day window × 3)

Split the past 90 days into three 30-day windows (window 1: oldest, window 3: most recent). For each window, compute:

Created: number of Issues created in the window
Closed: number of Issues closed in the window
Net: Closed − Created
Open end: total Open Issues at the end of the window
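
One possible reading of the window computation is sketched below. The window boundaries are an assumption: the 90 days end on today (inclusive) and each window is a half-open 30-day range. Issues are dicts with `created` and `closed` dates (`closed` is `None` while open), which is a hypothetical shape.

```python
from datetime import date, timedelta


def trend_windows(issues: list[dict], today: date) -> list[dict]:
    """Compute Created / Closed / Net / Open end for three 30-day windows."""
    start = today - timedelta(days=89)  # 90 days including today
    rows = []
    for index in range(3):
        w_start = start + timedelta(days=30 * index)
        w_end = w_start + timedelta(days=30)  # exclusive upper bound
        created = sum(1 for i in issues if w_start <= i["created"] < w_end)
        closed = sum(
            1 for i in issues
            if i["closed"] is not None and w_start <= i["closed"] < w_end
        )
        # Open at window end: created before the end and not yet closed by then.
        open_end = sum(
            1 for i in issues
            if i["created"] < w_end and (i["closed"] is None or i["closed"] >= w_end)
        )
        rows.append({
            "window": index + 1,
            "created": created,
            "closed": closed,
            "net": closed - created,
            "open_end": open_end,
        })
    return rows
```

The Open end figure here only counts Issues present in the input list, so Issues created before the aggregation start date would need to be passed in as well to get a repository-wide total.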

Backlog Health Thresholds

Stale candidates: Open Issues with no update for 90 or more days (same set as age ≥ 90d Open)
Untriaged candidates: Open Issues without the triaged label

phase/verify Dwell Time

For each Issue that has (or had) a phase/verify label, compute the dwell time as:

Active Issues: days from the most recent phase/verify LabeledEvent to today
Past Issues: days from the most recent phase/verify LabeledEvent to the most recent phase/verify UnlabeledEvent

Compute the following aggregates across all Issues currently in phase/verify:

Median dwell time (days)
p95 dwell time (days)
Max dwell time (days)

Observation Waiting Count

Scan each Issue currently labeled phase/verify (closed state) for unchecked (- [ ]) lines containing verify-type: observation. Count the number of such Issues.

Opportunistic Remaining Count

Scan each Issue currently labeled phase/verify (closed state) for unchecked (- [ ]) lines containing verify-type: opportunistic. Count the number of such Issues.

Manual Waiting Count

Scan each Issue currently labeled phase/verify (closed state) for unchecked (- [ ]) lines containing verify-type: manual. Count the number of such Issues.

30-Day Threshold Violations

For each Issue currently in phase/verify, compute the dwell time from the most recent phase/verify LabeledEvent to today. Collect all Issues with dwell time ≥ 30 days as threshold violation candidates. Record the list with Issue number, title, and dwell days.

Icebox Dwell Time

For Issues with Project Status=Icebox (fetched via ${CLAUDE_PLUGIN_ROOT}/scripts/gh-graphql.sh --query get-projects-with-fields), compute the dwell time as days from each Issue's createdAt to today.

Compute the following aggregates across all Icebox Issues:

Icebox count: total number of Issues with Project Status=Icebox
Median dwell time (days)
p95 dwell time (days)

Icebox Trigger Candidates

For each Icebox Issue, scan the Issue body for re-evaluation trigger text (lines containing keywords such as 「再評価トリガー」, "re-evaluation trigger", or "trigger"). For each trigger found, apply heuristic judgment:

If the trigger references a specific Issue number (e.g., #123), check whether that Issue is CLOSED via gh issue view.
If the trigger references an event or condition, apply model judgment to estimate whether the condition may have been met.

Record Issues where at least one trigger heuristic evaluates to true as "trigger fire candidates".

Highlights Auto-Detection Logic

Collect items meeting any of the following criteria to display in the Highlights section:

Failure rate for a specific Size, Type, or Content segment is 2x or more above the overall average
Trend direction (Net) is the same direction for 2 or more consecutive 30-day windows (↑↑ or ↓↓)
Backlog net change in the most recent window has worsened by 20% or more

Highlights contain only auto-detected items. Do not include interpretation or inference in the report.
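
Two of the three criteria reduce to simple comparisons, sketched here under stated assumptions: rates are fractions keyed by group name, a zero overall rate yields no highlights, and a Net of zero counts as no direction. The backlog-worsening criterion is not covered because the text does not define its baseline. Function names are hypothetical.

```python
def groups_over_double(rates: dict[str, float], overall: float) -> list[str]:
    """Groups whose failure rate is at least 2x the overall average."""
    if overall <= 0:
        return []
    return [name for name, rate in rates.items() if rate >= 2 * overall]


def has_consecutive_trend(nets: list[int]) -> bool:
    """True when two adjacent windows have Net values with the same non-zero sign."""
    def sign(value: int) -> int:
        return (value > 0) - (value < 0)

    return any(
        sign(a) != 0 and sign(a) == sign(b)
        for a, b in zip(nets, nets[1:])
    )
```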

Step 3: Report Generation

Generate a Markdown report containing all 7 sections below (Sections 8 and 9 are appended only when --retention is specified). Output to stdout.

Section 1: Highlights

List items that meet the auto-detection criteria from Step 2. If no items meet the criteria, output "No highlights detected."

Do not interpret or infer. Only enumerate items that meet the detection thresholds.

Section 2: Flow

Display Created / Closed / Net / Open end for each of the three 30-day windows in table format.

| Window | Period | Created | Closed | Net | Open end |
|--------|--------|---------|--------|-----|----------|
| W1 (oldest) | YYYY-MM-DD – YYYY-MM-DD | N | N | N | N |
| W2 | ... | N | N | N | N |
| W3 (recent) | ... | N | N | N | N |

Section 3: Composition

Display counts by Type, Size, and Priority. Also show the ratio change for the most recent 30-day window vs. the prior two windows combined.

Section 4: Work Origin

Display distribution of audit (drift) / audit (fragility) / retrospective / manual. Include percentage of total.

If the retro/verify label does not exist, display "retrospective" as 0 with a note: "(retro/verify label not yet assigned — will be separated once companion Issue is merged)".

Section 5: Outcome

Display the exclusion note first:

Outcome population: N Issues (excluding M Issues labeled retro/verify)

where N = count(outcome_population) and M = retro_verify_count. Always display this note even when M = 0.

All sub-items below are computed using outcome_population (i.e., filtered_issues with retro/verify-labeled Issues excluded):

By Size: First-try success rate, Completed rate, and average Rework count for each Size
Phase regression points: which phase/verify → phase/code regressions occurred most frequently
By Content segment: First-try success rate and reopen rate vs. overall average for each segment
Trend: First-try success rate per 30-day window × 3 (using outcome_population)

Section 6: Backlog Health

Display the following:

Total Open Issue count
Age distribution: 0–7d / 7–30d / 30–90d / 90d+
Stale candidate count (≥ 90d, same as 90d+ Open)
Untriaged candidate count (Open without triaged label)
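
The age distribution can be sketched as a bucketing function. The bucket labels share their boundary values, so treating each bucket as half-open (7 days falls into 7–30d, 90 days into 90d+) is an assumption; it keeps the 90d+ bucket consistent with the ≥ 90d stale threshold.

```python
BUCKETS = ["0–7d", "7–30d", "30–90d", "90d+"]


def age_bucket(age_days: int) -> str:
    """Map an Open Issue's age in days to its bucket label."""
    if age_days < 7:
        return "0–7d"
    if age_days < 30:
        return "7–30d"
    if age_days < 90:
        return "30–90d"
    return "90d+"


def age_distribution(ages: list[int]) -> dict[str, int]:
    """Count Open Issues per bucket, including empty buckets."""
    counts = {label: 0 for label in BUCKETS}
    for age in ages:
        counts[age_bucket(age)] += 1
    return counts
```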

Section 7: phase/verify Dwell and Observation Queue

Display the following (computed from the Step 2 metrics):

phase/verify dwell time (current Issues in phase/verify): median / p95 / max (days)
Observation waiting: number of phase/verify Issues with unchecked verify-type: observation ACs
Opportunistic remaining: number of phase/verify Issues with unchecked verify-type: opportunistic ACs

If there are no Issues currently in phase/verify, display "No Issues currently in phase/verify."

--retention Option

Skip this entire section when --retention is not specified.

When --retention is specified, append the following after Section 7.

Section 8: phase/verify Retention Metrics

Display the following table with threshold warnings:

| Metric | Value | Threshold | Status |
|--------|-------|-----------|--------|
| phase/verify dwell (median) | N days | > 14 days | OK / WARNING |
| phase/verify dwell (p95) | N days | > 30 days | OK / WARNING |
| Observation waiting | N | > 10 | OK / WARNING |
| Opportunistic waiting | N | > 10 | OK / WARNING |
| Manual waiting | N | > 5 | OK / WARNING |
| 30-day threshold violations | N | > 0 | OK / WARNING |

List 30-day threshold violation Issues (if any) with Issue number, title, and dwell days.

Section 9: Icebox Retention Metrics

Display the following table with threshold warnings:

| Metric | Value | Threshold | Status |
|--------|-------|-----------|--------|
| Icebox dwell (median) | N days | > 90 days | OK / WARNING |
| Icebox dwell (p95) | N days | > 180 days | OK / WARNING |
| Icebox count | N | - | - |
| Trigger fire candidates | N | > 0 | OK / NOTIFY |

List trigger fire candidate Issues (if any) with Issue number, title, and the matched trigger text.

Retire-Proposal Comment Posting

For each Issue currently in phase/verify, compute dwell days and call:

${CLAUDE_PLUGIN_ROOT}/scripts/compute-escalation-level.sh verify <dwell_days>

Route by escalation level:

Level 0 (0–29 days): no action
Level 1 (30–59 days): post observation guide reminder comment (suggest intentionally triggering the observation event)
Level 2 (60–89 days): post retire candidate comment AND add stale-verify label via gh issue edit --add-label stale-verify
Level 3 (90+ days): post "manual confirmation or remove observation condition" decision-prompt comment

For each Icebox Issue, compute dwell days and call:

${CLAUDE_PLUGIN_ROOT}/scripts/compute-escalation-level.sh icebox <dwell_days>

Route by escalation level:

Level 0 (0–89 days): no action
Level 1 (90–179 days): post observation guide reminder comment
Level 2 (180+ days): post retire candidate comment
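
The level boundaries listed above can be expressed as threshold tables. This sketch only mirrors the documented day ranges; it is not the implementation of compute-escalation-level.sh, and the names `THRESHOLDS` and `escalation_level` are hypothetical.

```python
# Day counts at which each successive level begins, per the lists above.
THRESHOLDS = {
    "verify": [30, 60, 90],  # levels 1, 2, 3
    "icebox": [90, 180],     # levels 1, 2
}


def escalation_level(kind: str, dwell_days: int) -> int:
    """Level = number of thresholds the dwell time has reached."""
    return sum(1 for threshold in THRESHOLDS[kind] if dwell_days >= threshold)
```

Level 0 means no action for either kind; the Icebox scale stops at Level 2.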

Duplicate prevention: before posting a comment, fetch existing comments via gh issue view --json comments and check for a comment containing the same escalation level marker (`<!-- escalation-level: {N} -->`). If found, skip that Issue and do not post a duplicate.

Comment format (include escalation level marker for duplicate prevention):

<!-- escalation-level: {N} -->
## phase/verify Retention Notice (Level {N})

This Issue has been in `phase/verify` for **{dwell_days} days**.

{Level-specific message}

For Icebox comments, use ## Icebox Retention Notice (Level {N}) as the heading.
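
The marker-based duplicate check and the phase/verify comment body can be sketched together. Function names are hypothetical, and the input to `should_post` is assumed to be the list of existing comment bodies already extracted from the `gh issue view --json comments` output.

```python
def escalation_marker(level: int) -> str:
    """Hidden marker that identifies a notice of a given level."""
    return f"<!-- escalation-level: {level} -->"


def build_verify_notice(level: int, dwell_days: int, message: str) -> str:
    """Render the phase/verify notice in the documented comment format."""
    return "\n".join([
        escalation_marker(level),
        f"## phase/verify Retention Notice (Level {level})",
        "",
        f"This Issue has been in `phase/verify` for **{dwell_days} days**.",
        "",
        message,
    ])


def should_post(existing_comment_bodies: list[str], level: int) -> bool:
    """Post only if no existing comment carries the same level marker."""
    marker = escalation_marker(level)
    return not any(marker in body for body in existing_comment_bodies)
```

Because the marker includes the level, an Issue that moves from Level 1 to Level 2 still receives the new notice, while a rerun at the same level is skipped.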

Step 4: Save

If --no-save is specified: output to stdout only and exit.

If --no-save is not specified:

Determine today's date in YYYY-MM-DD format
Create directory if it does not exist:
```
mkdir -p docs/stats
```
Write report content to docs/stats/YYYY-MM-DD.md (overwrite if the file already exists for the same date). When --retention is specified, the Sections 8 and 9 retention output is included in the saved file.
Display: "Report saved to docs/stats/YYYY-MM-DD.md"

Then read ${CLAUDE_PLUGIN_ROOT}/modules/steering-hint.md and follow the "Processing Steps" section.

fragility Subcommand

Detect structural fragility based on project context (Steering Documents) and generate risk improvement Issues.

Option Parsing

Parse the following options from ARGUMENTS (same system as drift):

--dry-run: display the fragility report only without generating Issues
--limit N: limit Issue generation to N items (in descending severity order)

Step 1: Context Collection

Read ${CLAUDE_PLUGIN_ROOT}/modules/codebase-analysis.md and follow the "Processing Steps" section to execute cross-codebase analysis.

Then collect documents using the following procedure:

Load Steering Documents:

Read ${CLAUDE_PLUGIN_ROOT}/modules/detect-config-markers.md and follow the "Processing Steps" section. Retain SPEC_PATH and STEERING_DOCS_PATH for use in subsequent steps.

Load Project Documents:

Following the document traversal pattern from /doc, dynamically detect type: project documents using this procedure:

Search the entire repository with Grep for the type: project pattern limited to *.md files, getting a list of candidate file paths
Skip files matching these exclusion patterns:
- Paths starting with $SPEC_PATH/
- Paths containing node_modules/
- Paths starting with .git/
- Paths starting with .tmp/
Read each candidate file and collect its contents

Fetch existing open Issues (for duplicate check):

gh issue list --state open --json number,title,body --limit 100

The retrieved issue list is used for duplicate checking in Step 3 (after fragility detection).

Step 2: Fragility Detection

Based on context collected in Step 1, detect structural fragility in the following 5 categories.

Detection categories (exhaustive):

| Category | Detection method |
|----------|------------------|
| Missing tests for core modules | For modules positioned as core in product.md / structure.md, check for test files in `tests/` with Glob. Detect modules without tests (test coverage gap detection) |
| Architecture Decisions violations | Read the Architecture Decisions section of tech.md and detect code patterns contradicting the documented design decisions with Grep + Read |
| Missing error handling for critical external deps | Identify call sites for dependencies deemed critical in tech.md Key Dependencies with Grep, and verify presence/absence of try/catch etc. error handling |
| Single point of failure | Identify files that many modules depend on from structure.md dependency relations, and verify presence/absence of corresponding tests/documentation |
| Scattered configuration | Detect cases where environment variables/config values are scattered across multiple files without an SSoT (Single Source of Truth) with Grep, and cross-reference with tech.md descriptions |

Severity scoring (AI judgment):

Use the same guidelines as drift:

high: high risk that critical features break, fragility with wide impact
medium: risk under specific conditions, partial impact
low: minor risk, seeds of future problems

Boundary with drift:

drift: "documentation says X, but code is Y" (factual inconsistency)
fragility: "given this project's structure, this is likely to break" (risk indication)

If the same location applies to both, prioritize drift and skip the fragility side.

Step 3: Duplicate Check

Semantically compare the detected fragility against existing open Issues retrieved in Step 1.

Reference titles and bodies; if the content is similar to an existing Issue (pointing out the same fragility), judge as duplicate and skip. Duplicate check is AI-judgment-based.

Display duplicates as "duplicate (existing Issue #N)" in the results report.

Step 4: Results Output

Display fragility detection results in table format:

| No | Category | Severity | Description | Affected Files | Duplicate |
|----|---------|----------|-------------|---------------|-----------|
| 1  | Missing tests for core modules | high | ... | modules/foo.md | - |
| 2  | Architecture Decisions violations | medium | ... | skills/bar/SKILL.md | - |
| 3  | Scattered configuration | low | ... | scripts/setup.sh | existing #456 |

In --dry-run mode: display the table and exit (do not generate Issues).

In normal mode:

If --limit N is specified, select N items in descending severity order. Exclude duplicates from the count.
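A minimal sketch of the --limit selection, assuming each result row is a dict with hypothetical `severity` and `duplicate` keys (the latter `None` for non-duplicates). Duplicates are removed before the limit is applied, so they never consume a slot.

```python
from typing import Dict, List, Optional

# Lower rank sorts first, giving descending severity order.
SEVERITY_RANK = {"high": 0, "medium": 1, "low": 2}


def select_for_generation(items: List[Dict], limit: Optional[int] = None) -> List[Dict]:
    """Drop duplicates, sort by severity (high first), then apply --limit N."""
    candidates = [item for item in items if item.get("duplicate") is None]
    # sorted() is stable, so items of equal severity keep their table order.
    candidates = sorted(candidates, key=lambda item: SEVERITY_RANK[item["severity"]])
    return candidates if limit is None else candidates[:limit]


rows = [
    {"no": 1, "severity": "low", "duplicate": None},
    {"no": 2, "severity": "high", "duplicate": 456},
    {"no": 3, "severity": "medium", "duplicate": None},
    {"no": 4, "severity": "high", "duplicate": None},
]
selected = select_for_generation(rows, limit=2)
```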

Ask the user with AskUserQuestion. In non-interactive mode, auto-resolve: automatically select "Generate all" for non-duplicate items up to --limit N, and record the decision in an Issue comment.

"Generate all": generate Issues for all non-duplicate fragility items
"Select": enter item numbers to generate separated by commas (e.g., 1,3)
"Cancel": exit without generating Issues

If "Cancel": display "Issue generation cancelled." and exit.
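The "Select" answer is a comma-separated list of item numbers such as `1,3`. One way to validate it is sketched below; the function name, the range check, and the de-duplication are assumptions, not behavior the skill specifies.

```python
from typing import List


def parse_selection(raw: str, item_count: int) -> List[int]:
    """Parse '1,3' into [1, 3], rejecting numbers outside 1..item_count."""
    numbers: List[int] = []
    for token in raw.split(","):
        token = token.strip()
        if not token:
            continue  # tolerate stray commas and whitespace
        number = int(token)  # raises ValueError for non-numeric input
        if not 1 <= number <= item_count:
            raise ValueError(f"item number out of range: {number}")
        if number not in numbers:
            numbers.append(number)  # ignore repeated numbers
    return numbers


chosen = parse_selection("1,3", item_count=3)
```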

Step 5: Issue Generation

Generate Issues in /issue standard format for approved fragility items.

Each Issue body:

## Background

{Context where the fragility was found, quoting the relevant Steering/Project Document section}

## Purpose

{Risk reduced by the improvement}

## Acceptance Conditions

### Pre-merge (automated verification)

- [ ] <!-- verify: {verify command} --> {condition 1}
- [ ] {condition 2}

### Post-merge

- [ ] {verification items}

Label assignment:

After Issue generation, assign the following label:

audit/fragility: tracking label indicating the fragility was detected by the audit skill

Type/Size assignment:

Set Type and Size from AI estimation of fragility scope (update project fields via ${CLAUDE_PLUGIN_ROOT}/scripts/gh-graphql.sh).

After generation:

Display the list of generated Issue numbers and titles.

Then read ${CLAUDE_PLUGIN_ROOT}/modules/steering-hint.md and follow the "Processing Steps" section.

recoveries Subcommand

Read the cross-Issue orchestration recovery log and file Issues for patterns that exceed a frequency threshold. Mirrors the drift/fragility subcommand structure.

Option Parsing

Parse the following options from ARGUMENTS:

--dry-run: display candidates only without generating Issues
--limit N: limit Issue generation to N items (in descending frequency order)
--threshold K: minimum recurrence count to qualify as a candidate (default: 3)

Step 1: Context Collection

Read the recovery log:

Read docs/reports/orchestration-recoveries.md. If the file does not exist, display "Recovery log not found: docs/reports/orchestration-recoveries.md. Recovery events are written by /auto Step 4a." and exit.

Fetch existing open Issues (for duplicate check):

gh issue list --state open --json number,title,body --limit 100

Write the JSON to .tmp/open-issues-recoveries.json.

Step 2: Candidate Detection

Run the candidate detection script:

${CLAUDE_PLUGIN_ROOT}/scripts/collect-recovery-candidates.sh docs/reports/orchestration-recoveries.md --threshold ${K} --issues-json .tmp/open-issues-recoveries.json

Where ${K} is the value from --threshold (default: 3).

The script outputs <symptom-short>\t<count> lines for each qualifying candidate.
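As an illustration of consuming that output, the sketch below parses tab-separated `<symptom-short>\t<count>` lines and sorts them in descending frequency order. The function name is hypothetical, and the sample values are taken from the results table shown later in this subcommand.

```python
from typing import List, Tuple


def parse_candidates(output: str) -> List[Tuple[str, int]]:
    """Parse '<symptom-short>\\t<count>' lines into (symptom, count) pairs,
    most frequent first."""
    candidates: List[Tuple[str, int]] = []
    for line in output.splitlines():
        if not line.strip():
            continue  # skip blank lines
        # Split on the last tab so the count is always the final field.
        symptom, count = line.rsplit("\t", 1)
        candidates.append((symptom, int(count)))
    candidates.sort(key=lambda pair: pair[1], reverse=True)
    return candidates


sample_output = "verify-timeout-exceeded\t3\ngh-pr-list-head-glob\t4\n"
parsed = parse_candidates(sample_output)
```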

Step 3: Duplicate Check

For each candidate from Step 2, perform a semantic duplicate check against the open Issues retrieved in Step 1:

The collect-recovery-candidates.sh script already excludes exact substring matches
Additionally apply AI-based semantic match: if a candidate symptom-short is semantically equivalent to an existing Issue's title or body, mark as duplicate and skip
Display duplicates as "duplicate (existing Issue #N)" in the results table

Step 4: Results Output

Display recovery candidate results in table format:

| No | Symptom | Occurrences | Duplicate |
|----|---------|-------------|-----------|
| 1  | gh-pr-list-head-glob | 4 | - |
| 2  | verify-timeout-exceeded | 3 | existing #311 |

Clean up temp file: rm -f .tmp/open-issues-recoveries.json

In --dry-run mode: display the table and exit (do not generate Issues).

In normal mode:

If --limit N is specified, select N items in descending frequency order. Exclude duplicates from the count.

Ask the user with AskUserQuestion. In non-interactive mode, auto-resolve: automatically select "Generate all" for non-duplicate items up to --limit N, and record the decision in an Issue comment.

"Generate all": generate Issues for all non-duplicate candidates
"Select": enter item numbers to generate separated by commas (e.g., 1,3)
"Cancel": exit without generating Issues

If "Cancel": display "Issue generation cancelled." and exit.

Step 5: Issue Generation

Generate Issues for approved candidates.

Each Issue body:

## Background

Recurring orchestration recovery pattern detected by `/audit recoveries`:
- Symptom: {symptom-short}
- Occurrences: {count} (threshold: {K})
- Recent examples from `docs/reports/orchestration-recoveries.md`:
  {quote 1-3 representative Diagnosis + Recovery Applied sections}

## Purpose

{Describe what structural fix would prevent recurrence of this recovery pattern}

## Acceptance Conditions

### Pre-merge (automated verification)

- [ ] <!-- verify: {verify command} --> {condition 1}
- [ ] {condition 2}

### Post-merge

- [ ] {verification items}

Label assignment:

After Issue generation, assign audit/fragility (recovery patterns are structural fragility by nature).

Do not assign the triaged label.

Type/Size assignment:

Set Type and Size from AI estimation of recovery pattern scope (update project fields via ${CLAUDE_PLUGIN_ROOT}/scripts/gh-graphql.sh).

Update log entries after filing:

For each filed Issue, update the corresponding log entries in docs/reports/orchestration-recoveries.md:

Find all entries where symptom-short matches and Improvement Candidate is 未起票 (not yet filed). Replace - 未起票 with - 起票済み #NNN (filed as #NNN, where NNN is the new Issue number) using the Edit tool.

Commit the log update:

git add docs/reports/orchestration-recoveries.md
git commit -s -m "chore: update recovery log after /audit recoveries filing

Co-Authored-By: Claude Sonnet 4.6 <[email protected]>"
git push origin HEAD 2>/dev/null || git push origin main

After generation:

Display the list of generated Issue numbers and titles.

Then read ${CLAUDE_PLUGIN_ROOT}/modules/steering-hint.md and follow the "Processing Steps" section.

progress Subcommand

Display a progress snapshot of sub-issues under a specified XL parent issue.

Argument Parsing

Extract the parent issue number from ARGUMENTS (e.g., progress 1000 → parent number 1000). If no issue number is provided, display "Usage: /audit progress <XL-parent-issue-number>" and exit.

Step 1: Fetch Sub-issue Data

Run:

"${CLAUDE_PLUGIN_ROOT}/scripts/get-sub-issue-progress.sh" <parent-number>

The script outputs JSON with the following structure:

{
  "parent": { "number": 1000, "title": "..." },
  "sub_issues": [
    {
      "number": 1001,
      "title": "...",
      "state": "OPEN",
      "createdAt": "2026-06-01T00:00:00Z",
      "closedAt": null,
      "updatedAt": "2026-06-10T12:00:00Z",
      "labels": [{ "name": "phase/code" }],
      "blockedBy": [{ "number": 1005, "state": "OPEN" }]
    }
  ]
}

If the script exits non-zero, display the error and exit.

Step 2: Classify Sub-issue Status

For each sub-issue, classify status using the following priority order (first matching rule wins):

| Priority | Status | Classification rule |
|----------|--------|---------------------|
| 1 | Done | `state == "CLOSED"` |
| 2 | Blocked | `state == "OPEN"` AND any entry in `blockedBy` has `state == "OPEN"` |
| 3 | Stale | `state == "OPEN"` AND labels contain `stale-verify` |
| 4 | In progress | `state == "OPEN"` AND labels contain any of: `phase/code`, `phase/review`, `phase/verify`, `phase/spec` |
| 5 | Pending | all other OPEN issues (including `phase/issue`, `phase/ready`, or no phase label) |
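The priority table can be read as a chain of early returns. The sketch below applies it to the JSON shape from Step 1; the function name is hypothetical, and it assumes `state` is either `"OPEN"` or `"CLOSED"`, so everything that is not closed is treated as open.

```python
IN_PROGRESS_LABELS = {"phase/code", "phase/review", "phase/verify", "phase/spec"}


def classify(sub_issue: dict) -> str:
    """Return the first matching status in priority order."""
    if sub_issue["state"] == "CLOSED":
        return "Done"
    # From here on the issue is assumed to be OPEN.
    if any(blocker["state"] == "OPEN" for blocker in sub_issue.get("blockedBy", [])):
        return "Blocked"
    labels = {label["name"] for label in sub_issue.get("labels", [])}
    if "stale-verify" in labels:
        return "Stale"
    if labels & IN_PROGRESS_LABELS:
        return "In progress"
    return "Pending"


# The sample sub-issue from Step 1: open, phase/code, blocked by open #1005.
sample = {
    "number": 1001,
    "state": "OPEN",
    "labels": [{"name": "phase/code"}],
    "blockedBy": [{"number": 1005, "state": "OPEN"}],
}
status = classify(sample)
```

Because Blocked outranks In progress, the sample is classified as Blocked even though it carries `phase/code`.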

Step 3: Compute Metrics

Status counts: Count sub-issues per status (Done / In progress / Blocked / Stale / Pending).

Phase distribution: For In progress and Blocked sub-issues, count by phase label (phase/issue, phase/spec, phase/ready, phase/code, phase/review, phase/verify). An issue with no phase label counts under "no phase".
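A minimal sketch of the phase count, assuming each sub-issue has already been classified and flattened into a hypothetical `{"status", "labels"}` dict. When an issue carries several phase labels, the sketch counts the first one in the listed order, which is an assumption; zero rows are dropped to match the display rule in Step 4.

```python
from collections import Counter
from typing import Dict, List

PHASE_LABELS = [
    "phase/issue", "phase/spec", "phase/ready",
    "phase/code", "phase/review", "phase/verify",
]


def phase_distribution(issues: List[dict]) -> Dict[str, int]:
    """Count In progress and Blocked sub-issues per phase label."""
    counts: Counter = Counter()
    for issue in issues:
        if issue["status"] not in ("In progress", "Blocked"):
            continue
        phase = next((p for p in PHASE_LABELS if p in issue["labels"]), "no phase")
        counts[phase] += 1
    # Preserve display order and omit rows whose count is 0.
    return {p: counts[p] for p in PHASE_LABELS + ["no phase"] if counts[p] > 0}


distribution = phase_distribution([
    {"status": "In progress", "labels": ["phase/code"]},
    {"status": "Blocked", "labels": ["phase/code"]},
    {"status": "Blocked", "labels": []},
    {"status": "Done", "labels": ["phase/verify"]},
])
```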

Time estimates:

Median completion time: compute median of (closedAt - createdAt) in minutes for all Done sub-issues. If no Done sub-issues exist, display "N/A".
Remaining estimate: pending_count / max(in_progress_count, 1) × median. Display as a range (±20%): e.g., "12-18 hours wall-clock".
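The two formulas above can be worked through as follows. This is a sketch with hypothetical function names; it returns `None` where the output would show "N/A" and converts the minute-based estimate to hours for display.

```python
from datetime import datetime
from statistics import median
from typing import List, Optional, Tuple


def parse_ts(value: str) -> datetime:
    """Parse an ISO 8601 timestamp with a trailing 'Z' (UTC)."""
    return datetime.fromisoformat(value.replace("Z", "+00:00"))


def median_completion_minutes(sub_issues: List[dict]) -> Optional[float]:
    """Median of (closedAt - createdAt) in minutes over Done sub-issues."""
    durations = [
        (parse_ts(s["closedAt"]) - parse_ts(s["createdAt"])).total_seconds() / 60
        for s in sub_issues
        if s["state"] == "CLOSED" and s.get("closedAt")
    ]
    return median(durations) if durations else None


def remaining_range_hours(pending: int, in_progress: int, median_min: float) -> Tuple[float, float]:
    """pending / max(in_progress, 1) x median, as a +/-20% range in hours."""
    estimate_min = pending / max(in_progress, 1) * median_min
    return (estimate_min * 0.8 / 60, estimate_min * 1.2 / 60)


# 10 pending, 2 in progress, 180-minute median: 5 x 180 min = 15 h, +/-20%.
low, high = remaining_range_hours(pending=10, in_progress=2, median_min=180)
```

With these inputs the range works out to 12 to 18 hours, matching the example display string.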

Recent 24h activity: Filter sub-issues where updatedAt is within the last 24 hours. Report the count.
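The 24-hour filter can be sketched as below. The function name is hypothetical, `now` is passed in so the result is reproducible, and treating the cutoff as inclusive is an assumption.

```python
from datetime import datetime, timedelta, timezone
from typing import List


def updated_within_24h(sub_issues: List[dict], now: datetime) -> List[dict]:
    """Return sub-issues whose updatedAt falls within the 24 hours before now."""
    cutoff = now - timedelta(hours=24)
    recent = []
    for sub_issue in sub_issues:
        # updatedAt is ISO 8601 with a trailing 'Z', as in the Step 1 JSON.
        updated = datetime.fromisoformat(sub_issue["updatedAt"].replace("Z", "+00:00"))
        if updated >= cutoff:
            recent.append(sub_issue)
    return recent


now = datetime(2026, 6, 11, 0, 0, tzinfo=timezone.utc)
recent = updated_within_24h(
    [
        {"number": 1001, "updatedAt": "2026-06-10T12:00:00Z"},
        {"number": 1002, "updatedAt": "2026-06-09T12:00:00Z"},
    ],
    now,
)
```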

Blocked relationships: For each Blocked sub-issue, list the OPEN blockedBy issue numbers.

Step 4: Display Output

Output the snapshot in the following format:

XL parent #<parent-number>: <parent-title>
Sub-issues: <total> total (created <earliest-createdAt date>)

Status breakdown:
  ✅ Done:           <N> (<pct>%)
  🔄 In progress:    <N> (<pct>%) — #<num>, #<num>, ...
  🟡 Blocked:        <N> (<pct>%) — #<num> (by #<blocker>), ...
  🟠 Stale:          <N> (<pct>%) — #<num>, ...
  ⬜ Pending:        <N> (<pct>%)

Phase distribution (in-progress + blocked):
  phase/issue:   <N>
  phase/spec:    <N>
  phase/ready:   <N>
  phase/code:    <N>
  phase/review:  <N>
  phase/verify:  <N>
  no phase:      <N>

Time estimates (based on completed sub-issues):
  Median time per sub-issue: <N> min
  Est. remaining: <low>-<high> hours wall-clock (<in_progress_count> concurrent, <pending_count> sub-issues remaining)

Recent activity (last 24h):
  - <N> sub-issues updated

Omit rows with count 0 from Phase distribution. If no sub-issues exist under the parent, display "No sub-issues found for #<parent-number>." and exit.

auto-session Subcommand

Generate the data layer of a /auto session retrospective report from .tmp/auto-events.jsonl, filtered by session_id.

This subcommand covers the post-session time scale of the /audit 3-axis model:

| Command | Time scale | Use case |
|---------|------------|----------|
| `/audit stats` | weeks/months | project health |
| `/audit progress <XL>` | hours | in-progress snapshot |
| `/audit auto-session <id>` | post-session | post-session retrospective (this subcommand) |

Session Boundary Identification

/auto generates a SESSION_ID at startup using the format PID-timestamp (e.g., 12345-1718336400). This identifier is recorded in:

.tmp/auto-session-current — pointer file read by run-auto-sub.sh to populate session_id in each emitted event
.tmp/auto-session-${SESSION_ID}.json — metadata file recording session start time

Each event in .tmp/auto-events.jsonl includes a session_id field set to this value. The auto-session subcommand filters events by session_id to isolate exactly one session's activity, even when multiple sessions' events are mixed in the same log file.
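Filtering a mixed log down to one session can be sketched as follows. Only the `session_id` field comes from the description above; the `event` field in the sample lines and the choice to skip malformed lines are assumptions made for illustration.

```python
import json
from typing import Iterable, Iterator


def events_for_session(lines: Iterable[str], session_id: str) -> Iterator[dict]:
    """Yield the JSONL events whose session_id matches exactly."""
    for line in lines:
        line = line.strip()
        if not line:
            continue
        try:
            event = json.loads(line)
        except json.JSONDecodeError:
            continue  # assumption: malformed lines are skipped, not fatal
        if isinstance(event, dict) and event.get("session_id") == session_id:
            yield event


log_lines = [
    '{"session_id": "12345-1718336400", "event": "phase_start"}',
    '{"session_id": "99999-1718340000", "event": "phase_start"}',
    "not json",
    "",
    '{"session_id": "12345-1718336400", "event": "phase_end"}',
]
session_events = list(events_for_session(log_lines, "12345-1718336400"))
```

In practice the lines would come from an open file handle on the events log; a list is used here to keep the example self-contained.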

Data Source

.tmp/auto-events.jsonl is the primary data source (set via AUTO_EVENTS_LOG env var or default path). This file is populated by R1 (#630) and subsequent extensions. When R1-era event types (watchdog_kill, max_silent_window, token_usage, concurrent_commit_detected) are absent from the log, the corresponding Summary rows degrade gracefully to 0 or N/A.
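One way to get the "degrade to 0" behavior is to count with a `Counter`, which returns 0 for keys it has never seen. The sketch below covers only the zero case; which Summary rows show N/A instead is not specified here. The `event` field name is an assumption about the log schema.

```python
from collections import Counter
from typing import Dict, Iterable

R1_EVENT_TYPES = (
    "watchdog_kill",
    "max_silent_window",
    "token_usage",
    "concurrent_commit_detected",
)


def summarize_r1_events(events: Iterable[dict], type_field: str = "event") -> Dict[str, int]:
    """Count R1-era event types; types absent from the log come out as 0."""
    counts = Counter(event.get(type_field) for event in events)
    return {name: counts[name] for name in R1_EVENT_TYPES}


summary = summarize_r1_events([
    {"event": "watchdog_kill"},
    {"event": "phase_start"},
    {"event": "watchdog_kill"},
])
```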

Output Template Structure

The generated report (docs/reports/auto-session-{session-id}-{date}.md) contains the following sections:

Summary — aggregate metrics table (issues processed, route mix, throughput, recovery counts, watchdog kills, token usage, concurrent commits)
Per-Issue Durations — per-issue phase breakdown table (issue number, size/route, timestamps, PR link, notes)
Recovery Events — chronological list of Tier 1/2/3 recovery events (phase, tier, result, affected issue)
Verify Phase Residuals — issues that entered verify but did not complete it in this session
Concurrent Sessions Detected — events where another session committed to main during a phase
Improvement Candidates Surfaced — anomaly-derived improvement candidates (Tier 3 recoveries, unknown patterns)
Narrative Section (skeleton) — TBD placeholders for "What worked", "Limits and gaps", "Improvement candidates surfaced", "Conclusion" (manual fill, or --full for LLM-assisted draft)

Argument Parsing

Parse from ARGUMENTS (after the auto-session prefix):

<session-id> (positional): generate report for this session; required unless --since is given; may appear before or after --full
--full: enable full mode — after generating the data-layer report, generate LLM narrative drafts for all 4 sections (What worked / Limits and gaps / Improvement candidates surfaced / Conclusion) and insert them into the report with [LLM draft — human review required] markers
--output <path>: override output file path (default: docs/reports/auto-session-<id>-<date>.md)
--since <spec>: list mode — show distinct session_ids from the log, filtered to the specified time window (e.g., 24h, 2026-06-14); omit <session-id> in this mode
--no-ja: skip Japanese sibling generation (Step 4). Default behavior generates a Japanese-translated sibling file at {report-path-without-ext}-ja.md alongside the English report.
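The option set above maps naturally onto `argparse`. This is a sketch of equivalent parsing logic, not how the skill itself reads ARGUMENTS; the function names and the error message are hypothetical.

```python
import argparse
from typing import List


def build_parser() -> argparse.ArgumentParser:
    parser = argparse.ArgumentParser(prog="/audit auto-session")
    # Optional positional: may appear before or after the flags.
    parser.add_argument("session_id", nargs="?")
    parser.add_argument("--full", action="store_true")
    parser.add_argument("--output")
    parser.add_argument("--since")
    parser.add_argument("--no-ja", dest="no_ja", action="store_true")
    return parser


def parse_auto_session_args(argv: List[str]) -> argparse.Namespace:
    parser = build_parser()
    args = parser.parse_args(argv)
    # <session-id> is required unless --since (list mode) is given.
    if args.session_id is None and args.since is None:
        parser.error("<session-id> is required unless --since is given")
    return args


report_args = parse_auto_session_args(["--full", "12345-1718336400"])
list_args = parse_auto_session_args(["--since", "24h"])
```

`parser.error` exits with status 2, so a call with neither a session ID nor `--since` ends the process rather than returning.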

Step 1: Run Report Script

"${CLAUDE_PLUGIN_ROOT}/scripts/get-auto-session-report.sh" <session-id> [--output <path>] [--no-github]

Pass --no-github only in contexts where GitHub API calls are unavailable (e.g., hermetic testing). In normal operation, omit --no-github to include live GitHub label/PR state in the report.

If no session_id is given (list mode with --since):

"${CLAUDE_PLUGIN_ROOT}/scripts/get-auto-session-report.sh" --since <spec>

Step 2: Display Result

Output the path of the generated report file. If in list mode, display the session list returned by the script.

If --full is not specified, stop here.

Step 3: LLM Narrative Draft (`--full` mode only)

This step generates LLM drafts for the 4 narrative sections and inserts them into the report. Run only when --full is present in ARGUMENTS.

Read the generated report from the path output in Step 2
For each Issue number found in the Per-Issue Durations table of the report, fetch issue details: gh issue view <N> --json title,body,labels (provides richer context for narrative generation)
Read ${CLAUDE_PLUGIN_ROOT}/skills/audit/auto-session-narrative-prompts.md to load the prompt templates and few-shot examples
For each of the 4 narrative sections, generate a draft using the report data and issue details as context and the corresponding prompt template:
- What worked: Extract 3-5 elements that functioned as designed, grounded in the report data
- Limits and gaps: Extract 3-5 structural observations where the system fell short or revealed a gap
- Improvement candidates surfaced: For each Limits item, generate one improvement candidate with classification:
  - Run gh issue list --search "<keyword>" to check for existing open issues for each candidate
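  - Classify as: "既存 #XXX に統合提案" (propose merging into existing #XXX) / "Issue 起票候補" (candidate for a new Issue; include a one-paragraph body skeleton) / "凍結推奨（trigger: XXX）" (recommend freezing, with trigger XXX)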
- Conclusion: Write 2-3 paragraph summary grounded in the session data and the three sections above

Write all 4 drafts to .tmp/narrative-draft-<session-id>.md using the Write tool, structured as:

### What worked
{draft content}

### Limits and gaps
{draft content}

### Improvement candidates surfaced
{draft content}

### Conclusion
{draft content}

Run the report script with the draft to insert it:
```
"${CLAUDE_PLUGIN_ROOT}/scripts/get-auto-session-report.sh" <session-id> --narrative-draft .tmp/narrative-draft-<session-id>.md --output <report-path>
```
This replaces each "TBD — fill in after reviewing the session" placeholder with the draft content prefixed by > [LLM draft — human review required]

Delete the temp file:

rm -f .tmp/narrative-draft-<session-id>.md

Output: "Narrative draft complete. Report contains [LLM draft — human review required] markers in all 4 narrative sections. Review and edit before committing."

Note: No issues are filed automatically. The Improvement candidates section lists candidates for human review; filing is done manually via /issue or discarded. This preserves the human gate.

Note: The [LLM draft — human review required] marker is inserted as a blockquote prefix so the user can immediately identify LLM-generated content requiring review before the report is committed or shared.

Step 4: Generate Japanese Sibling

This step runs by default after Steps 1–3 complete (regardless of whether --full was specified). Skip entirely when --no-ja is present in ARGUMENTS.

Determine sibling path: replace the trailing .md of the report path with -ja.md (e.g., docs/reports/auto-session-<id>-<date>.md → docs/reports/auto-session-<id>-<date>-ja.md)
Read the final report (post Step 3 if --full was set, otherwise post Step 2)
Translate the entire content to Japanese with the following rules:
- Translate prose, headings, table column headers, and inline narrative
- Preserve as-is: code blocks, file paths, command names, function/script identifiers (e.g., spawn-recovery-subagent.sh), Issue/PR references (#666), session IDs, ISO 8601 timestamps, SHA hashes, and log markers such as [LLM draft — human review required]. Exception: for the blockquote marker in the report body, prefer the Japanese form [LLM ドラフト — レビュー必須]
- Convert AC/section labels to Japanese natural equivalents (e.g., "Summary" → "サマリ", "Per-Issue Durations" → "Issue 別所要時間", "Recovery Events" → "リカバリイベント", "Verify Phase Residuals" → "Verify Phase 残留", "Concurrent Sessions Detected" → "並行セッション検出", "Improvement Candidates Surfaced" → "改善候補 (自動検出)", "Narrative Section" → "Narrative セクション", "What worked" → "うまくいったこと", "Limits and gaps" → "限界と gap", "Improvement candidates surfaced" → "改善候補 (浮上分)", "Conclusion" → "結論")
- Keep the same Markdown structure (heading levels, table layout, bullet hierarchy)
Write the translated content to the sibling path using the Write tool
Output: "Japanese sibling generated at {sibling-path}."
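The sibling path rule from the first step of this procedure can be expressed in a few lines. The function name is hypothetical, and raising on a path that does not end in `.md` is an assumption; the steps above only describe the `.md` case.

```python
def ja_sibling_path(report_path: str) -> str:
    """Replace the trailing .md with -ja.md."""
    suffix = ".md"
    if not report_path.endswith(suffix):
        raise ValueError(f"expected a {suffix} report path: {report_path}")
    return report_path[: -len(suffix)] + "-ja.md"


sibling = ja_sibling_path("docs/reports/auto-session-12345-1718336400-2026-06-14.md")
```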

Note: The Japanese sibling is generated by default, in both normal and --full modes, to support the project's user-facing Japanese convention (CLAUDE.md). Use --no-ja to opt out.

Integrated Execution (drift + fragility)

/audit (no arguments) sequentially executes both drift and fragility perspectives and displays detection results in an integrated table.

Option Parsing

Parse the following options from ARGUMENTS (same system as drift/fragility):

--dry-run: display the integrated report only without generating Issues
--limit N: limit total Issue generation to N items (in descending severity order)

Step 1: Drift Detection

Execute Steps 1–3 from the "drift subcommand" (context collection, drift detection, duplicate check). Do not proceed to Issue generation at this step; collect detection results only. --dry-run and --limit are applied at the final output stage.

Step 2: Fragility Detection

Execute Steps 1–3 from the "fragility subcommand" (context collection, fragility detection, duplicate check). Reuse the same Steering/Project Documents context from drift if available (skip re-fetching).

If fragility detection results overlap with drift detections (pointing out the same location), prioritize drift and skip the fragility side.

Step 3: Integrated Results Output

Display drift and fragility detection results in an integrated table with a lens column:

| No | lens | Category | Severity | Description | Affected Files | Duplicate |
|----|------|---------|----------|-------------|---------------|-----------|
| 1  | drift | tech.md Coding Conventions | high | ... | skills/foo/SKILL.md | - |
| 2  | fragility | Missing tests for core modules | medium | ... | modules/bar.md | - |
| 3  | drift | workflow.md skill list | low | ... | docs/workflow.md | existing #789 |

In --dry-run mode: display the integrated table and exit (do not generate Issues).

In normal mode:

If --limit N is specified, select N items in descending severity order. Exclude duplicates from the count.

Ask the user with AskUserQuestion. In non-interactive mode, auto-resolve: automatically select "Generate all" for non-duplicate items up to --limit N, and record the decision in an Issue comment.

"Generate all": generate Issues for all non-duplicate items
"Select": enter item numbers to generate separated by commas
"Cancel": exit without generating Issues

Step 4: Issue Generation

Generate Issues in /issue standard format for approved items. Apply the Issue body format, label assignment, and Type/Size assignment based on each item's lens:

For items with lens: drift:

Each Issue body:

## Background

{Context where the drift was found, quoting the relevant Steering/Project Document section}

## Purpose

{Problem resolved by the fix}

## Acceptance Conditions

### Pre-merge (automated verification)

- [ ] <!-- verify: {verify command} --> {condition 1}
- [ ] {condition 2}

### Post-merge

- [ ] {verification items}

Label assignment:

After Issue generation, assign the following label:

audit/drift: tracking label indicating the drift was detected by the audit skill

Type/Size assignment:

Set Type and Size from AI estimation of drift scope (update project fields via ${CLAUDE_PLUGIN_ROOT}/scripts/gh-graphql.sh).

For items with lens: fragility:

Each Issue body:

## Background

{Context where the fragility was found, quoting the relevant Steering/Project Document section}

## Purpose

{Risk reduced by the improvement}

## Acceptance Conditions

### Pre-merge (automated verification)

- [ ] <!-- verify: {verify command} --> {condition 1}
- [ ] {condition 2}

### Post-merge

- [ ] {verification items}

Label assignment:

After Issue generation, assign the following label:

audit/fragility: tracking label indicating the fragility was detected by the audit skill

Type/Size assignment:

Set Type and Size from AI estimation of fragility scope (update project fields via ${CLAUDE_PLUGIN_ROOT}/scripts/gh-graphql.sh).

Display the list of generated Issue numbers and titles grouped by lens.

Then read ${CLAUDE_PLUGIN_ROOT}/modules/steering-hint.md and follow the "Processing Steps" section.