From agent-tooling
Full PR quality-gate: scope the diff, run the project's tests, convene a type-matched panel of expert review agents with a cross-review round, conditional security review and targeted local verification, then a final go / merge-with-fixes / needs-changes verdict plus a call on whether a human staging test is warranted. Use whenever the user asks to "check this PR", "quality-check before merge", "run the PR gate", "is this ready to merge", "vet this branch", "/pr-check", or wants a consolidated go/no-go on a branch or PR — even if they don't name a specific check. Prefer this over running a code review or the test suite alone when the ask is "is this good to merge?".
How this skill is triggered — by the user, by Claude, or both
Slash command
/agent-tooling:pr-checkThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
This is the Claude Code adapter over the agent-neutral **PR quality-gate playbook**. The method lives in the playbook; this file supplies Claude-specific wiring (config discovery, the `scope.py` call, the Workflow-based panel, and which existing skills to reuse).
This is the Claude Code adapter over the agent-neutral PR quality-gate playbook. The method lives in the playbook; this file supplies Claude-specific wiring (config discovery, the scope.py call, the Workflow-based panel, and which existing skills to reuse).
Read the playbook: ../../playbooks/pr-check.md. It defines steps 0–6 and the verdict rules. Follow it; the notes below are only the Claude-specific how.
Read .claude/pr-check.json in the consuming repo (schema: pr-check.example.json). It supplies base_ref, scope globs, sensitive patterns, test_command, reviewer map, a11y_spec, e2e_skill, staging_skill, report_dir. If it's missing, stop and offer to create one from the example — do not hardcode another repo's values.
Run the bundled script once instead of ad-hoc pipelines (the plugin exposes its root as ${CLAUDE_PLUGIN_ROOT}):
python3 "${CLAUDE_PLUGIN_ROOT}/scripts/scope.py" --config .claude/pr-check.json [--pr N | --base <ref>]
(Allowlist Bash(python3 *agent-tooling/scripts/scope.py*) to make it prompt-free.)
It returns {files, flags}. Branch on the flags for the rest.
/code-review high (never /code-review ultra — billed/user-only; suggest it in the verdict for high-risk changes).test_command. Red suite → caps at needs-changes.references/panel-workflow.js: set ROLES from the config's reviewer map for the flags that fired (a generalist always), point all agents at a pinned ref or worktree (not the live tree, which may have concurrent edits), and pass the changed files.SENSITIVE. Run /security-review.RUNTIME or any reproducible finding. Use the config's e2e_skill for stack lifecycle, but feed it the finding-driven plan from step 5, not generic happy-path flows. Reproduction can be cheap (e.g. node --check on an extracted inline script) — don't spin up the full stack when a parse/unit check is decisive.staging_skill per the playbook's criteria.Emit the playbook's verdict format in chat. Be decisive; weight reproduced behavior over inferred findings.
Then persist it as a handoff so it survives the session and another agent can pick it up:
<report_dir>/pr-<N>.md (or <report_dir>/<branch>.md when run on a branch), where report_dir comes from the config (default .claude/pr-check).mkdir -p). It lives under .claude/, which is gitignored — confirm the consuming repo ignores it (don't commit handoffs).If a panel workflow ran, log its cost tagged by PR type so "what does pr-check cost on this kind of PR" accrues empirically. Use the real subagent_tokens the Workflow result reported (not a guess), the scope flags from step 0, and the diff size:
python3 "$SKILL_DIR/../../scripts/record_run.py" --skill pr-check --pr <N> \
--flags <comma-separated flags that fired> --diff-lines <changed lines> \
--subagent-tokens <subagent_tokens from the workflow result> --duration-ms <duration_ms>
Skip --subagent-tokens (defaults 0) for a docs-only run with no panel. See accrued cost-by-PR-type with scripts/cost_report.py [--rate <$/Mtok>], or a pre-run estimate with cost_report.py --predict --flags … --diff-lines ….
Provides behavioral guidelines to reduce common LLM coding mistakes, focusing on simplicity, surgical changes, assumption surfacing, and verifiable success criteria.
Searches, retrieves, and installs Agent Skills from prompts.chat registry using MCP tools like search_skills and get_skill. Activates for finding skills, browsing catalogs, or extending Claude.
npx claudepluginhub lgelauff/wikimedia-coding-agent-lessons --plugin agent-tooling