From dev-kit
Use for "interrogate", "adversarial review", "multi-model review", "challenge this", "stress test this code", "find blind spots", or "tear this apart". Four LLM reviewers challenge changes from independent angles.
How this skill is triggered — by the user, by Claude, or both
Slash command
/dev-kit:interrogateThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
Spawn four reviewers to adversarially review code changes. Each gets the same prompt and rubric. The adversarial signal comes from model diversity, not assigned personas. Models differ in blind spots, priors, and reasoning patterns. Agreement across models is high-confidence signal; lone-model findings are worth reading but lower confidence.
Spawn four reviewers to adversarially review code changes. Each gets the same prompt and rubric. The adversarial signal comes from model diversity, not assigned personas. Models differ in blind spots, priors, and reasoning patterns. Agreement across models is high-confidence signal; lone-model findings are worth reading but lower confidence.
Adapted for Claude Code. The Agent tool exposes three Claude tiers (opus, sonnet, haiku) from a single provider, not four cross-provider models. The reviewers below vary by tier and by the prompt, so true cross-provider diversity is reduced relative to upstream pstack. Weight cross-model agreement accordingly and lean harder on the lead judgment in Step 5. If you have other providers wired up as subagent types or MCP-backed reviewers, add them for real diversity.
The deliverable is a synthesized verdict. Do NOT auto-apply changes.
Identify what to review from context:
git diff main...HEAD (or the appropriate base branch) for the full changesetPackage the diff (or file contents) plus any surrounding context files the reviewers need to understand the code.
Before spawning reviewers, state the intent explicitly. What is this code trying to accomplish? Derive this from:
Write one clear paragraph. Reviewers challenge whether the work achieves the intent well, not whether the intent itself is correct. If you're unsure about the intent, ask the user before proceeding.
Launch all four in a single message using the Agent tool, each with a different model.
| Subagent | Model |
|---|---|
| Reviewer A | opus |
| Reviewer B | sonnet |
| Reviewer C | haiku |
| Reviewer D | sonnet |
For each reviewer:
subagent_type: general-purposemodel: the model from the tableIf a model slug in the table is rejected as unresolvable when you try to spawn the subagent, check the valid slugs in the Agent tool's error message, pick the closest equivalent (prefer the highest-reasoning tier of the same family), spawn with the valid slug, and open a separate PR to update this table. Do not block the review on the slug issue.
Read references/reviewer-prompt.md and fill in the template with:
references/rubric.mdreferences/code-quality-review.mdThe same filled template goes to all four reviewers, so every model applies the code-quality lens.
Each reviewer produces structured findings as described in the prompt template.
As results come back, build a unified picture:
You are the lead reviewer, a pragmatic senior engineer, not a neutral aggregator.
Read references/lead-judgment.md for the full framework. Reviewers only see a slice of the codebase. You have the full context (the goal, the constraints, the timeline, which tradeoffs were already considered). Use that context aggressively.
Categorize every finding into one of four buckets:
For each finding, include:
Present the verdict in this structure:
[The stated intent paragraph from Step 2]
[Findings that should be addressed. For each: description, which models raised it, why it matters.]
[Findings worth thinking about. For each: description, which models raised it, tradeoff involved.]
[Valid but low-priority. Brief list.]
[Rejected findings with brief rationale. This shows the user what was filtered out and why, so they can override your judgment if they disagree.]
[Where did models agree, where did they diverge, and what does the pattern of agreement/disagreement tell us?]
npx claudepluginhub ennioferreirab/poteto-mode --plugin dev-kitProvides behavioral guidelines to reduce common LLM coding mistakes, focusing on simplicity, surgical changes, assumption surfacing, and verifiable success criteria.
Searches, retrieves, and installs Agent Skills from prompts.chat registry using MCP tools like search_skills and get_skill. Activates for finding skills, browsing catalogs, or extending Claude.