From ultraship
Performs deep code reviews across correctness, security, performance, and maintainability dimensions. Use before merging PRs or completing tasks. Leverages connected LSP for precise analysis.
How this skill is triggered — by the user, by Claude, or both
Slash command
/ultraship:code-reviewThis skill is limited to the following tools:
These tools are removed from Claude's available pool while this skill is active:
The summary Claude sees in its skill listing — used to decide when to auto-load this skill
Review code the way a principal engineer would — not just "does it work?" but "will this cause problems at 3am?"
Review code the way a principal engineer would — not just "does it work?" but "will this cause problems at 3am?"
Every review should evaluate these dimensions, in order of importance:
Use an LSP if one is connected. If LSP tools are available (check your tools for a language server — TypeScript, Pyright, gopls, rust-analyzer), use them instead of grep-guessing.
find referenceson a changed function tells you the real blast radius;go to definitionconfirms a call signature actually matches; rename/diagnostics surface type errors the eye misses. A review that verifies call sites with an LSP catches breakage that a text-only review ships. If no LSP is connected, fall back to Grep/Glob and say so.
The code must do what it claims to do.
Think like an attacker for every piece of new code.
:id params)Will this work at 10x the current load?
findMany() without take/limit. Will return 1M rows when the table grows.readFileSync, execSync in request handlers. Blocks the event loop.awaits that should be Promise.all()..push(), event listeners added in request handlers.Will the next person (including future-you) understand this in 6 months?
Does this fit the existing patterns, or does it introduce divergence?
Every finding should include a confidence level:
| Confidence | Meaning | Action |
|---|---|---|
| High | This is almost certainly a real issue | Fix before merging |
| Medium | This looks like an issue but context might make it fine | Investigate, fix if confirmed |
| Low | This is a style preference or minor concern | Note for later, don't block merge |
Don't cry wolf. A review that flags 30 "high" issues when only 3 are real trains the developer to ignore reviews. Be precise.
When invoked by /ship, output findings with severity levels (critical/high/medium/low/info) in the same format as other auditors:
{
"category": "code-quality",
"findings": [
{ "severity": "high", "category": "code-quality", "file": "path", "line": N, "message": "description" }
]
}
npx claudepluginhub houseofmvps/ultraship --plugin ultrashipReviews code for security vulnerabilities, correctness bugs, performance issues, and production reliability concerns. Outputs prioritized findings with severity levels.
Conducts thorough code reviews covering security, correctness, performance, maintainability, and testing. Includes automated checks and common vulnerability patterns.
Performs structured code reviews assessing security vulnerabilities, performance issues, maintainability, and best practices with prioritized critical issues and suggestions. For PRs and code feedback.