Skill

ywc-security-audit

Audits code against OWASP Top 10 and project-specific threats, focusing on authentication/authorization, external-facing endpoints, and sensitive data handling.

security

Popularity

Stars

Forks

Invocation

How this skill is triggered — by the user, by Claude, or both

Slash command

/ywc-agent-toolkit:ywc-security-audit

User invocable

Model invocable

Inline context

Default effort

Context Preview

The summary Claude sees in its skill listing — used to decide when to auto-load this skill

**Announce at start:** "I'm using the ywc-security-audit skill to inspect the code against OWASP Top 10 and project-specific threats."

Supporting Files

README.en.mdREADME.es.mdREADME.ja.mdREADME.ko.mdREADME.mdREADME.zh.mdreferences/prompt-injection-checklist.md

SKILL.md

144 lines · ~2.9k tokens

Stats

LanguageShell

Stars5

Forks1

MaintenanceExcellent

Last CommitJun 18, 2026

Actions

View Source View Plugin View on GitHub View README

ywc-security-audit

Announce at start: "I'm using the ywc-security-audit skill to inspect the code against OWASP Top 10 and project-specific threats."

Security Agent Skill for deep security analysis.

Rationalization Defense

When tempted to skip a step, check this table first:

Excuse	Reality
"Code looks clean, OWASP scan is overkill"	Clean code can still leak. Walk OWASP Top 10 in order — every item, every time.
"This is internal-only, threat surface is low"	Internal-only ≠ trusted. Insider threat and lateral movement are real. Audit anyway.
"Auth library is well-known, trust it blindly"	Misuse of a good library is the #1 cause of auth bugs. Audit how it is configured.
"Severity feels High, mark it Critical to be safe"	Inflated severity wastes triage time. Use Critical only when exploit + impact are both demonstrable.
"User input is validated upstream, no need at this layer"	Defense in depth. Validate at every trust boundary, not just at the gateway.
"Token/secret/key is just for dev, exposure is fine"	Never. Dev secrets get committed, leak, and become prod credentials. Always flag.
"I cannot exploit it locally, finding is theoretical"	Theoretical findings still belong in the report. Mark as `unverified — theoretical` rather than dropping.
"OWASP scan is too fine-grained to parallelize"	Grouping into 3 domain clusters lets each Sonnet subagent focus deeply on 3-4 items; it also prevents cross-category contamination that degrades severity classification in a monolithic pass.

Violating the letter of these rules is violating the spirit. A clean security report without honest dimensional coverage is dangerous.

Arguments

Parameter	Format	Example	Description
`--code`	`--code <path>`	`--code api/src/middleware/`	Code path to audit (required)
`--format`	`--format markdown\|html`	`--format html`	Output format. Default `markdown`. With `html`, writes a self-contained HTML report to `claudedocs/`. See html-output.md

Execution Steps

Collect Project Context — Read CLAUDE.md, package.json to identify tech stack. Pay special attention to authentication method, deployment environment (internal/external), and security libraries in use
Read Target Code Files — Read all source files under the --code path

Phase 1 — Parallel OWASP Analysis — Use the Task tool to spawn 3 Sonnet subagents in parallel. Each covers a grouped slice of OWASP Top 10. For each item in their slice, subagents must: Grep/AST search for patterns, trace data flow (input → processing → output), and apply project context. When the Claude Code runtime is in use and the named-agent catalog at claude-code/agents/ is installed, prefer subagent_type: ywc-security-engineer so each subagent carries the dedicated security worker persona, Mission, Boundaries, and Return Contract.

Subagent	Model	OWASP Items
Auth & Data	sonnet	A01 Injection · A02 Broken Auth · A03 Sensitive Data Exposure
Web Layer	sonnet	A04 XSS · A05 Broken Access Control · A06 Security Misconfiguration · PI Prompt Injection (LLM-driven surfaces only — user-controlled string → prompt sink; see `references/prompt-injection-checklist.md`)
Infra & Input	sonnet	A07 SSRF · A08 Input Validation · A09 Rate Limiting · A10 Timing Attacks

Prompt-Injection slice (Web Layer sub-category) — when the audit target includes an LLM-driven surface (agent / chatbot / prompt-template system / function-calling pipeline), the Web Layer subagent additionally walks the four items in references/prompt-injection-checklist.md: user-controlled string flowing directly into a prompt, system/user role separation, canary-token + ML-classifier defense, and external-tool / RAG-result sanitization. The checklist defines default severity and conditions for adjustment. Findings surface under the standard severity rubric below and are reported alongside the OWASP A04-A06 items in the Web Layer subagent's output.

Each subagent classifies its findings:

Critical: Immediately exploitable (SQL injection, auth bypass, hardcoded secrets)
High: Conditionally exploitable (SSRF with internal network access, improper auth checks)
Medium: Potential risk (verbose errors, insufficient rate limiting)
Low: Best practice violation (timing attack potential, unnecessary information disclosure)

Each subagent returns:

Confirmed findings — severity, file:line, issue, risk, recommended fix
Advisor candidates — findings meeting the Advisor Escalation Policy conditions below (suspect code chain + hypothesized exploit, ≤100 lines each)

Aggregate Phase 1 Results — Combine findings from all 3 subagents. Deduplicate by {file}:{line}. Cap advisor candidates at advisor_budget (default: 3), prioritizing Critical > High. Log any dropped candidates in the report.
Phase 2 — Advisor Pass — For each surviving advisor candidate, follow the Advisor Escalation Policy section below. Spawn a short Opus subagent via the Task tool with only the bounded excerpt (≤100 lines). Merge verdicts into the findings list.
Output Severity-Classified Security Report

Audit Checklist (OWASP Top 10)

Injection (SQL, Command, LDAP)
Broken Authentication (Token, Session management)
Sensitive Data Exposure (Logging, API Response, Storage)
XSS (Reflected, Stored, DOM-based)
Broken Access Control (Missing Auth Middleware, Privilege Escalation)
Security Misconfiguration (Default Config, Verbose Errors)
SSRF (Unvalidated URLs, Internal Service Access)
Input Validation (Missing Validation, Type Coercion)
Rate Limiting (Missing Rate Limits on Sensitive Endpoints)
Timing Attacks (Non-constant-time Comparisons for Secrets)

LLM-Driven Surface Addendum (run when target uses an LLM SDK)

Prompt Injection — user input → prompt sink, role separation, canary / classifier defenses, external-tool / RAG result sanitization (full audit items + severity table in references/prompt-injection-checklist.md)

Output Format

## Security Audit Result: {target path}

### Summary
- Critical: N, High: M, Medium: K, Low: L

### Findings
1. [{severity}] {file}:{line}
   - Issue: ...
   - Risk: ...
   - Recommended Fix: ...

### Overall Assessment
(Comprehensive security posture summary)

HTML mode (--format html) — emits the same findings as a self-contained HTML report: severity color coding, tab navigation, and a Copy as Markdown button. Structure and conventions follow html-output.md. The Markdown surface is preserved inside the file, so downstream integration is unaffected.

Advisor Escalation Policy

This skill runs the full OWASP Top 10 deep analysis on a single inherited-model executor. Because security findings are the highest-stakes output category in this repository, the executor applies a permissive escalation bar: when a suspected Critical or High finding has indirect evidence, escalate rather than risk mislabeling. This follows Pattern A from advisor-pattern.md — frontier judgment applied at the specific decision points where it carries real value.

Budget: up to 3 Opus advisor calls per invocation. Security gets a slightly larger budget than spec-review because the downside cost of a missed vulnerability is much higher than the downside cost of a missed spec gap. Unused budget is still good; the bar for escalation must still be met.

Escalation conditions — a finding is an advisor candidate when it matches any of the following:

Indirect exploit chain — A parameter flow could enable SSRF, auth bypass, or injection, but the exploit requires two or more hops through functions the executor did not fully trace. The key question for Opus is whether the chain is actually reachable.
Two OWASP categories compete — The same evidence fits two categories equally (for example A01 Broken Access Control and A07 Auth Failures). The correct category affects severity and remediation, so the choice is irreversible once reported.
Business logic flaw (A04) — A04 is the hardest category to judge mechanically because it depends on domain knowledge, not pattern matching. When a suspected business logic flaw has any ambiguity, escalate.
Crypto decision (A02) — The code makes a hashing or encryption choice and the executor cannot tell whether it is appropriate without knowing the threat model and data sensitivity. Frontier judgment with the spec excerpt is the right call.
Critical severity with indirect evidence — Any Critical-suspected finding where the unsafe path requires interpretation rather than direct observation. Direct means the unsafe call sits on a single visible line; indirect means the tainted value flows through multiple transformations before reaching the dangerous sink.

Context payload rules (critical for cost discipline):

Forward only the decision point: the suspect code chain, the spec or threat-model excerpt, and 2-3 bullet points sketching the hypothesized exploit (≤100 lines total).
Do NOT forward: the full audit target, the full project config, secrets, or the entire CLAUDE.md.
The advisor returns a short verdict (≤200 words) containing confirmed severity, a one-line rationale, and either "confirmed" or "adjusted" with the new severity.
Never include the full file, the full project config, or secrets. If the exploit chain needs information outside the snippet, summarize that information in 1-2 sentences rather than pasting it.

Non-goals — do NOT escalate for these:

Trivial OWASP pattern matches — hardcoded secrets, raw string concatenation in SQL queries, unfiltered user strings assigned to innerHTML-style sinks. These are Critical and unambiguous; report them as Phase 1 confirmed findings.
Missing standard hardening — no rate limit on an endpoint, missing security headers, absent CORS tightening. These are A05 misconfigurations with clear remediation.
Low-severity best practice notes — timing attack potential, overly verbose error messages. Advisor adds no value at this severity.
Well-understood vulnerable dependencies (A06) — a pinned CVE version is a mechanical finding.

Report escalations in the output: mark Phase 2 findings with [P2] prefix and include the advisor's verdict. This preserves auditability of which security calls involved frontier judgment and lets the user calibrate their trust in the severity assignments.

Integration

upstream: After implementation or periodic review
downstream: Fix implementation → PR

ywc-security-audit

Popularity

Invocation

Context Preview

Supporting Files

SKILL.md

ywc-security-audit

Popularity

Invocation

Context Preview

Supporting Files

SKILL.md

ywc-security-audit

Rationalization Defense

Arguments

Execution Steps

Audit Checklist (OWASP Top 10)

LLM-Driven Surface Addendum (run when target uses an LLM SDK)

Output Format

Advisor Escalation Policy

Integration

Similar Skills

ywc-security-audit

Rationalization Defense

Arguments

Execution Steps

Audit Checklist (OWASP Top 10)

LLM-Driven Surface Addendum (run when target uses an LLM SDK)

Output Format

Advisor Escalation Policy

Integration

Similar Skills