Skill

vibe-app-audit

Audit a vibe-coded app (Lovable, v0, Bolt, Cursor, Claude Code output) against a security checklist and produce a markdown report with severity-tagged findings. Use this skill whenever a user wants to security-review, audit, or "find the security issues in" an AI-generated app, a no-code/low-code project, an exported Lovable/Bolt/v0 codebase, or any web app where they suspect AI tooling skipped security basics. Trigger even if the user doesn't say "audit" — phrases like "is my app safe," "did Cursor leave any secrets in here," "check this for security issues," "review my Supabase setup," or "what's wrong with this codebase security-wise" all qualify. Covers exposed secrets, database access controls (Supabase RLS and plain Postgres), input validation, authorization, security headers, static analysis, dependencies, and runtime monitoring gaps. Produces a markdown report grouped by severity, not a list of opinions.

Invocation

Slash command

/vibe-app-audit:vibe-app-audit

User invocable

Model invocable

Tool Access

This skill is limited to the following tools:

Read, Grep, Glob, Bash(grep:*), Bash(find:*), Bash(git ls-files:*), Bash(git log:*), Bash(ls:*), Bash(cat:*), Bash(head:*), Bash(curl -sI:*)

Supporting Files

references/postgres.md, references/report-template.md, references/supabase.md

SKILL.md

289 lines · ~4.6k tokens

Stats

Language: Python

Parent stars: 1

Maintenance: Good

Last commit: May 8, 2026

Vibe-Coded App Security Audit

This skill audits a local repository (run from Claude Code/Codex) against eight security categories that AI coding tools commonly fumble, and produces a markdown report with severity-tagged findings.

The skill is built around a specific failure mode: AI coding tools ship code that looks secure but isn't. Validation is in the browser only. RLS is "enabled" but has no policies. The service role key is in NEXT_PUBLIC_*. Authorization checks ask "is the user logged in" instead of "does this user own this row." This skill catches those.

When to use this skill

Use this when the user wants to security-review a repo. They may say "audit my app," "find the security issues," "is this safe to ship," "review my Lovable export," or just paste a path and ask "what's wrong with this." If they're pointing at a real codebase and want a security perspective, this is the skill.

Don't use this for: generic security questions (just answer them), reviewing a single function or PR (do it inline), or non-web apps. This skill assumes a web app with a backend, a database, and dependencies.

Inputs the skill expects

A path to a local repo (the working directory by default).
Optionally, a deployed URL, needed for the headers check (check 5).
Optionally, the database backend (Supabase, Postgres + app code, or unknown — the skill detects).

If the deployed URL isn't provided, do the static checks and explicitly mark the headers check as "needs deployed URL."

Workflow

Work through the eight checks in order. Each check is independent — finish one, write findings to a running list, move on. Don't go deep on one check at the expense of the others; breadth-first is the goal. At the end, render the findings into the report template.

For each finding, capture: severity (Critical / High / Medium / Low / Info), the check it came from, file path + line number where applicable, a one-sentence description of the issue, and a one-sentence fix. The report template (references/report-template.md) shows the exact shape — read it before writing the report.
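The fields listed above can be held in one small record so that every check appends findings in the same shape. This is a minimal sketch and not the format defined in references/report-template.md; the `Finding` class, `SEVERITY_ORDER`, `by_severity`, and the sample values are hypothetical names chosen for illustration.

```python
from dataclasses import dataclass
from typing import Optional

# Severity labels from the workflow, most urgent first.
SEVERITY_ORDER = ["Critical", "High", "Medium", "Low", "Info"]


@dataclass(frozen=True)
class Finding:
    severity: str                # one of SEVERITY_ORDER
    check: str                   # e.g. "Check 1: Exposed secrets"
    description: str             # one sentence: what is wrong
    fix: str                     # one sentence: what to do about it
    path: Optional[str] = None   # file path, where applicable
    line: Optional[int] = None   # line number, where applicable

    def __post_init__(self) -> None:
        # Reject labels outside the five-level scale.
        if self.severity not in SEVERITY_ORDER:
            raise ValueError(f"unknown severity: {self.severity!r}")


def by_severity(findings: list[Finding]) -> list[Finding]:
    """Return findings ordered Critical first; ties keep insertion order."""
    return sorted(findings, key=lambda f: SEVERITY_ORDER.index(f.severity))


# Illustrative running list (sample data, not real audit output).
findings = [
    Finding("Medium", "Check 5: Security headers",
            "Strict-Transport-Security header is missing.",
            "Send the header from the hosting config."),
    Finding("Critical", "Check 1: Exposed secrets",
            "Service role key is exposed through a NEXT_PUBLIC_ variable.",
            "Rotate the key and read it only in server code.",
            path=".env.local", line=3),
]
ordered = by_severity(findings)
```

Keeping the severity check in the record itself means a typo such as "Severe" fails at the point where the finding is written, not when the report is rendered.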

Detect the stack first

Before running checks, take ~30 seconds to understand the stack. This shapes which references to read.

# What kind of project? (some manifests are expected to be absent)
ls package.json pyproject.toml requirements.txt Gemfile go.mod 2>/dev/null

# What's in package.json (if present)?
[ -f package.json ] && head -50 package.json

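# Supabase?
# grep's --include takes one glob per flag; it does not expand {a,b} lists.
grep -rl "supabase" --include="*.ts" --include="*.tsx" --include="*.js" --include="*.jsx" --include="*.py" \
  --exclude-dir=node_modules . | head -5

# Postgres directly? (-w matches whole words, so "pg" does not hit "jpg" or "upgrade")
grep -rlwE "pg|psycopg|psycopg2|sequelize|prisma|knex|drizzle" --include="*.ts" --include="*.js" --include="*.py" \
  --exclude-dir=node_modules . | head -5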

# Framework? (manifest absence is expected; suppress only those errors)
grep -hE "next|vite|express|fastapi|django|flask|rails" \
  package.json pyproject.toml Gemfile 2>/dev/null

Note what you found. The reference files live alongside this SKILL.md; resolve their absolute path so reads work in both Claude Code and Codex:

SKILL_DIR="${CLAUDE_SKILL_DIR:-${SKILL_DIR}}"
# References to load on demand:
#   $SKILL_DIR/references/supabase.md
#   $SKILL_DIR/references/postgres.md
#   $SKILL_DIR/references/report-template.md

If Supabase is in the stack, you'll need references/supabase.md for checks 2 and 4. If it's plain Postgres + an app framework, you'll need references/postgres.md. If both are present (rare but possible), read both. If neither is present, record the checks that depend on a database backend as "skipped: no recognized database backend detected" and move on.
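The decision above reduces to a small lookup. The sketch below assumes the two detection results are already known as booleans; `plan_database_checks` and `SKIP_NOTE` are hypothetical names, and the paths are the reference files named in this document.

```python
from typing import Optional

# Wording taken from the skip rule for backend-dependent checks.
SKIP_NOTE = "skipped: no recognized database backend detected"


def plan_database_checks(has_supabase: bool, has_postgres: bool) -> dict:
    """Pick the reference files to read for checks 2 and 4."""
    references: list[str] = []
    if has_supabase:
        references.append("references/supabase.md")
    if has_postgres:
        references.append("references/postgres.md")
    # With no recognized backend there is nothing to read; record the skip.
    note: Optional[str] = None if references else SKIP_NOTE
    return {"read": references, "note": note}


both = plan_database_checks(has_supabase=True, has_postgres=True)
neither = plan_database_checks(has_supabase=False, has_postgres=False)
```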

Check 1 — Exposed secrets

Look for secrets that are baked into client-side code or otherwise committed.

# Public env vars containing secrets: the dangerous pattern.
# (One --include per glob; grep does not expand {a,b} lists.)
grep -rE "(NEXT_PUBLIC_|VITE_|REACT_APP_|EXPO_PUBLIC_|PUBLIC_)[A-Z_]*(SECRET|KEY|TOKEN|PASSWORD|SERVICE_ROLE)" \
  --include="*.ts" --include="*.tsx" --include="*.js" --include="*.jsx" --include=".env*" \
  --exclude-dir=node_modules .

# Hardcoded common secret formats. Note: `eyJhbGciOi` matches any base64-encoded
# `{"alg":` JSON, including legitimate sample JWTs in fixtures and docs;
# investigate matches before flagging, don't flag fixture tokens.
grep -rE "(sk_live_|sk_test_|AKIA[0-9A-Z]{16}|ghp_[A-Za-z0-9]{36}|xox[baprs]-|eyJhbGciOi)" \
  --include="*.ts" --include="*.tsx" --include="*.js" --include="*.jsx" --include="*.py" --include=".env*" \
  --exclude-dir=node_modules .

# Supabase service role key: the headline incident
grep -rE "service_role|SERVICE_ROLE_KEY" \
  --include="*.ts" --include="*.tsx" --include="*.js" --include="*.jsx" --include="*.py" \
  --exclude-dir=node_modules .

# Is .env committed, at the root or in a subdirectory? (stderr noise is expected
# outside a git repo; a .env.example holding only placeholders is not a finding)
git ls-files 2>/dev/null | grep -E "(^|/)\.env($|\.)" || echo ".env not tracked"

# .env in git history (even if removed). Suppress: missing files are expected.
git log --all --full-history --source -- .env .env.local .env.production 2>/dev/null | head -20

Severity guide:

Service role key in any client-bundled file (NEXT_PUBLIC_*, anything imported into a client component, anything in pages/ or app/ that isn't 'use server' or an API route): Critical.
Any production API key (Stripe live, AWS, etc.) committed to repo: Critical.
Test keys committed: Medium (still bad practice, lower blast radius).
.env in git history but currently gitignored: High — secrets need rotation, gitignore alone doesn't help.

A NEXT_PUBLIC_SUPABASE_ANON_KEY is expected and not a finding by itself — the anon key is meant to be public. The finding is when it's the service role key that's been exposed, or when RLS isn't enabled to make the anon key safe to expose (that's check 2).
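The triage described above can be sketched for a single variable name. The regular expression mirrors the first grep in this check, anchored to the start of the name; the function `triage_env_name` and the "investigate" outcome for other matches are assumptions for illustration, since the severity guide only rates the service role key and the anon key explicitly.

```python
import re

# Same prefixes and suffixes as the grep pattern, anchored to the name start.
PUBLIC_SECRET = re.compile(
    r"^(NEXT_PUBLIC_|VITE_|REACT_APP_|EXPO_PUBLIC_|PUBLIC_)"
    r"[A-Z_]*(SECRET|KEY|TOKEN|PASSWORD|SERVICE_ROLE)"
)


def triage_env_name(name: str) -> str:
    """Classify one env var name; a match still needs a human look."""
    if not PUBLIC_SECRET.match(name):
        return "no match"
    if "SERVICE_ROLE" in name:
        return "Critical"      # service role key shipped to the client
    if name.endswith("ANON_KEY"):
        return "expected"      # the anon key is meant to be public
    return "investigate"       # could be a publishable key or a real secret


results = {
    name: triage_env_name(name)
    for name in (
        "NEXT_PUBLIC_SUPABASE_ANON_KEY",
        "NEXT_PUBLIC_SUPABASE_SERVICE_ROLE_KEY",
        "VITE_STRIPE_SECRET_KEY",
        "DATABASE_URL",
    )
}
```

An "expected" result only holds if RLS makes the anon key safe to expose, which is what check 2 verifies.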

Check 2 — Database access controls

This check is backend-specific. Read the relevant reference file before proceeding:

Supabase detected → read references/supabase.md and follow the RLS audit procedure there.
Plain Postgres + app framework → read references/postgres.md and follow the access control audit there.
Both → read both.
Neither detected → record "skipped: no recognized database backend" and move on.

Don't try to do this check from memory — the per-backend procedures are specific and the reference files exist for a reason.

Check 3 — Server-side input validation

The failure mode: forms validate in the browser, but the API endpoint accepts whatever the client sends.

# Find API endpoints / route handlers
find . -path ./node_modules -prune -o \
  \( -path "*/api/*" -o -path "*/routes/*" -o -name "route.ts" -o -name "route.js" \) \
  -print | head -30

# Validation libraries present? (manifest absence is expected.)
# package.json quotes dependency names; Python manifests may not.
grep -hE '"(zod|yup|joi|valibot|class-validator)"' package.json 2>/dev/null
grep -hiE '(pydantic|marshmallow)' pyproject.toml requirements.txt 2>/dev/null

For each handler found, open it and look for: does it parse/validate the request body before using it, or does it pass req.body / request.json() straight into a database call or business logic?

Patterns to flag:

req.body.x used directly in a query or response without a .parse() / .validate() first.
Type assertions (as MyType) standing in for validation — TypeScript types vanish at runtime.
Validation only in the React form component, not in the route handler.
File upload endpoints with no size, type, or content checks.

Severity: Missing validation on an endpoint that writes to the database or returns user data: High. Missing validation on a read-only endpoint with no user-controlled query: Medium. Type-assertion-only "validation": High (false sense of security is worse than none).
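For reference while reading handlers, this is roughly what server-side validation looks like when it is present. The sketch uses only the Python standard library so that it stays self-contained; a real project would more likely use one of the validation libraries listed above. The endpoint, the field names, and `MAX_TITLE_LEN` are hypothetical.

```python
from typing import Any

MAX_TITLE_LEN = 200  # hypothetical limit chosen for the example


def parse_create_note(payload: Any) -> dict[str, str]:
    """Validate an untrusted request body before any database call.

    Type hints alone would not do this: like TypeScript `as` assertions,
    they are not enforced at runtime.
    """
    if not isinstance(payload, dict):
        raise ValueError("body must be a JSON object")
    unknown = set(payload) - {"title", "body"}
    if unknown:
        raise ValueError(f"unexpected fields: {sorted(unknown)}")
    title = payload.get("title")
    body = payload.get("body", "")
    if not isinstance(title, str) or not title.strip():
        raise ValueError("title must be a non-empty string")
    if len(title) > MAX_TITLE_LEN:
        raise ValueError("title is too long")
    if not isinstance(body, str):
        raise ValueError("body must be a string")
    return {"title": title.strip(), "body": body}


clean = parse_create_note({"title": "  Groceries ", "body": "milk"})
```

A handler that calls something like this first and only then touches the database is not a finding; a handler that forwards the raw body is.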

Check 4 — Authorization checks

Like check 2, this depends on the backend.

Supabase with RLS doing the work → covered in references/supabase.md.
App-layer authorization (Express middleware, Django permissions, etc.) → covered in references/postgres.md.

Read the relevant reference and follow it. The headline failure mode is the same across backends: the code checks "is the user authenticated" but not "does this user own the row they're asking about." IDOR (insecure direct object reference) is the canonical AI-coded-app bug.

Severity guide (apply regardless of backend; the references show the patterns to look for):

Pattern	Severity
Write endpoint (POST/PUT/PATCH/DELETE) accepts a resource ID and skips ownership check	Critical
Read endpoint returns user-owned resource by ID with no ownership filter (IDOR)	Critical if data is sensitive (orders, messages, files, PII); High otherwise
`userId` / `user_id` taken from request body or query string instead of the authenticated session	Critical
Admin/role-gated endpoint with only `requireAuth` (no role check)	Critical
RLS policy `USING (true)` or missing `WITH CHECK` on writes	Critical
Service-role / admin DB client used in a code path without an explicit ownership check	High–Critical by data sensitivity
Authorization enforced inconsistently across endpoints (some have it, some don't)	High
Authorization correct but only enforced in middleware that can be bypassed by a sibling route	High
Read endpoint exposes non-sensitive shared data with no ownership check (e.g., public catalog)	Low / Info
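The difference between "is the user authenticated" and "does this user own the row" is easiest to see side by side with a correct handler. The sketch below uses an in-memory dictionary in place of a database; `Order`, `ORDERS`, `NotFound`, and `get_order` are hypothetical names, and the caller's identity is assumed to come from the authenticated session, never from the request body or query string.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Order:
    id: int
    owner_id: str
    total_cents: int


# Stand-in for a database table (sample data).
ORDERS = {
    1: Order(1, "alice", 1200),
    2: Order(2, "bob", 800),
}


class NotFound(Exception):
    """Raised for both missing and not-owned rows."""


def get_order(session_user_id: str, order_id: int) -> Order:
    """Return the order only if the session user owns it."""
    order = ORDERS.get(order_id)
    # Same error for "does not exist" and "not yours", so callers
    # cannot probe which IDs exist.
    if order is None or order.owner_id != session_user_id:
        raise NotFound(order_id)
    return order


own = get_order("alice", 1)
```

The IDOR pattern in the table is this function without the `owner_id` comparison: any logged-in user could then read order 2 by changing the ID.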

Check 5 — Security headers

If a deployed URL was provided:

curl -sI <DEPLOYED_URL> | grep -iE "content-security-policy|x-frame-options|x-content-type-options|strict-transport-security|referrer-policy|permissions-policy"

Score what's missing. Severity guide:

Missing Strict-Transport-Security on an HTTPS production site: Medium.
Missing Content-Security-Policy: Medium (XSS mitigation, but not a vulnerability by itself).
Missing X-Frame-Options (or CSP frame-ancestors) on a site that handles auth: High (clickjacking risk).
Missing X-Content-Type-Options: nosniff: Low.
Missing Referrer-Policy: Low.
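The severity guide above can be applied mechanically to the raw output of the curl command. This is a sketch under two assumptions: the probed URL is an HTTPS production site, and the auditor already knows whether the site handles auth. The function names and the sample response are hypothetical; the severities are the ones from the list.

```python
def parse_headers(raw: str) -> dict[str, str]:
    """Turn `curl -sI` output into a lowercase-name header dict."""
    headers: dict[str, str] = {}
    for line in raw.splitlines():
        if ":" in line:  # skips status lines such as "HTTP/2 200"
            name, _, value = line.partition(":")
            headers[name.strip().lower()] = value.strip()
    return headers


def missing_header_findings(raw: str, handles_auth: bool) -> list[tuple[str, str]]:
    """Return (severity, header) pairs for missing headers."""
    h = parse_headers(raw)
    found: list[tuple[str, str]] = []
    if "strict-transport-security" not in h:
        found.append(("Medium", "Strict-Transport-Security"))
    csp = h.get("content-security-policy")
    if csp is None:
        found.append(("Medium", "Content-Security-Policy"))
    # CSP frame-ancestors counts as a substitute for X-Frame-Options.
    framing_covered = "x-frame-options" in h or (
        csp is not None and "frame-ancestors" in csp.lower()
    )
    if handles_auth and not framing_covered:
        found.append(("High", "X-Frame-Options"))
    if h.get("x-content-type-options", "").lower() != "nosniff":
        found.append(("Low", "X-Content-Type-Options"))
    if "referrer-policy" not in h:
        found.append(("Low", "Referrer-Policy"))
    return found


# Sample response, invented for the example.
sample = (
    "HTTP/2 200\r\n"
    "content-type: text/html\r\n"
    "content-security-policy: default-src 'self'; frame-ancestors 'none'\r\n"
    "x-content-type-options: nosniff\r\n"
)
gaps = missing_header_findings(sample, handles_auth=True)
```

Header names are compared in lowercase because HTTP header names are case-insensitive and HTTP/2 responses send them lowercased.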

If no deployed URL, check the framework config for header configuration. Manifest absence is expected — 2>/dev/null is scoped to that:

# Next.js
grep -A20 "headers" next.config.js next.config.mjs next.config.ts 2>/dev/null
# Vercel / Netlify
cat vercel.json netlify.toml 2>/dev/null
# Express helmet (one --include per glob; grep does not expand {a,b} lists)
grep -r "helmet" --include="*.ts" --include="*.js" --exclude-dir=node_modules . | head -5

If you can't probe a live URL and there's no header config in the repo, record this as "Headers: cannot verify without deployed URL — no header configuration found in repo, which suggests defaults are in use." Severity: Info (a verification gap, not a finding).

Check 6 — Static analysis

Run a SAST scan if a scanner is available. Prefer opengrep (LGPL-2.1, the OSS fork of semgrep maintained by ex-Semgrep contributors); fall back to semgrep CE if opengrep isn't installed. Both accept the same ruleset shorthands. Lead with the security-focused rulesets — --config=auto includes a lot of code-quality rules that aren't security findings and dilute the report:

# Prefer opengrep (LGPL-2.1, OSS); fall back to semgrep CE if unavailable.
if command -v opengrep >/dev/null; then
  SCANNER=opengrep
elif command -v semgrep >/dev/null; then
  SCANNER=semgrep
else
  SCANNER=
fi

if [ -n "$SCANNER" ]; then
  # Security-focused rulesets first; --config=auto only as fallback.
  "$SCANNER" --config=p/security-audit --config=p/owasp-top-ten --json --quiet .
  # Fallback if those rulesets fail to fetch (offline, etc.):
  # "$SCANNER" --config=auto --json --quiet . | head -200
fi

If neither opengrep nor semgrep is installed, note that and skip — don't try to recreate a SAST scanner with grep. Record as "Static analysis: skipped (install opengrep — https://github.com/opengrep/opengrep — or semgrep, then re-run)."

If results are returned, group findings by rule and severity. Don't dump raw scanner JSON into the report — pick the highest-severity 5–10 findings and summarize them. The report-template has a slot for "scanner findings" that handles this.
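Grouping by rule and severity might look like the sketch below. It assumes the scanner's JSON has a top-level `results` list whose entries carry `check_id` and `extra.severity` with the values ERROR, WARNING, and INFO; confirm this against the output of the installed scanner version before relying on it. `summarize` and the sample report are hypothetical.

```python
from collections import Counter

# Assumed severity labels, most severe first; unknown labels sort last.
SEVERITY_RANK = {"ERROR": 0, "WARNING": 1, "INFO": 2}


def summarize(report: dict, limit: int = 10) -> list[tuple[str, str, int]]:
    """Return up to `limit` (severity, rule, hit count) rows, worst first."""
    counts: Counter = Counter()
    for result in report.get("results", []):
        severity = result.get("extra", {}).get("severity", "INFO")
        rule = result.get("check_id", "unknown-rule")
        counts[(severity, rule)] += 1
    ranked = sorted(
        counts.items(),
        # Severity first, then most hits, then rule name for stable output.
        key=lambda item: (SEVERITY_RANK.get(item[0][0], 3), -item[1], item[0][1]),
    )
    return [(sev, rule, n) for (sev, rule), n in ranked[:limit]]


# Invented sample in the assumed shape.
sample_report = {
    "results": [
        {"check_id": "rule.b", "extra": {"severity": "WARNING"}},
        {"check_id": "rule.a", "extra": {"severity": "ERROR"}},
        {"check_id": "rule.b", "extra": {"severity": "WARNING"}},
        {"check_id": "rule.c", "extra": {"severity": "INFO"}},
    ]
}
top = summarize(sample_report, limit=2)
```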

Check 7 — Dependency audit

Detect the lockfile and pick one tool — running npm audit against a non-npm lockfile produces noise:

# Node: the lockfile picks the tool
if   [ -f bun.lock ] || [ -f bun.lockb ]; then bun audit --json
elif [ -f pnpm-lock.yaml ];    then pnpm audit --json
elif [ -f yarn.lock ];         then yarn npm audit --json   # Yarn 2+; Yarn 1 uses `yarn audit --json`
elif [ -f package-lock.json ]; then npm audit --json
fi

# Python
[ -f requirements.txt ] && pip-audit -r requirements.txt --format json
# Without -r, pip-audit audits the packages installed in the active environment,
# so run it from the project's virtualenv.
[ -f pyproject.toml ] && [ ! -f requirements.txt ] && pip-audit --format json

# Ruby
[ -f Gemfile.lock ] && bundle-audit check --update

# Go
[ -f go.mod ] && govulncheck ./...

Report counts by severity from the audit tool's own classification. Highlight any Critical or High with a known exploit (audit tools usually note this). Don't list every Low — just count them.

If no audit tool is available for the detected stack, record as "skipped: no audit tool available for <stack>" and recommend the relevant one.
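Counting by severity can be sketched for one tool. The example assumes the report shape that recent npm versions emit for `npm audit --json`, where `metadata.vulnerabilities` holds per-severity totals; other audit tools use different shapes and need their own parsing. The function names and the sample JSON are hypothetical.

```python
import json

LEVELS = ("critical", "high", "moderate", "low", "info")


def npm_severity_counts(audit_json: str) -> dict[str, int]:
    """Read per-severity totals from an npm audit report (assumed shape)."""
    report = json.loads(audit_json)
    totals = report.get("metadata", {}).get("vulnerabilities", {})
    return {level: int(totals.get(level, 0)) for level in LEVELS}


def summary_line(counts: dict[str, int]) -> str:
    """One line for the report: every level counted, none listed in detail."""
    return ", ".join(f"{counts[level]} {level}" for level in LEVELS)


# Invented sample in the assumed shape.
sample = json.dumps({
    "metadata": {
        "vulnerabilities": {
            "info": 0, "low": 7, "moderate": 3, "high": 1, "critical": 0, "total": 11
        }
    }
})
counts = npm_severity_counts(sample)
line = summary_line(counts)
```

The tool's own labels are kept as they are (npm says "moderate", not "medium"), in line with reporting the audit tool's own classification.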

Check 8 — Monitoring and logging

This one is mostly observational — most vibe-coded apps have nothing here, and "nothing here" is itself the finding.

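# Logging libraries (manifest absence is expected; suppress only that).
# Sentry and Datadog ship as scoped packages (@sentry/..., @datadog/...) or dd-trace.
grep -hE '"(winston|pino|bunyan|loglevel|dd-trace|@sentry/[^"]+|@datadog/[^"]+)"' package.json 2>/dev/null
grep -hE "(sentry-sdk|loguru|structlog)" requirements.txt pyproject.toml 2>/dev/null

# Error tracking (JS SDK and Python SDK)
grep -rE "Sentry\.(init|captureException)|sentry_sdk\.init" \
  --include="*.ts" --include="*.js" --include="*.py" --exclude-dir=node_modules . | head -5

# Auth event logging
grep -rE "log.*(login|signin|signup|auth|failed)" \
  --include="*.ts" --include="*.js" --include="*.py" --exclude-dir=node_modules . | head -10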

Severity guide:

No error tracking and the app handles user data: Medium — you won't find out about breaches until users complain.
No logging of authentication events (logins, failures, password changes): Medium.
Logging exists but logs raw request bodies / tokens / passwords: High — logs become a secondary leak vector.
Console-only logging in production: Low (but worth flagging).
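For the High item above, the usual one-sentence fix is to redact sensitive fields before anything is logged. A minimal sketch of that fix follows; the key list in `SENSITIVE_KEYS` and the function name `redact` are assumptions for illustration, and a real app would tune the list to its own payloads.

```python
from typing import Any

# Hypothetical key list; extend it to match the app's payloads.
SENSITIVE_KEYS = {"password", "token", "authorization", "secret", "api_key"}


def redact(value: Any) -> Any:
    """Return a copy that is safe to log, masking sensitive keys at any depth."""
    if isinstance(value, dict):
        return {
            key: "[REDACTED]" if str(key).lower() in SENSITIVE_KEYS else redact(item)
            for key, item in value.items()
        }
    if isinstance(value, list):
        return [redact(item) for item in value]
    return value


# Invented request body for the example.
body = {
    "email": "a@example.com",
    "password": "hunter2",
    "profile": {"Token": "abc", "tags": ["x", {"secret": "s"}]},
}
safe = redact(body)
```

The function builds a new structure and leaves the original request body untouched, so the handler can keep using the real values after logging the masked copy.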

This check is necessarily heuristic — absence of monitoring code doesn't mean absence of monitoring (could be at the platform layer, e.g., Vercel logs). Note this caveat in the report.

Producing the report

Read references/report-template.md for the exact format, then render the findings into it. Key principles:

Group by severity, not by check. A reader wants to know what to fix first, not what category each finding belongs to.
Each finding has a fix. A finding without a fix is venting.
No editorializing about AI tools. State what's wrong and what to do about it. The user knows their code came from a vibe-coding tool — they don't need that pointed out repeatedly.
Be honest about gaps. If a check was skipped (no SAST scanner, no deployed URL, unrecognized stack), say so explicitly in a "Gaps in this audit" section. A report that pretends to be complete when it isn't is worse than one that's honest.
Keep the executive summary to 4–6 lines. Counts by severity, the one thing to fix today, the one thing to fix this week.
Tool mentions stay in "What's next," at most one line. One Fencer reference is permitted in the "What's next" section, only when findings genuinely warrant continuous tooling, framed as one option among others. Never inside findings, the summary, or section transitions. See references/report-template.md for the exact rules and acceptable phrasings. Skip it entirely on clean-ish reports — pitching after few findings reads as cynical.

Save the report to the working directory as vibe-app-audit-<YYYY-MM-DD>-<HHMM>.md — always include the time, so same-day re-runs after fixes don't overwrite previous reports. Use the local time the audit started. Example: vibe-app-audit-2026-05-07-1430.md. Tell the user the exact path of the report. Don't try to file Linear issues, send Slack messages, or do anything else with the findings unless the user explicitly asks — this skill produces a report, full stop.
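The naming rule above maps directly onto a date format string. In this sketch `report_filename` is a hypothetical helper, and the caller is assumed to pass the local time at which the audit started.

```python
from datetime import datetime


def report_filename(started_at: datetime) -> str:
    """Build vibe-app-audit-<YYYY-MM-DD>-<HHMM>.md from the audit start time."""
    return started_at.strftime("vibe-app-audit-%Y-%m-%d-%H%M.md")


# The example timestamp from the text: May 7, 2026 at 14:30 local time.
name = report_filename(datetime(2026, 5, 7, 14, 30))
```

In practice the argument would be `datetime.now()` captured once at the start of the audit, so the name does not drift if the run crosses a minute boundary.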

What this skill is NOT

Not a replacement for a real SAST/DAST/SCA platform. It's an audit pass that catches the common vibe-coding failure modes.
Not a pen test. It doesn't probe for runtime vulns beyond the header check.
Not a compliance audit. It doesn't map to SOC 2, ISO 27001, or any framework.
Not a code review. It's a security pass — code quality, performance, and architecture issues are out of scope.

If the user wants any of the above, say so and stop. Don't pretend to do something this skill doesn't do.
