From llm-security
Test an LLM feature for direct and indirect prompt injection using a structured payload set, then record what succeeded and how to mitigate it. Use when assessing a chatbot, copilot, RAG app, or agent for input-handling weaknesses. Authorized testing only.
How this skill is triggered — by the user, by Claude, or both
Slash command
/llm-security:prompt-injection-testThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
Evidence-backed findings on whether the target can be made to ignore its
Evidence-backed findings on whether the target can be made to ignore its instructions, leak its system prompt, exfiltrate data, or misuse tools — via direct or indirect injection.
A results table: payload class · payload summary · channel · result · evidence ·
mitigation. Route confirmed issues through security-reporting:finding.
Indirect injection is the higher-impact, more-missed class — always test the RAG/agent ingestion paths, not just the chat box. Keep payloads benign in effect (prove the control gap; don't cause real damage).
Provides CDSS development patterns for drug interaction checking, dose validation, clinical scoring (NEWS2, qSOFA), and alert classification integrated into EMR workflows.
npx claudepluginhub jassics/awesome-claude-security --plugin llm-security