From ruflo-aidefence
Scan inputs for prompt injection, unsafe content, and adversarial attacks using AIDefence
How this skill is triggered — by the user, by Claude, or both
Slash command
/ruflo-aidefence:safety-scan <input-text><input-text>This skill is limited to the following tools:
The summary Claude sees in its skill listing — used to decide when to auto-load this skill
Scan content for prompt injection, jailbreak attempts, and unsafe patterns.
Scan content for prompt injection, jailbreak attempts, and unsafe patterns.
Before processing untrusted input (user submissions, API payloads, webhook data), scan it to detect prompt injection, adversarial content, or policy violations.
mcp__claude-flow__aidefence_is_safe with the input text for a boolean safe/unsafe resultmcp__claude-flow__aidefence_analyze for detailed threat classification and confidence scoresmcp__claude-flow__aidefence_scan for comprehensive multi-layer scanningmcp__claude-flow__aidefence_learn with confirmed threats to improve detectionmcp__claude-flow__aidefence_stats for detection rates and false positive metricsProvides CDSS development patterns for drug interaction checking, dose validation, clinical scoring (NEWS2, qSOFA), and alert classification integrated into EMR workflows.
npx claudepluginhub digitalcrest01/ruflow --plugin ruflo-aidefence