By s3cr1z
Probe the security and safety of AI applications, agents, and foundation models. Orchestrates adversarial attack workflows to discover vulnerabilities in LLMs, agentic systems, MCP servers, multi-agent architectures, RAG pipelines, IDE/code agents, and custom AI endpoints before they are exploited. Covers jailbreaking, prompt injection, data exfiltration, tool manipulation, reasoning attacks, guardrail bypass, and more — mapped to OWASP LLM Top 10, OWASP ASI01-ASI10, MITRE ATLAS, and NIST AI RMF compliance frameworks. 61 attack algorithms, 547 transforms, 141 scorers, 260 bundled harm goals across 25 sub-categories in safety, security, and agentic tiers.
Based on adoption, maintenance, documentation, and repository signals. Not a security audit or endorsement.
Guide for reading and interpreting AIRT analytics output including ASR, risk scores, severity, and compliance metrics
Decision tree for selecting AIRT attacks based on goals, target type, and constraints
Maps AIRT attacks, transforms, and scorers to OWASP LLM Top 10, OWASP ASI (ASI01-ASI10), MITRE ATLAS, NIST AI RMF, and Google SAIF
Diagnose and fix common errors in AIRT attack generation, execution, and results collection
Complete catalog of all 84 scorers with exact names, categories, and when to use each
Uses power tools
Uses Bash, Write, or Edit tools
This is the source repo for the capabilities Dreadnode publishes to app.dreadnode.io. A capability is a directory — a manifest plus any combination of agents, tools, skills, and MCP servers — that a Dreadnode runtime picks up and loads:
ai-red-teaming/
capability.yaml # manifest
agents/ # markdown prompts
tools/ # python @tool functions
skills/ # SKILL.md packs
dn capability install dreadnode/ai-red-teaming (swap in any name from capabilities/)dn capability install ./capabilities/ai-red-teaming symlinks the directory into your runtime, so edits go live on reloaddn, press Ctrl+P, filter for dreadnode/dn is the Dreadnode CLI — see getting-started to install and authenticate. Full install reference for capabilities lives at docs.dreadnode.io/capabilities/installing.
Every directory under capabilities/ is a shipped, working example. Read one alongside the docs:
Every skill in this repo is scanned with cisco-ai-defense/skill-scanner for prompt injection, data exfiltration, tool-chaining abuse, and supply chain risk. CI fails on HIGH+ findings and uploads SARIF reports to GitHub Code Scanning. The repo policy in scan-policy.yaml tunes the scanner for security-focused content.
just security-scan # scan all capabilities
just security-scan web-security # scan one capability
just security-scan behavioral="true" # deep dataflow analysis
This repo is published for reference, not as a contribution target — we don't generally accept external PRs that add new capabilities. See CONTRIBUTING.md for what's useful to send and how to build your own capabilities instead.
Each capability declares its license in its capability.yaml.
Own this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimOwn this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimnpx claudepluginhub s3cr1z/capabilities --plugin ai-red-teamingBloodHound Enterprise integration via the v2 REST API. HMAC-signed requests (long-lived integration) or JWT bearer (interactive); full coverage of attack-path findings, asset groups + tier-zero curation, AD/Azure/OpenGraph entity walks, raw + saved Cypher, data ingestion (SharpHound/AzureHound uploads), posture trending, and audit logs. Complementary to the existing bloodhound capability — that one talks Bolt to a local CE Neo4j; this one talks REST to a hosted BHE deployment.
Memory forensics and DFIR triage via Volatility3. Curated tools for process enumeration, network artifacts, code injection, credential extraction, registry analysis, and YARA over memory images, plus playbooks for triage, injection / credential / persistence hunts, and YARA-based IoC sweeping.
BloodHound CE integration for Active Directory attack path analysis. Graph-based queries against Neo4j for domain enumeration, tier zero identification, Kerberos attack surfaces, delegation abuse, PKI/ADCS vulnerabilities, and Azure/Entra attack paths.
Read-only GhostWriter integration. Query clients, projects, findings, objectives, targets, scope, deconflictions, evidence, observations, reports, infrastructure (servers and domains), activity logs, white cards, and notes without modifying any GhostWriter state.
Web application penetration testing with 30+ attack technique playbooks covering request smuggling, cache poisoning, SSRF, SSTI, DOM vulnerabilities, authentication bypasses, parser differentials, and client-side attacks. Includes HTTP client tooling, Caido proxy integration via MCP, credential management, DNS rebinding, phone verification, and vulnerability verification.
v9.44.1 — Patch release for Gemini environment/version detection and qwen auth gating. Run /octo:setup.
Harness-native ECC operator layer - 67 agents, 271 skills, 92 legacy command shims, reusable hooks, rules, selective install profiles, and production-ready workflows for Claude Code, Codex, OpenCode, Cursor, and related agent harnesses
Comprehensive skill pack with 66 specialized skills for full-stack developers: 12 language experts (Python, TypeScript, Go, Rust, C++, Swift, Kotlin, C#, PHP, Java, SQL, JavaScript), 10 backend frameworks, 6 frontend/mobile, plus infrastructure, DevOps, security, and testing. Features progressive disclosure architecture for 50% faster loading.
A growing collection of Claude-compatible academic workflow bundles. Covers scientific figures, manuscript writing and polishing, reviewer assessment, citation retrieval, data availability, paper reading, literature search, response letters, paper-to-PPTX conversion, and evidence-grounded Chinese invention patent drafting. Rules are organized as reusable skill folders with explicit workflows and quality checks.
Persistent file-based planning for AI coding agents. Crash-proof markdown plans (task_plan.md, findings.md, progress.md) that survive context loss and /clear, with an opt-in completion gate and multi-agent shared state. Manus-style. Works with Claude Code, Codex CLI, Cursor, Kiro, OpenCode and 60+ agents via the SKILL.md standard. Includes Arabic, German, Spanish, and Chinese (Simplified and Traditional).
Binary reverse engineering, malware analysis, firmware security, and software protection research for authorized security research, CTF competitions, and defensive security