By cylestio
Scan AI agent code and runtime sessions for OWASP LLM Top 10 vulnerabilities via static and dynamic analysis, apply contextual fixes to issues like prompt injection and data leaks, generate markdown compliance reports with SOC2 mapping, correlate findings with evidence, and gate deployments until critical issues resolve.
Run dynamic runtime analysis on captured AI agent sessions. Analyze token usage, tool calls, behavioral patterns, PII detection, and model pinning. Use when user asks for runtime analysis, dynamic testing, behavioral analysis, or wants to analyze captured agent sessions through the proxy.
Cross-reference static code findings with dynamic runtime observations. Identify VALIDATED (confirmed at runtime) vs UNEXERCISED (never triggered) issues. Use when user asks to correlate, cross-reference static and dynamic findings, or prioritize issues based on runtime evidence.
Debug AI agent workflows by exploring agents, sessions, and events. Investigate behavioral issues and unexpected patterns. Use when user asks to debug, explore sessions, investigate issues, or examine agent behavior at runtime.
Apply intelligent, contextual security fixes to AI agent vulnerabilities. Fix prompt injection, output handling, tool security, data leaks, memory issues, supply chain, and behavioral risks. Use when user says fix, asks to remediate a recommendation (REC-XXX), apply security patches, or resolve vulnerabilities.
Check production deployment readiness for AI agents. Verify all CRITICAL and HIGH severity issues are resolved. Use when user asks about deployment readiness, gate status, blocking issues, or whether their agent is ready for production.
Own this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimOwn this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimBased on adoption, maintenance, documentation, and repository signals. Not a security audit or endorsement.
Analyze and debug AI agents in real-time. Scan your code for vulnerabilities (both statically and dynamically), trace LLM calls, and evaluate runtime behavior—all from a single command.
IDE integration provides MCP query tools for inspecting sessions, risk metrics, and security findings directly in your editor. It also enables static analysis to scan your agent code for vulnerabilities before runtime.
Run these commands to register the marketplace and install the plugin:
/plugin marketplace add cylestio/agent-inspector
/plugin install agent-inspector@cylestio
/agent-inspector:setup
After installation, restart Claude Code for the MCP connection to activate.
Copy this command to Cursor and it will set everything up for you:
Fetch and follow instructions from https://raw.githubusercontent.com/cylestio/agent-inspector/main/integrations/AGENT_INSPECTOR_SETUP.md
After setup, restart Cursor and approve the MCP server when prompted.
Run directly with uvx:
uvx agent-inspector openai # or: anthropic
Install via pipx or pip:
pipx install agent-inspector
agent-inspector openai # or: anthropic
This starts:
| Flag | Description |
|---|---|
--port, -p | Override the proxy server port (default: 4000) |
--ui-port | Override the dashboard port (default: 7100) |
--base-url | Override the LLM provider base URL |
--use-local-storage | Enable persistent SQLite storage for traces |
--local-storage-path | Custom database path (requires --use-local-storage) |
--log-level | Set logging level (DEBUG, INFO, WARNING, ERROR, CRITICAL) |
--no-presidio | Disable Presidio PII detection (enabled by default) |
Point your agent to the proxy:
# OpenAI
client = OpenAI(base_url=f"http://localhost:4000/agent-workflow/{AGENT_WORKFLOW_ID}")
# Anthropic
client = Anthropic(base_url=f"http://localhost:4000/agent-workflow/{AGENT_WORKFLOW_ID}")
Replace AGENT_WORKFLOW_ID with your project identifier (e.g., derived from your git repo name, package name, or folder name).
Open http://localhost:7100 to view the live dashboard.
The proxy automatically detects and groups most identifiers. All headers below are optional and only needed when you want to override the automatic behavior. Add headers via your SDK's extra_headers or default_headers parameter.
An agentic workflow composed of multiple LLM calls with different prompts should be identified using the workflow ID in the base URL:
http://localhost:4000/agent-workflow/{AGENT_WORKFLOW_ID}
This groups all calls from the same agent or application together, regardless of prompt type or conversation.
When an agent executes a series of different LLM conversations as part of a single run or task, you can group them into one session:
x-cylestio-session-id: request-f1a1b2a8
This is useful for multi-step workflows where a classifier, retriever, and generator each make separate calls but belong to the same execution. No automatic detection - must be provided explicitly.
The proxy automatically identifies different conversation types based on the system prompt hash. Each unique system prompt creates a distinct conversation type in the dashboard.
To assign a meaningful name instead of an auto-generated hash, optionally use:
x-cylestio-prompt-id: tool-decision-making
Multi-turn conversations with message history are automatically inferred from the prompt content and conversation structure.
To explicitly track a conversation across API calls, optionally generate your own identifier:
x-cylestio-conversation-id: conv-uuid-here
Attach arbitrary metadata to any LLM call for filtering and analysis. Use comma-separated key:value pairs:
x-cylestio-tags: user:[email protected],env:production,team:backend
Tags appear in the dashboard and can be used to filter sessions by user, environment, feature flag, or any custom dimension.
npx claudepluginhub cylestio/agent-inspectorHarness-native ECC operator layer - 67 agents, 271 skills, 92 legacy command shims, reusable hooks, rules, selective install profiles, and production-ready workflows for Claude Code, Codex, OpenCode, Cursor, and related agent harnesses
Comprehensive skill pack with 66 specialized skills for full-stack developers: 12 language experts (Python, TypeScript, Go, Rust, C++, Swift, Kotlin, C#, PHP, Java, SQL, JavaScript), 10 backend frameworks, 6 frontend/mobile, plus infrastructure, DevOps, security, and testing. Features progressive disclosure architecture for 50% faster loading.
Complete collection of battle-tested Claude Code configs from an Anthropic hackathon winner - agents, skills, hooks, and rules evolved over 10+ months of intensive daily use
Code review uncommitted local changes for bugs, security vulnerabilities, CLAUDE.md compliance, and code clarity
Professional WordPress engineering skills for Claude Code — performance optimization, security auditing, Gutenberg block development, and theme/plugin best practices