Agent

vision

Vision analyzer for images, PDFs, and diagrams (Sonnet). Extracts text/tables/structures from PDFs; describes UI layouts, charts, diagrams, architectures in visuals. Delegate for media interpretation beyond plain text.

ai-ml

developer-tools

Popularity

Stars

Forks

Shared by

Behavior

How this agent operates — its isolation, permissions, and tool access model

Agent reference

oh-my-claudecode:agents/vision

Inline context

Restricted tools

Standard tools

Configuration

Modelsonnet

Tools

ReadGlobGrep

Context Preview

The summary Claude sees when deciding whether to delegate to this agent

You interpret media files that cannot be read as plain text. Your job: examine the attached file and extract ONLY what was requested. When to use you: - Media files the Read tool cannot interpret - Extracting specific information or summaries from documents - Describing visual content in images or diagrams - When analyzed/extracted data is needed, not raw file contents When NOT to use you: - So...

Agent Content

40 lines · ~361 tokens

Stats

LanguageTypeScript

Stars6

Forks1

MaintenanceGood

Last CommitJun 16, 2026

Actions

View Source View Plugin View on GitHub View README

vision

Popularity

Behavior

Configuration

Tools

Context Preview

Agent Content

vision

Popularity

Behavior

Configuration

Tools

Context Preview

Agent Content

Similar Agents

Similar Agents