Visual context skills for Claude Code - clipboard, screenshots, and video analysis
npx claudepluginhub ellyseum/claude-visionVisual context skills for Claude Code - clipboard, screenshots, and video analysis
Visual context skills for Claude Code. Gives Claude the ability to see your clipboard, screenshots, and videos.
Claude Code can technically read images—you just paste in the file path and it'll analyze them. But the workflow is clunky:
This plugin fixes all of that. Instead of hunting for file paths, just:
/clipboard what's this?
/screenshot explain this error
/video summarize this tutorial
Claude doesn't natively support video. But with some clever extraction, we can give it Gemini-like video understanding:
The /video skill spawns a dedicated analysis agent with a fresh 200k context. This means:
The result: Claude can "watch" videos by analyzing representative frames and reading the transcript. It handles tutorials, error recordings, meetings, and lectures surprisingly well.
| Skill | Description |
|---|---|
/clipboard | Read text or images from your clipboard |
/screenshot | Analyze your latest screenshot |
/video | Analyze YouTube videos or local recordings |
/claude-vision-setup | Interactive setup wizard |
# Add the marketplace and install the plugin
/plugin marketplace add ellyseum/claude-plugins
/plugin install claude-vision
# Run setup (auto-runs on first use of any skill)
/claude-vision-setup
| Image | Size | Includes | Use Case |
|---|---|---|---|
| Lite | ~500 MB | ffmpeg, yt-dlp | YouTube, screen recordings |
| Full | ~10 GB | ffmpeg, yt-dlp, whisper | Local videos needing transcription |
Which should I choose?
Whisper transcription speed:
| Setup | Speed | 1 hour video |
|---|---|---|
| CPU | ~10x realtime | ~6 minutes |
| GPU (CUDA) | ~50x realtime | ~1 minute |
To enable GPU:
cv-run will auto-detect and enable GPU| Feature | WSL | macOS | Linux |
|---|---|---|---|
| Clipboard (text) | Yes | Yes | Yes (xclip) |
| Clipboard (image) | Yes | Yes (pngpaste) | Yes (xclip) |
| Screenshot | Yes | Yes | Yes |
| Video analysis | Yes | Yes | Yes |
This plugin runs through WSL (Windows Subsystem for Linux), not native Windows. Here's why:
Docker Desktop for Windows requires WSL2 anyway. If you have Docker installed on Windows, you already have WSL2. The plugin just runs inside that same Linux environment where Docker lives.
This keeps the codebase simple—one set of bash scripts instead of maintaining parallel PowerShell versions.
Need native Windows support? If you have a use case that requires native Windows commands or a non-Docker workflow, open an issue and we'll consider it.
Clipboard/screenshot:
brew install pngpaste (for images)apt install xclipVideo processing: Docker (required)
/clipboard what is this?
/clipboard explain this code
/clipboard describe what you see
Auto-detects your screenshot directory (configured during setup) and finds the most recent file by timestamp.