By vusallyv
Let Claude 'watch' videos: extracts scene-change frames + audio transcript from any video URL or local file.
Claude Code plugin that lets Claude "watch" videos.
Claude can read images natively but not video. This plugin extracts scene-change frames + audio transcript from any video — local file, public URL (Loom/YouTube/Vimeo/raw mp4), or private URL with auth — and feeds them to Claude as images + text.
/plugin marketplace add https://github.com/vusallyv/video-context-plugin.git
/plugin install video-context@video-context-marketplace
/video-context:setup
/video-context:setup installs ffmpeg, yt-dlp, whisper-cpp, and the whisper model. The step is optional on macOS, where extract.sh auto-installs missing binaries on first use via brew, but the whisper model (~150 MB) is only fetched by setup.
Private repo: works with your gh auth (or GITHUB_TOKEN).
Just paste a video link or path and ask Claude to analyze it:
Analyze this recording: https://www.loom.com/share/abc123
What does this video show? /Users/me/Downloads/bug-repro.mp4
Claude detects the video and runs the skill automatically.
For private URLs, set VIDEO_AUTH_HEADER before invoking Claude (or tell Claude what it is in chat):
export VIDEO_AUTH_HEADER="Authorization: Bearer $TOKEN"
For Atlassian/Jira basic auth:
export VIDEO_AUTH_HEADER="Authorization: Basic $(printf '%s' "$EMAIL:$JIRA_API_TOKEN" | base64 | tr -d '\n')"
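A small helper can make the Basic-auth case less error-prone. The sketch below is an illustration, not part of the plugin: `basic_auth_header` is a hypothetical function name. It uses `printf` rather than `echo -n`, and strips newlines because some `base64` implementations wrap long output, which would break the header for long tokens.

```shell
# Hypothetical helper (not part of the plugin): build a Basic auth header
# from a user name and an API token.
basic_auth_header() {
  local user="$1" token="$2"
  # printf avoids echo's portability quirks; tr removes any line wrapping
  # that base64 adds to long output.
  printf 'Authorization: Basic %s\n' \
    "$(printf '%s' "${user}:${token}" | base64 | tr -d '\n')"
}

# Usage (EMAIL and JIRA_API_TOKEN are assumed to be set already):
# export VIDEO_AUTH_HEADER="$(basic_auth_header "$EMAIL" "$JIRA_API_TOKEN")"
```

The header value is a credential, so it is probably best kept out of shell history and shared transcripts.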
/video-context:setup installs everything on macOS (Homebrew) and Linux (apt/dnf/pacman/zypper; builds whisper-cpp from source). On macOS, extract.sh also auto-installs missing binaries on first use via brew. Windows: install manually.
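Before relying on auto-install, it can help to see which tools are already on the PATH. The following is a minimal sketch, not something the plugin ships: `missing_tools` is a hypothetical name, and the binary names in the usage line are assumptions (the whisper-cpp executable name in particular varies by package and version).

```shell
# Hypothetical pre-flight check (not part of the plugin): print every
# named tool that is not found on PATH, one per line.
missing_tools() {
  local tool
  for tool in "$@"; do
    command -v "$tool" >/dev/null 2>&1 || printf '%s\n' "$tool"
  done
}

# Usage: binary names here are assumptions, adjust for your system.
# missing_tools ffmpeg yt-dlp curl
```

An empty result means every listed tool was found; anything printed still needs installing, either through setup or manually.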
| Env var | Default | Effect |
|---|---|---|
| SCENE_THRESHOLD | 0.4 | Lower = more frames. 0.2 for screen recordings. |
| MAX_FRAMES | 20 | Hard cap; trimmed evenly across timeline. |
| FRAME_WIDTH | 1280 | Downscale to save tokens. |
| WHISPER_MODEL | /opt/homebrew/share/whisper-cpp/ggml-base.en.bin | Whisper model path. |
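To make the settings in the table concrete, here is a rough sketch of how they might map onto an ffmpeg filter and a frame cap. This is an assumption about the mechanics, not a copy of extract.sh: `scene_filter` and `even_indices` are hypothetical helper names, and the real script may select and trim frames differently.

```shell
# Sketch only: defaults taken from the table; how extract.sh reads them
# is an assumption.
: "${SCENE_THRESHOLD:=0.4}"
: "${MAX_FRAMES:=20}"
: "${FRAME_WIDTH:=1280}"

# Build a -vf argument: keep frames whose scene-change score exceeds the
# threshold, then downscale to the given width (height follows, kept even).
scene_filter() {
  printf "select='gt(scene,%s)',scale=%s:-2" "$1" "$2"
}

# Print up to `max` evenly spaced 1-based indices out of `total` frames,
# including the first and last; print every index if total <= max.
even_indices() {
  local total="$1" max="$2" i=0
  if [ "$total" -le "$max" ]; then
    while [ "$i" -lt "$total" ]; do i=$(( i + 1 )); echo "$i"; done
    return 0
  fi
  if [ "$max" -lt 2 ]; then
    [ "$max" -eq 1 ] && echo 1
    return 0
  fi
  while [ "$i" -lt "$max" ]; do
    echo $(( 1 + i * (total - 1) / (max - 1) ))
    i=$(( i + 1 ))
  done
}

# Usage (input.mp4 and frames/ are placeholders; older ffmpeg builds use
# -vsync vfr, newer ones -fps_mode vfr):
# ffmpeg -i input.mp4 -vf "$(scene_filter "$SCENE_THRESHOLD" "$FRAME_WIDTH")" \
#   -fps_mode vfr frames/%04d.jpg
```

With 10 detected frames and a cap of 4, `even_indices 10 4` picks frames 1, 4, 7, and 10, which is one way to read "trimmed evenly across timeline".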
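1. The video is downloaded with yt-dlp (or curl with auth).
2. ffmpeg scene-detect extracts keyframes at SCENE_THRESHOLD, plus the first + last frame.
3. Frames are trimmed to MAX_FRAMES, evenly across the timeline.
4. whisper-cpp transcribes the audio → transcript.txt.
5. Claude reads each frame and the text.

License: MIT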
npx claudepluginhub vusallyv/video-context-plugin --plugin video-context