Analyze and understand video content using Google Gemini. Supports local video files (mp4, mov, webm, avi, mkv) and YouTube links up to an hour long.
A set of plugins to enhance Claude Code and other agents with various capabilities, such as video understanding, image generation, X API access, and more.
Developed by the superconductor.com team. We help you build software with AI agents.
[!IMPORTANT] For Gemini-based plugins to work, you need to have the GEMINI_API_KEY environment variable set. You can get it by following these instructions. For the X API plugin, you need X_BEARER_TOKEN set instead (see X Developer Portal).
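As a rough illustration of the requirement above, the following Python sketch checks which variables are missing before you install a given set of plugins. It is not part of the marketplace or the Gemini Skills CLI: the helper name missing_env_vars and the REQUIRED_ENV table are hypothetical, and the table assumes that every plugin other than x-api needs GEMINI_API_KEY.

```python
import os

# Assumption: every plugin except x-api is Gemini-based and needs
# GEMINI_API_KEY, while x-api needs X_BEARER_TOKEN instead.
REQUIRED_ENV = {
    "video-understanding": "GEMINI_API_KEY",
    "audio-understanding": "GEMINI_API_KEY",
    "image-generation": "GEMINI_API_KEY",
    "video-generation": "GEMINI_API_KEY",
    "text-to-speech": "GEMINI_API_KEY",
    "gemini-consultation": "GEMINI_API_KEY",
    "x-api": "X_BEARER_TOKEN",
}


def missing_env_vars(plugins, env=None):
    """Return the sorted names of required variables that are unset or empty."""
    env = os.environ if env is None else env
    needed = {REQUIRED_ENV[name] for name in plugins}
    return sorted(var for var in needed if not env.get(var))
```

Calling missing_env_vars(["video-understanding", "x-api"]) with neither variable set would return both names, which you can then export in your shell before starting Claude Code.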
These plugins use our Gemini Skills CLI.
Analyze and understand video content (up to an hour long) using Google Gemini. Supports local video files (mp4, mov, webm, avi, mkv) and YouTube links.
Transcribe and analyze audio content (up to 9.5 hours long) using Google Gemini. Supports local audio files (mp3, wav, m4a, ogg, flac) and YouTube links.
Generate high-quality images from text descriptions using Google Gemini's image models. Can also use existing images or videos as reference for style guidance.
Create 4-8 second videos from text prompts using Google Veo models. Supports image-to-video animation and native audio generation including dialogue and sound effects.
Convert text to natural-sounding speech using Google Gemini TTS models. Supports 30 different voices and 24 languages with automatic language detection.
Get a second opinion on anything from Google Gemini AI. Also useful for analyzing documents (PDFs up to 50MB) and images.
Interact with the X (Twitter) API v2 using curl commands. Look up user profiles, search posts, retrieve tweets, and more — all using Bearer Token authentication. Requires the X_BEARER_TOKEN environment variable.
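To make the Bearer Token authentication used by the X API plugin concrete, here is a minimal Python sketch of a user profile lookup. The plugin itself issues curl commands, so this is a swapped-in equivalent rather than the plugin's own code. The function names build_user_lookup and fetch_user are hypothetical, and the sketch assumes the API v2 user-lookup-by-username endpoint at api.x.com.

```python
import json
import os
import urllib.parse
import urllib.request

# Assumption: the X API v2 base URL; the plugin's curl commands may differ.
API_BASE = "https://api.x.com/2"


def build_user_lookup(username, token):
    """Build (but do not send) a GET request for a user's profile."""
    safe_name = urllib.parse.quote(username, safe="")
    url = f"{API_BASE}/users/by/username/{safe_name}"
    # Bearer Token authentication: the token travels in the Authorization header.
    return urllib.request.Request(url, headers={"Authorization": f"Bearer {token}"})


def fetch_user(username):
    """Send the request; needs network access and a valid X_BEARER_TOKEN."""
    request = build_user_lookup(username, os.environ["X_BEARER_TOKEN"])
    with urllib.request.urlopen(request) as response:
        return json.load(response)
```

Keeping request construction separate from sending makes it possible to inspect the URL and headers without network access. Only fetch_user reads X_BEARER_TOKEN from the environment and contacts the API.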
Run Claude and add the marketplace:
/plugin marketplace add https://github.com/superconductor/superconductor-plugin-marketplace
Then install the plugins you want:
/plugin install video-understanding
/plugin install audio-understanding
/plugin install image-generation
/plugin install video-generation
/plugin install text-to-speech
/plugin install gemini-consultation
/plugin install x-api
YOU MUST RELOAD CLAUDE CODE TO LOAD THE PLUGINS.
npx claudepluginhub superconductor/superconductor-plugin-marketplace --plugin video-understanding
Transcribe and analyze audio content using Google Gemini. Supports local audio files (mp3, wav, m4a, ogg, flac) and YouTube links up to 9.5 hours long.
Get a second opinion from Google Gemini AI. Useful for analyzing documents (PDFs up to 50MB), images, or getting Gemini's perspective on any topic.
Interact with the X (Twitter) API v2 using curl commands. Look up user profiles, search posts, retrieve tweets, and more — all using Bearer Token authentication.
Give Claude the ability to watch and understand videos — extracts frames and audio for full video perception
Turn videos into a sequence of relevant still frames + transcript + a self-contained HTML report so Claude can view them as images, hear the audio, and write its analysis back into the report. Pass a local path, an http(s) URL, or pipe video bytes on stdin.
Claude Code plugin for video analysis, deep research, content extraction, web search, and explainer video creation — powered by Gemini 3.5 Flash.
Let Claude watch a video. Downloads with yt-dlp, extracts auto-scaled frames with ffmpeg, pulls captions or falls back to Whisper, and hands frames + transcript to Claude so it can answer questions about the video.
Compose yt-dlp + ffmpeg + Whisper into a single command that hands an AI agent the raw materials to watch any social video — VIDEO + FRAMES + TRANSCRIPT, ready for an LLM to read frames as images and transcript as text.
Full video production pipeline for Remotion — gives Claude eyes (video analysis), voice (TTS/voiceover), ears (music/SFX), stock footage, AI image/video generation, TikTok captions, 3D content, and more. By Dojo Coding Labs.