Transcribe and analyze audio content using Google Gemini. Supports local audio files (mp3, wav, m4a, ogg, flac) and YouTube links up to 9.5 hours long.
A set of plugins to enhance Claude Code and other agents with various capabilities, such as video understanding, image generation, X API access, and more.
Developed by the superconductor.com team. We help you build software with AI agents.
[!IMPORTANT] For Gemini-based plugins to work, you need to have
GEMINI_API_KEYenvironment variable set. You can get it by following these instructions. For the X API plugin, you needX_BEARER_TOKENset instead (see X Developer Portal).
These plugins use our Gemini Skills CLI.
Analyze and understand video content (up to an hour long) using Google Gemini. Supports local video files (mp4, mov, webm, avi, mkv) and YouTube links.
Transcribe and analyze audio content (up to 9.5 hours long) using Google Gemini. Supports local audio files (mp3, wav, m4a, ogg, flac) and YouTube links.
Generate high-quality images from text descriptions using Google Gemini's image models. Can also use existing images or videos as reference for style guidance.
Create 4-8 second videos from text prompts using Google Veo models. Supports image-to-video animation and native audio generation including dialogue and sound effects.
Convert text to natural-sounding speech using Google Gemini TTS models. Supports 30 different voices and 24 languages with automatic language detection.
Get a second opinion on anything from Google Gemini AI. Also useful for analyzing documents (PDFs up to 50MB) and images.
Interact with the X (Twitter) API v2 using curl commands. Look up user profiles, search posts, retrieve tweets, and more — all using Bearer Token authentication. Requires the X_BEARER_TOKEN environment variable.
Run Claude and add the marketplace:
/plugin marketplace add https://github.com/superconductor/superconductor-plugin-marketplace
Then install the plugins you want:
/plugin install video-understanding
/plugin install audio-understanding
/plugin install image-generation
/plugin install video-generation
/plugin install text-to-speech
/plugin install gemini-consultation
/plugin install x-api
YOU MUST RELOAD CLAUDE CODE TO LOAD THE PLUGINS.
Own this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimOwn this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimBased on adoption, maintenance, documentation, and repository signals. Not a security audit or endorsement.
npx claudepluginhub superconductor/superconductor-plugin-marketplace --plugin audio-understandingConvert text to natural-sounding speech using Google Gemini TTS models. Supports 30 different voices and 24 languages with automatic language detection.
Analyze and understand video content using Google Gemini. Supports local video files (mp4, mov, webm, avi, mkv) and YouTube links up to an hour long.
Generate high-quality images from text descriptions using Google Gemini's image models. Can also use existing images or videos as reference for style guidance.
Get a second opinion from Google Gemini AI. Useful for analyzing documents (PDFs up to 50MB), images, or getting Gemini's perspective on any topic.
Interact with the X (Twitter) API v2 using curl commands. Look up user profiles, search posts, retrieve tweets, and more — all using Bearer Token authentication.
Audio-text alignment, transcription, translation, karaoke, and subtitle toolkit. Built on the Agent Skills standard — works in Claude Code, Codex CLI, Gemini CLI, and any agent that loads SKILL.md files. Powered by the LattifAI Lattice-1 forced-alignment model.
Give Claude the ability to watch and understand videos — extracts frames and audio for full video perception
Claude Code skill pack for AssemblyAI (18 skills)
Transcribe audio/video to SRT subtitles using ElevenLabs Scribe v2. Use for: transcription, subtitles, captions, SRT generation.
Content creation skills: image generation, video compression, conversion, GIF creation, social media formatting, and audio extraction
Comprehensive ElevenLabs AI audio integration for voice-enabled applications with TTS, STT, voice cloning, and Vercel AI SDK support