https://github.com/user-attachments/assets/b46e59c6-129b-48a1-aafa-772585fae60b
Voca is a Claude Code plugin that adds an ambient voice interface to your coding sessions. It listens to your microphone, transcribes speech in real-time, and speaks back—all running locally on Apple Silicon. No cloud APIs, no latency, no data leaving your machine.
All inference runs on-device using MLX.
⚠️ Requirement: macOS with Apple Silicon (M1+) is required to run Voca.
Add the marketplace and install the plugin:
/plugin marketplace add patelnav/voca
/plugin install voca@patelnav-voca
Configure Devices: Run the setup command to configure your audio input/output devices and verify that everything is working:
/voca:setup
Start a Voice Session: Once setup is complete, start the ambient voice interface:
/voca:start
Voca uses token-based HTTP polling under the hood. The first poll claims focus; every later poll in that session reuses the returned token. Starting voice mode in another session claims a new token and hands off listening immediately.
Claude will now listen in the background and respond seamlessly to your voice.
Voca is built on cutting-edge local AI technologies to ensure maximum performance and privacy:
| Component | Technology | Description |
|---|---|---|
| Speech-to-Text | Parakeet MLX v2 | Blazing fast transcription (~0.1s on M-series) |
| Text-to-Speech | Kokoro via mlx-audio | High-quality, natural voice generation |
| Voice Activity | Silero VAD | Enterprise-grade voice activity detection |
| Runtime | Python MCP | Model Context Protocol server over stdio |
Admin access level
Server config contains admin-level keywords
Own this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimOwn this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimBased on adoption, maintenance, documentation, and repository signals. Not a security audit or endorsement.
npx claudepluginhub patelnav/voca --plugin vocaVoice conversations with Claude Code using local speech-to-text and text-to-speech
Give your coding agent a voice. Speaks responses aloud through the VoxClaw macOS/iOS app with per-project voices, speech queue, and polite multi-device sync.
Voice I/O for Claude Code - speak to users and hear their voice input
Standalone image generation plugin using Nano Banana MCP server. Generates and edits images, icons, diagrams, patterns, and visual assets via Gemini image models. No Gemini CLI dependency required.
Write feature specs, plan roadmaps, and synthesize user research faster. Keep stakeholders updated and stay ahead of the competitive landscape.
Comprehensive UI/UX design plugin for mobile (iOS, Android, React Native) and web applications with design systems, accessibility, and modern patterns