From hyperframes
Preprocesses audio and video assets for HyperFrames compositions: multi-provider TTS (HeyGen, ElevenLabs, Kokoro), BGM (Google Lyria, MusicGen), Whisper transcription, background removal, and caption authoring with per-word styling.
How this skill is triggered — by the user, by Claude, or both
Slash command
/hyperframes:hyperframes-mediaThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
CLI commands that create assets (`tts`, `bgm`, `transcribe`, `remove-background`), plus everything needed to consume and animate transcript data in HTML. For placing assets into compositions, see `hyperframes-core`.
CLI commands that create assets (tts, bgm, transcribe, remove-background), plus everything needed to consume and animate transcript data in HTML. For placing assets into compositions, see hyperframes-core.
TTS — npx hyperframes tts "..." picks the first available provider:
| Order | Provider | Detected when | Word timestamps |
|---|---|---|---|
| 1 | HeyGen (Starfish) | $HEYGEN_API_KEY / hyperframes auth login | Yes, native — pass --words narration.words.json to capture |
| 2 | ElevenLabs | $ELEVENLABS_API_KEY set | No — chain transcribe after |
| 3 | Kokoro-82M (local, 54 voices) | always (no key required) | No — chain transcribe after |
If the installed
hyperframes ttsis the local-only build (its--helpsays "Kokoro-82M" and has no--provider/--wordsflags), it silently falls back to Kokoro even with$HEYGEN_API_KEYset. To force HeyGen regardless of CLI version, use the self-containedscripts/heygen-tts.mjs(seereferences/tts.md).
BGM — npx hyperframes bgm --duration N:
| Order | Provider | Detected when |
|---|---|---|
| 1 | Google Lyria (RealTime) | $GEMINI_API_KEY or $GOOGLE_API_KEY set |
| 2 | MusicGen (facebook/musicgen-small, local) | Python transformers + torch + soundfile installed |
Override either with --provider <name>.
| Task | Read |
|---|---|
npx hyperframes tts — provider chain, voice IDs, words.json | references/tts.md |
| HeyGen without the CLI — self-contained REST script (wav + words) | scripts/heygen-tts.mjs (see references/tts.md) |
npx hyperframes bgm — Lyria vs MusicGen, mood prompts, tuning | references/bgm.md |
npx hyperframes transcribe — Whisper, model rules, output shape | references/transcribe.md |
npx hyperframes remove-background — transparent cutouts | references/remove-background.md |
| TTS → transcription → captions (no recorded voiceover) | references/tts-to-captions.md |
| Caption authoring — style detection, layout, word grouping, exit | references/captions/authoring.md |
| Transcript handling — input formats, quality gates, cleanup, APIs | references/captions/transcript-handling.md |
| Caption motion — karaoke, marker effects, audio-reactive | references/captions/motion.md |
| Model caches, system dependencies, troubleshooting | references/requirements.md |
am_michael is Kokoro-only; HeyGen UUIDs don't work on Kokoro. If you pass --voice, also pin --provider to avoid silent provider drift when the user's env changes.--model to transcribe. The CLI default small.en silently translates non-English audio. See references/transcribe.md → "Language Rule".--words to HeyGen and use that JSON directly, or run transcribe against the audio file. Don't assume word data is always there.{ id, text, start, end }. See references/transcribe.md → "Output Shape".remove-background --background-output is hole-cut, not inpainted. For "scene without the person", a different tool is needed. See references/remove-background.md → "When NOT the right tool".npx claudepluginhub heygen-com/hyperframes --plugin hyperframesPreprocesses media assets for HyperFrames compositions: text-to-speech narration (Kokoro), audio/video transcription (Whisper), and background removal (u2net).
Scaffolds, validates, previews, and renders HyperFrames video compositions via CLI (`npx hyperframes`). Also troubleshoots the build/render environment.
Drives the HyperFrames CLI dev loop: scaffold, lint, validate, inspect, preview, render, and deploy to AWS Lambda. Use when running any npx hyperframes subcommand or troubleshooting the build/render environment.