From ai-video-storyboard
Use when planning a multi-shot AI video (TikTok Reel, Instagram Ad, YouTube Shorts, product explainer) where the target duration exceeds what a single AI generation can produce (>15s), and you need a coordinated shot list with visually consistent prompts for each segment
How this skill is triggered — by the user, by Claude, or both
Slash command
/ai-video-storyboard:ai-video-storyboardThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
AI video generators produce 5–15 second clips. Real-world videos are longer: 30s TikToks, 60s ads, 90s explainers. This skill bridges that gap by producing a complete **shot-list storyboard** — a coordinated sequence of per-shot prompts with shared visual language, so the assembled final video looks like one intentional piece of work rather than six disconnected clips.
AI video generators produce 5–15 second clips. Real-world videos are longer: 30s TikToks, 60s ads, 90s explainers. This skill bridges that gap by producing a complete shot-list storyboard — a coordinated sequence of per-shot prompts with shared visual language, so the assembled final video looks like one intentional piece of work rather than six disconnected clips.
Core insight: Visual consistency across shots matters more than any single shot being perfect. A mediocre but consistent set of shots edits together; six gorgeous but mismatched shots do not.
Do NOT use for:
Ask the user these questions in a single message. Accept short answers:
If user already provided some of these, skip those questions and confirm the rest.
Based on duration and platform, divide the timeline into shots of ~5 seconds each (the sweet spot for AI video generators).
Standard cadences:
| Platform | Duration | Shots | Pacing |
|---|---|---|---|
| TikTok Hook | 15s | 3 | Fast cuts, single idea |
| TikTok Reel | 30s | 6 | Hook → Build → Payoff → CTA |
| Instagram Ad | 15s | 3 | Hook → Product → CTA |
| Instagram Ad | 30s | 6 | Hook → Problem → Product → Benefit → Social proof → CTA |
| YouTube Short | 60s | 12 | Hook → 3-act structure → CTA |
| Product Explainer | 90s | 18 | Problem → Solution → How it works → Results → CTA |
| Brand Story | 60s | 10-12 | Atmosphere-driven, longer shot holds |
Before writing any shot, lock in the shared visual language. This is what makes shots edit together:
Write these as a Visual Theme block at the top of the output. Every shot's prompt must respect this block.
For each shot, produce this structure:
## Shot N (START-ENDs) — [Purpose label: Hook / Setting / Action / Detail / Reveal / CTA]
**Composition:** [shot type + angle, e.g., "Extreme close-up, overhead"]
**Camera move:** [locked / slow dolly in / tracking / crane up / etc.]
**Lighting:** [from the Visual Theme, applied to this scene]
**Subject:** [what is in frame]
**Action:** [what is happening]
**Prompt to copy:**
> [Complete, cinematic-quality prompt, 40-80 words, including: subject + action + environment + camera + lighting + style + technical spec (duration, aspect ratio, resolution). Always ends with "cinematic 1080p, synchronized audio"]
**Audio direction:** [what the synchronized audio should sound like — ambient sounds, music beat position, voice-over line]
Critical rules for shot prompts:
references/shot-types.md for the full vocabulary.The sequence of shots must have a story arc, not a random list. Use one of these patterns:
Pattern A — Hook / Build / Payoff / CTA (TikTok default)
Pattern B — Problem / Solution / Proof / CTA (Ad default)
Pattern C — Atmosphere → Climax (Brand story)
Close with actionable post-production notes:
## Post-Production Checklist
- [ ] Generate all N shots with your preferred AI video tool
- [ ] Stitch in [CapCut / Descript / DaVinci Resolve / Premiere]
- [ ] Apply [specific LUT or color grade] for consistency
- [ ] Add [transition type and duration] between shots
- [ ] Layer BGM: [genre / BPM / mood]
- [ ] Add text overlays for [hook / CTA / captions]
- [ ] Export [platform spec: 9:16 1080x1920 30fps for TikTok, etc.]
Close with a "Why this works" block explaining the creative decisions. This educates the user and differentiates your output from generic prompt lists. Reference:
The final output is a single Markdown document containing:
See examples/ for four complete sample outputs:
tiktok-reel-30s-coffee.md — Coffee shop opening TikTokreferences/shot-types.md — Cinematography vocabulary (ECU, CU, MS, WS, OTS, POV, angles)references/camera-moves.md — Camera movement vocabulary (dolly, crane, pan, tilt, tracking)references/lighting.md — Lighting terminology (golden hour, rim, motivated, practical)references/genre-templates.md — Pre-built templates for common genres (food, fashion, tech, travel, SaaS)MIT — use freely, commercial or personal.
Provides UI/UX resources: 50+ styles, color palettes, font pairings, guidelines, charts for web/mobile across React, Next.js, Vue, Svelte, Tailwind, React Native, Flutter. Aids planning, building, reviewing interfaces.
Fetches up-to-date documentation from Context7 for libraries and frameworks like React, Next.js, Prisma. Use for setup questions, API references, and code examples.
npx claudepluginhub cajias/agentic-video-skills --plugin ai-video-storyboard