From agent-media
Creates up-to-30s vertical talking-head video with b-roll overlay. Supports single or dual takes for consistent speaker across cuts. Provide actor image, b-roll video, and script or audio.
How this skill is triggered — by the user, by Claude, or both
Slash command
/agent-media:make-broll-talking-headThis skill is limited to the following tools:
The summary Claude sees in its skill listing — used to decide when to auto-load this skill
Up-to-30s vertical talking-head video: the actor speaks full-frame while a user-supplied b-roll video is overlaid on the lower half. <=15s renders as a single take (zero cuts); 16-30s uses two takes where the voice, face and setting carry across the cut automatically (take 1's voice and final frame seed take 2) — consistent speaker and continuous scene, not a jump-cut. Provide actor_image_url (...
Up-to-30s vertical talking-head video: the actor speaks full-frame while a user-supplied b-roll video is overlaid on the lower half. <=15s renders as a single take (zero cuts); 16-30s uses two takes where the voice, face and setting carry across the cut automatically (take 1's voice and final frame seed take 2) — consistent speaker and continuous scene, not a jump-cut. Provide actor_image_url (any https image) + broll_video_url (any https video — both re-hosted to R2 automatically) and EITHER script (Seedance voice) OR audio_url (your own audio, single clip <=15s). Optional: subtitles; broll_width_rate (0.1-1.0, e.g. 0.8 = b-roll 80% width centered with black margins; omit for full width); broll_start_time (seconds before the b-roll appears); broll_fade_out (dissolve the b-roll at its end).
Call this skill when the user asks for the outcome described above. It runs on the agent-media vNext primitive runtime via the mcp__agent-media__make_broll_talking_head MCP tool. Authentication is the user's existing agent-media Bearer token (issued by agent-media login).
Preferred path: MCP tool mcp__agent-media__make_broll_talking_head. Schema is auto-published via tools/list against the same MCP server, so don't restate the schema here — trust the server's response.
Fallback path: REST.
POST https://api.agent-media.ai/v1/skills/make_broll_talking_head/run
Authorization: Bearer $AGENT_MEDIA_API_KEY
Content-Type: application/json
Idempotency-Key: <any unique string per intent>
??sGET https://api.agent-media.ai/v1/skills/runs/<skill_run_id>
Authorization: Bearer $AGENT_MEDIA_API_KEY
Returns per-step status with intermediate artifact URLs as each primitive completes.
agent-media login.This file is auto-generated by scripts/generate-public-skill.ts from the registry at services/api-v2/src/skills/registry.ts. Do not hand-edit; CI rejects drift.
npx claudepluginhub gitroomhq/agent-media --plugin agent-mediaActs as AI creative director for video production including product ads, short films, montages, TikTok e-commerce. Analyzes inputs, writes English prompts, generates assets, submits tasks.
Orchestrates AI video production workflow: gathers specs interactively, generates scripts/storyboards, Gemini TTS voiceovers, Lyria music, Veo 3.1 clips or image animations, assembles with FFmpeg.
Orchestrates UGC video creation on agent-media vNext runtime. Choose between one-shot `make_ugc_video` or chaining four primitives manually.