By gitroomhq
Generate photorealistic portraits, magazine-style character sheets, lip-synced talking-head videos, and UGC product demos, then burn TikTok/Hormozi captions and post directly to social platforms — all via a single MCP server and bearer token.
Playbook for orchestrating an end-to-end UGC video on the agent-media vNext runtime. Read this before deciding whether to call the one-shot make_ugc_video skill or to chain the four primitives (make_portrait → make_character_sheet → make_simple_selfie → make_subtitles) manually.
Up-to-30s vertical talking-head video: the actor speaks full-frame while a user-supplied b-roll video is overlaid on the lower half. <=15s renders as a single take (zero cuts); 16-30s uses two takes where the voice, face and setting carry across the cut automatically (take 1's voice and final frame seed take 2) — consistent speaker and continuous scene, not a jump-cut. Provide actor_image_url (any https image) + broll_video_url (any https video — both re-hosted to R2 automatically) and EITHER script (Seedance voice) OR audio_url (your own audio, single clip <=15s). Optional: subtitles; broll_width_rate (0.1-1.0, e.g. 0.8 = b-roll 80% width centered with black margins; omit for full width); broll_start_time (seconds before the b-roll appears); broll_fade_out (dissolve the b-roll at its end).
Generate a magazine-style character sheet from a portrait. Provide EITHER portrait_url (must be R2-hosted) OR portrait_image_base64 (PNG/JPEG, ≤10 MB; the API will upload it to R2 first). Optional ≤10-word description for name/age/vibe hints.
Bring your own audio: lip-sync a face (an R2-hosted image / character sheet, OR an existing clip) to a provided audio track. No text-to-speech or voice cloning — the character speaks your uploaded recording. Output is a 9:16 talking-head video.
Generate one photoreal portrait. Optionally takes a reference photo (R2-hosted) and a realism preset. Identity is locked from the reference image when provided.
Requires secrets
Needs API keys or credentials to function
Own this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimOwn this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimBased on adoption, maintenance, documentation, and repository signals. Not a security audit or endorsement.
Agents: read this whole page. It is everything you need to create UGC videos with agent-media — no other docs required.
agent-media turns a short description (or a photo) + a script into a finished, captioned, lip-synced vertical UGC video. Works in Claude Code, Cursor, or any MCP / HTTP agent. One Bearer token authenticates everything.
npx skills add gitroomhq/agent-media — installs all of agent-media's skills into your agent (Claude Code, Cursor, etc.)./plugin marketplace add gitroomhq/agent-media then /plugin install agent-media@agent-media.npx -y -p @agentmedia/mcp-server@latest agent-media-mcp with env AGENT_MEDIA_API_KEY=ma_.... All skills self-describe via tools/list.Get a Bearer token: npm i -g agent-media-cli && agent-media login (stores it at ~/.agent-media/credentials.json), or grab the ma_* token from the dashboard. Every call uses Authorization: Bearer ma_.... You need credits on the account (buy at agent-media.ai).
make_ugc_video runs the whole pipeline — portrait → character sheet → lip-synced talking head → captions — in a single request.
curl -X POST https://api.agent-media.ai/v1/skills/make_ugc_video/run \
-H "Authorization: Bearer ma_..." -H "Content-Type: application/json" \
-d '{ "description": "a friendly 28-year-old woman, soft daylight",
"script": "Okay, this changed my whole morning routine — you have to try it.",
"duration": 10, "subtitles": true }'
# -> 202 { "skill_run_id": "..." } then poll:
curl https://api.agent-media.ai/v1/skills/runs/<skill_run_id> -H "Authorization: Bearer ma_..."
# when status == "succeeded", final_output.video_url is your MP4.
In Claude/Cursor you just say it in words: "Make a 10s UGC video of a friendly woman saying '…' with TikTok captions." — the agent picks the skill.
POST https://api.agent-media.ai/v1/skills/<slug>/run (Bearer auth, JSON body) → 202 with a run_id (or skill_run_id for make_ugc_video).GET /v1/skills/runs/<skill_run_id>; single primitive → GET /v1/primitives/runs/<run_id>. Output is final_output.video_url / artifacts[].url.GET https://api.agent-media.ai/v1/public/skills or MCP tools/list. Trust that over any hand-written list.npx claudepluginhub gitroomhq/agent-media --plugin agent-mediaSocial media automation CLI for scheduling posts, managing integrations, uploading media, and tracking analytics across 28+ platforms including X, LinkedIn, Reddit, YouTube, TikTok, Instagram, and more
Create HeyGen avatar videos, personalized video messages, and translated / dubbed videos. Build a persistent digital identity from a photo, generate presenter-led videos with your digital twin, and localize existing videos into 175+ languages with voice cloning and lip-sync.
AI video generation — describe what you want, Pexo picks the best model across 10+ engines (Seedance, Kling, Veo, Sora) and returns a finished, multi-shot video with music, subtitles, and transitions. Includes the Pexo agent plus image, audio, director, and model-prompting skills.
Generate professional Kling AI video prompts. Supports VIDEO 3.0, VIDEO 3.0 Omni, Avatar 2.0 talking avatars, Multi-Shot Storyboards, Element Reference, Voice Control, Motion Control.
Full video production pipeline for Remotion — gives Claude eyes (video analysis), voice (TTS/voiceover), ears (music/SFX), stock footage, AI image/video generation, TikTok captions, 3D content, and more. By Dojo Coding Labs.
Pika Creative Suite — give any AI agent a face, a voice, and a full creative studio. 9 curated /pika:* skills backed by 58 atomic media tools at mcp.pika.me.
Video generation at scale. Generate videos, images, and audio with Runway's API — batch ad campaigns, product videos, multishot stories, and creative iteration. Supports seedance2, gen4.5, veo3, Nano, Banana Pro, and more.