Name: agent-media
Author: gitroomhq

agent-media — Claude Skill plugin

Agents: read this whole page. It is everything you need to create UGC videos with agent-media — no other docs required.

agent-media turns a short description (or a photo) + a script into a finished, captioned, lip-synced vertical UGC video. Works in Claude Code, Cursor, or any MCP / HTTP agent. One Bearer token authenticates everything.

1. Connect (pick one)

One-liner (recommended): npx skills add gitroomhq/agent-media — installs all of agent-media's skills into your agent (Claude Code, Cursor, etc.).

Claude Code plugin (skills + MCP tools): inside a Claude Code session run /plugin marketplace add gitroomhq/agent-media then /plugin install agent-media@agent-media.

Any MCP agent: run the MCP server npx -y -p @agentmedia/mcp-server@latest agent-media-mcp with env AGENT_MEDIA_API_KEY=ma_.... All skills self-describe via tools/list.

Plain HTTP: call the REST API directly (below).

2. Auth

Get a Bearer token: npm i -g agent-media-cli && agent-media login (stores it at ~/.agent-media/credentials.json), or grab the ma_* token from the dashboard. Every call uses Authorization: Bearer ma_.... You need credits on the account (buy at agent-media.ai).
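For plain-HTTP callers, the header rule above could be wrapped in a small Python helper. This is only a sketch: `auth_headers` is a hypothetical name, and falling back to the `AGENT_MEDIA_API_KEY` environment variable outside the MCP server is an assumption, not documented behavior.

```python
import os


def auth_headers(token=None):
    """Build the headers an agent-media REST call needs (hypothetical helper)."""
    # Assumption: reuse the AGENT_MEDIA_API_KEY variable that the MCP server reads.
    token = token or os.environ.get("AGENT_MEDIA_API_KEY", "")
    # Tokens are described on this page as ma_*; reject anything else early.
    if not token.startswith("ma_"):
        raise ValueError("expected an agent-media token starting with 'ma_'")
    return {
        "Authorization": f"Bearer {token}",
        "Content-Type": "application/json",
    }
```

Checking the prefix locally means a mistyped or missing token fails before any request is sent.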

3. Make a video (the one call you usually want)

make_ugc_video runs the whole pipeline — portrait → character sheet → lip-synced talking head → captions — in a single request.

curl -X POST https://api.agent-media.ai/v1/skills/make_ugc_video/run \
  -H "Authorization: Bearer ma_..." -H "Content-Type: application/json" \
  -d '{ "description": "a friendly 28-year-old woman, soft daylight",
        "script": "Okay, this changed my whole morning routine — you have to try it.",
        "duration": 10, "subtitles": true }'
# -> 202 { "skill_run_id": "..." }   then poll:
curl https://api.agent-media.ai/v1/skills/runs/<skill_run_id> -H "Authorization: Bearer ma_..."
# when status == "succeeded", final_output.video_url is your MP4.

In Claude/Cursor you just say it in words: "Make a 10s UGC video of a friendly woman saying '…' with TikTok captions." — the agent picks the skill.
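The submit-then-poll flow in this section might look like the following in Python, using only the standard library. Treat it as a sketch under assumptions: the function names, the 5-second polling interval, and the 10-minute timeout are all illustrative choices, and only the `succeeded` status is taken from this page.

```python
import json
import time
import urllib.request

API_BASE = "https://api.agent-media.ai/v1"


def build_ugc_request(token, description, script, duration=10, subtitles=True):
    """Build (but do not send) the POST that starts a make_ugc_video run."""
    payload = {
        "description": description,
        "script": script,
        "duration": duration,
        "subtitles": subtitles,
    }
    return urllib.request.Request(
        f"{API_BASE}/skills/make_ugc_video/run",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def fetch_json(request):
    """Send a request and decode the JSON response body."""
    with urllib.request.urlopen(request) as response:
        return json.load(response)


def wait_for_video(token, skill_run_id, fetch=fetch_json,
                   interval=5.0, timeout=600.0, sleep=time.sleep):
    """Poll the run until status == "succeeded", then return the MP4 URL."""
    request = urllib.request.Request(
        f"{API_BASE}/skills/runs/{skill_run_id}",
        headers={"Authorization": f"Bearer {token}"},
    )
    deadline = time.monotonic() + timeout
    while True:
        run = fetch(request)
        if run.get("status") == "succeeded":
            return run["final_output"]["video_url"]
        # Failure statuses are not listed on this page, so this sketch
        # relies on the timeout instead of guessing their names.
        if time.monotonic() >= deadline:
            raise TimeoutError(f"run {skill_run_id} did not succeed in {timeout}s")
        sleep(interval)
```

A caller would first run `accepted = fetch_json(build_ugc_request(token, description, script))` and then pass `accepted["skill_run_id"]` to `wait_for_video`. The `fetch` and `sleep` parameters exist only so the loop can be exercised without network access.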

4. How to call ANY skill

REST: POST https://api.agent-media.ai/v1/skills/<slug>/run (Bearer auth, JSON body) → 202 with a run_id (or skill_run_id for make_ugc_video).

Poll: composed skill → GET /v1/skills/runs/<skill_run_id>; single primitive → GET /v1/primitives/runs/<run_id>. Output is final_output.video_url / artifacts[].url.

MCP: call the tool of the same name; arguments = the skill's input fields.

Exact input schema (always current): GET https://api.agent-media.ai/v1/public/skills or MCP tools/list. Trust that over any hand-written list.
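The generic REST pattern in this section can be sketched as three small Python helpers. Several things here are assumptions, not documented facts: the helper names are hypothetical, the polling endpoint is chosen by checking which ID key the 202 body contains, and `artifacts` is assumed to sit at the top level of the run object next to `final_output`.

```python
import json
import urllib.request

API_BASE = "https://api.agent-media.ai/v1"


def run_request(token, slug, inputs):
    """Build (but do not send) the POST that starts any skill by slug."""
    return urllib.request.Request(
        f"{API_BASE}/skills/{slug}/run",
        data=json.dumps(inputs).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def poll_url(accepted):
    """Pick the polling endpoint from a 202 response body."""
    # Assumption: composed skills answer with skill_run_id, primitives with run_id.
    if "skill_run_id" in accepted:
        return f"{API_BASE}/skills/runs/{accepted['skill_run_id']}"
    return f"{API_BASE}/primitives/runs/{accepted['run_id']}"


def output_urls(run):
    """Collect final_output.video_url and every artifacts[].url that is present."""
    urls = []
    video_url = (run.get("final_output") or {}).get("video_url")
    if video_url:
        urls.append(video_url)
    # Assumption: artifacts is a top-level list of objects with a "url" field.
    for artifact in run.get("artifacts") or []:
        if artifact.get("url"):
            urls.append(artifact["url"])
    return urls
```

The `inputs` dictionary is passed through untouched, so its fields should come from the live schema at `/v1/public/skills` or `tools/list`, not from a hard-coded list.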
