From agentic-creator-os
Unified image, video, and character generation across 30+ frontier models through a single connector. Model routing, visual prompt engineering, character consistency, async job handling, and brand-locked output for production creative work.
How this skill is triggered — by the user, by Claude, or both
Slash command
/agentic-creator-os:multimodal-studioThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
> One connector. Image + video + consistent characters. Thirty-plus models, vendor-agnostic, brand-locked.
One connector. Image + video + consistent characters. Thirty-plus models, vendor-agnostic, brand-locked.
Turn a creative brief into finished visual assets — images, videos, and reusable characters — without leaving the agent loop. Multimodal Studio is ACOS's unified generation layer: it routes each request to the right model, engineers the prompt, holds brand and character consistency across a whole asset set, and handles the async generation lifecycle end to end.
This is the capability that closes the gap with agent-first platforms like Google Antigravity — except ACOS stays model-agnostic (no lock-in to one vendor's image/video stack) and brand-aware (every asset inherits Frank DNA + project brand tokens).
~~image generation or ~~video generationACOS's default multimodal connector is Higgsfield MCP — one OAuth connection exposes 30+ image and video models plus character training. Per CONNECTORS.md, this skill is tool-agnostic: any MCP that fills the ~~image generation / ~~video generation categories works. Higgsfield is the default because a single server covers all three modalities with consistent characters.
Connect (Claude Code):
claude mcp add --transport http --scope user higgsfield https://mcp.higgsfield.ai/mcp
# then authenticate via OAuth in your Higgsfield account — no API keys to manage
Self-hosted alternative (stdio + API keys): see resources/model-matrix.md.
Tools exposed: generate_image, generate_video, create_character, list_characters, get_generation_status.
Verify before generating: if the higgsfield tools are not available, do NOT silently fall back to describing images in text. Tell the operator the connector is missing and give them the one-line claude mcp add command above.
Capture (ask only for what's missing; infer the rest from project context):
resources/model-matrix.md)Routing heuristics:
create_character once, then reference its ID in every call
Always state which model you chose and why in one line before generating.Structure every visual prompt as: Subject + Action + Setting + Composition + Lighting + Style + Technical.
get_generation_status — do not assume instant results. Images: seconds. Video: tens of seconds to minutes.generate_image calls in one turn), then poll. This is the multi-asset speed advantage.For any recurring subject (a brand mascot, a course instructor avatar, a series protagonist):
create_character once from a reference image or description → get a character ID.generate_image / generate_video call.list_characters to reuse across sessions and projects.
This is how you get a character that looks identical across an entire content series — the thing single-shot image tools cannot do.frankx-brand / brand-guidelines → brand tokens for generationcontent-strategy / video-script → briefs that feed the studiosuno-mastery → score the videos this skill producesinfogenius command → research-grounded image prompts/studio and /generate-video commands → operator entry pointsresources/model-matrix.md — full model selection matrix, aspect-ratio table, self-hosted config, cost tiersnpx claudepluginhub frankxai/agentic-creator-osGuides creation, editing, and verification of skills for AI coding agents using test-driven development with subagent scenarios. Use when authoring or debugging skills.