From web
Generate or edit images via a multi-provider Image API using a bundled Python CLI. Supports Google Gemini (default) and OpenAI. Use when generating, creating, editing, inpainting, masking, removing or replacing backgrounds, creating product shots, concept art, covers, batch image variants, or working with transparent backgrounds.
How this skill is triggered — by the user, by Claude, or both
Slash command
/web:imagineThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
Generates or edits images for the current project (e.g., website assets, game assets, UI mockups, product mockups, wireframes, logo design, photorealistic images, infographics). Defaults to **Google Gemini** (`gemini-3.1-flash-image-preview`) as the default provider, with **OpenAI** (`gpt-image-1.5`) available via `--provider openai`. Prefers the bundled CLI for deterministic, reproducible runs.
Generates or edits images for the current project (e.g., website assets, game assets, UI mockups, product mockups, wireframes, logo design, photorealistic images, infographics). Defaults to Google Gemini (gemini-3.1-flash-image-preview) as the default provider, with OpenAI (gpt-image-1.5) available via --provider openai. Prefers the bundled CLI for deterministic, reproducible runs.
Run python "$IMAGINE" generate --help to see all available params for the current provider.
Set the path to the bundled CLI at the start of any image generation workflow:
export IMAGINE="$HOME/.claude/plugins/web/skills/imagine/scripts/image_gen.py"
All CLI invocations below assume $IMAGINE is set.
--reference)--provider openai--background transparent) → --provider openai--provider openai--provider flag needed)generate --image--reference to the commandgenerate-batchgenerateAll image inputs (--image, --mask, --reference) accept both local file paths and https:// URLs.
python "$IMAGINE" ...) with sensible defaults (see references/cli.md).tempfile.gettempdir()/imagine/ for intermediate files (for example JSONL batches); delete when done.output/imagine/ when working in this repo.--out or --out-dir to control output paths; keep filenames stable and descriptive.Prefer uv for dependency management.
uv pip install google-genai pillow
uv pip install openai pillow
If uv is unavailable:
python3 -m pip install google-genai pillow # Google
python3 -m pip install openai pillow # OpenAI
GOOGLE_API_KEY must be set for live API calls.OPENAI_API_KEY must be set when using --provider openai.If the key is missing, give the user these steps:
If installation isn't possible in this environment, tell the user which dependency is missing and how to install it locally.
| Flag | Repeatable | Notes |
|---|---|---|
--image | Yes | Input image for editing (file path or URL) |
--mask | No | Mask image for inpainting (file path or URL) |
--reference | Yes | Style reference image (file path or URL) |
| Parameter | Default | Choices | Notes |
|---|---|---|---|
--model | gemini-3.1-flash-image-preview | gemini-3.1-flash-image-preview | Google Gemini image model |
--aspect-ratio | 1:1 | 1:1, 2:3, 3:2, 3:4, 4:3, 4:5, 5:4, 9:16, 16:9, 1:4, 4:1, 1:8, 8:1, 21:9 | Image aspect ratio |
--resolution | 1K | 512px, 1K, 2K, 4K | Output resolution |
--n | 1 | 1-10 | Number of images (concurrent calls) |
--output-format | png | png, jpeg, webp | Output image format |
--provider openai)| Parameter | Default | Choices | Notes |
|---|---|---|---|
--model | gpt-image-1.5 | gpt-image-1.5, gpt-image-1-mini | OpenAI image model |
--size | 1024x1024 | 1024x1024, 1536x1024, 1024x1536, auto | Output size in pixels |
--quality | auto | low, medium, high, auto | Image quality level |
--background | (unset) | transparent, opaque, auto | Transparent requires png/webp |
--output-format | png | png, jpeg, webp | Output image format |
--output-compression | (unset) | 0-100 | Compression (jpeg/webp only) |
--input-fidelity | (unset) | low, high | Requires --image; strict identity/layout lock |
--moderation | (unset) | auto, low | Content moderation level |
--n | 1 | 1-10 | Number of images |
python "$IMAGINE" ...) over writing new one-off scripts.scripts/image_gen.py or files under scripts/providers/. If something is missing, ask the user before doing anything else.Reformat user prompts into a structured, production-oriented spec. Only make implicit details explicit; do not invent new requirements.
Classify each request into one of these buckets and keep the slug consistent across prompts and references.
Generate:
Edit:
Quick clarification (augmentation vs invention):
Template (include only relevant lines):
Use case: <taxonomy slug>
Asset type: <where the asset will be used>
Primary request: <user's main prompt>
Scene/background: <environment>
Subject: <main subject>
Style/medium: <photo/illustration/3D/etc>
Composition/framing: <wide/close/top-down; placement>
Lighting/mood: <lighting + mood>
Color palette: <palette notes>
Materials/textures: <surface details>
Quality: <low/medium/high/auto>
Input fidelity (edits): <low/high>
Text (verbatim): "<exact text>"
Constraints: <must keep/must avoid>
Avoid: <negative constraints>
Augmentation rules:
references/sample-prompts.md.Use case: stylized-concept
Asset type: landing page hero
Primary request: a minimal hero image of a ceramic coffee mug
Style/medium: clean product photography
Composition/framing: centered product, generous negative space on the right
Lighting/mood: soft studio lighting
Constraints: no logos, no text, no watermark
python "$IMAGINE" generate --image input.png --prompt "Replace the background with a warm sunset gradient"
Use case: precise-object-edit
Asset type: product photo background replacement
Primary request: replace the background with a warm sunset gradient
Constraints: change only the background; keep the product and its edges unchanged; no text; no watermark
python "$IMAGINE" generate --reference style.png --prompt "A man riding a motorcycle on a white background"
Use case: style-transfer
Primary request: apply the reference image's visual style to a man riding a motorcycle on a white background
Constraints: preserve palette, texture, and brushwork; no extra elements; plain white background
More principles: references/prompting.md. Copy/paste specs: references/sample-prompts.md.
Asset-type templates (website assets, game assets, wireframes, logo) are consolidated in references/sample-prompts.md.
references/cli.mdreferences/image-api.mdreferences/cli.md: how to run image generation/edits/batches via scripts/image_gen.py (commands, flags, recipes).references/image-api.md: what knobs exist at the API level (parameters, sizes, quality, background, edit-only fields).references/prompting.md: prompting principles (structure, constraints/invariants, iteration patterns).references/sample-prompts.md: copy/paste prompt recipes (generate + edit workflows; examples only).npx claudepluginhub alvis/.claude --plugin webGenerates images from text, edits images with references, performs product placement, style transfer, and multi-image composition using OpenAI DALL-E or Google Gemini.
Generates images from text, edits existing images, applies style transfers, composes from multiple references, and supports multi-turn refinement using Google's Gemini API via Python scripts. For logos, stickers, mockups.
Generates AI images from text prompts, edits images, and composes from multiple references using Gemini models. Supports t2i, i2i, product mockups, and stickers.