From google-image-gen
Generate images using Google's Gemini API. Use this skill when the user wants to generate, create, or edit images with AI. Keywords: image generation, create image, generate picture, AI art, edit image, Gemini, icon, render, illustration.
How this skill is triggered — by the user, by Claude, or both
Slash command
/google-image-gen:google-image-genThis skill is limited to the following tools:
The summary Claude sees in its skill listing — used to decide when to auto-load this skill
Generate images from text prompts using Google's Gemini API.
Generate images from text prompts using Google's Gemini API.
Run these commands once at the start of a session:
${CLAUDE_PLUGIN_ROOT}/scripts/check_env.sh
cd ${CLAUDE_PLUGIN_ROOT} && uv sync && cd -
If the environment check fails, the user needs to configure their API key.
Recommended: Create ~/.config/google-image-gen/.env with:
GOOGLE_AI_API_KEY=your_key_here
Get an API key from https://aistudio.google.com/apikey
Alternative: Export as environment variable: export GOOGLE_AI_API_KEY=your_key
Important: The plugin runs from its own directory. Use --cwd to ensure output files land in the user's project,
not the plugin cache.
ORIG_CWD="$(pwd)" && cd ${CLAUDE_PLUGIN_ROOT} && uv run python main.py --cwd "$ORIG_CWD" <output_path> "<prompt>" [options]
Note: When the user specifies a path like output.png or images/photo.png, pass it as-is — the --cwd parameter
ensures it resolves relative to the user's project root.
| Option | Short | Description |
|---|---|---|
--style | -s | Style template (.md file with {subject} placeholder) |
--ref | -r | Reference image for style (repeatable, max 14) |
--edit | -e | Edit existing image instead of generating |
--aspect | -a | Aspect ratio: 1:1, 3:4, 4:3, 4:5, 5:4, 9:16, 16:9, 21:9 |
ORIG_CWD="$(pwd)" && cd ${CLAUDE_PLUGIN_ROOT} && uv run python main.py --cwd "$ORIG_CWD" output.png "A red apple on a wooden table"
ORIG_CWD="$(pwd)" && cd ${CLAUDE_PLUGIN_ROOT} && uv run python main.py --cwd "$ORIG_CWD" images/output.png "A red apple"
ORIG_CWD="$(pwd)" && cd ${CLAUDE_PLUGIN_ROOT} && uv run python main.py --cwd "$ORIG_CWD" thumb.png "Mountain landscape" --aspect 16:9
ORIG_CWD="$(pwd)" && cd ${CLAUDE_PLUGIN_ROOT} && uv run python main.py --cwd "$ORIG_CWD" output.png "Change the sky to sunset" --edit input.png
ORIG_CWD="$(pwd)" && cd ${CLAUDE_PLUGIN_ROOT} && uv run python main.py --cwd "$ORIG_CWD" output.png "Same style but with a car" --ref reference.png
Generates numbered outputs (output_1.png, output_2.png, etc.):
ORIG_CWD="$(pwd)" && cd ${CLAUDE_PLUGIN_ROOT} && uv run python main.py --cwd "$ORIG_CWD" output.png "cat" "dog" "bird"
npx claudepluginhub ypfaff/google-image-gen-plugin --plugin google-image-genGenerate and edit images using Google's Gemini API via Python scripts. Supports text-to-image, image editing, multi-turn refinement, custom resolutions, and aspect ratios.
Generates or edits images using Google Gemini API via nanobanana.py Python script. Supports prompts, aspect ratios like 9:16, models such as gemini-3.1-flash, and resolutions up to 4K.
Generates or edits images via Google Gemini from text prompts or up to 14 reference images for text-to-image and multi-image editing.