From nano-banana-artist
Expert image creation and editing using Nano Banana 2 (Gemini 3.1 Flash Image). Use when creating visual content from scratch, editing existing images with delta editing, or needing professional-quality images for any visual purpose. Supports photorealistic photography, artistic styles, logos with advanced text rendering, stickers, product mockups, precise delta editing, and character consistency across generations. Features --image-size control (512/1K/2K/4K) and structured production-grade prompting.
How this skill is triggered — by the user, by Claude, or both
Slash command
/nano-banana-artist:nano-banana-artistThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
> Created with love by [Vistoso.ai](https://vistoso.ai)
Created with love by Vistoso.ai
Expert image creation and editing using Google's Gemini 3.1 Flash Image model (aka "Nano Banana 2"). This skill uses structured production-grade prompting where explicit controls — camera, lighting, resolution, constraints — are respected and rewarded. Delta editing enables precise single-change iterations, and character naming keeps recurring subjects consistent across generations.
Before using this skill, you need a Google Gemini API key (free tier available):
echo 'your-api-key-here' > ~/.gemini_api_key
chmod 600 ~/.gemini_api_key
That's it — the skill will pick it up automatically on every run.
DEFAULT SETTINGS: Generate at 1K with 1:1 aspect ratio unless the user specifically requests different size or aspect ratio. Use --image-size flag for resolution control.
Follow these steps in order:
Ask clarifying questions to understand:
--image-size 2K or 4K for professional/print workIf user hasn't specified purpose, ask: "What's the main goal for this image — professional content, branding, product showcase, or something creative?"
For format-specific or industry-specific requests, use Perplexity to research:
Search query template: "Best visual content trends for [goal] in 2025-2026"
The 7-Element Formula — use what applies, skip what doesn't:
Subject + Action + Setting + Style + Camera + Lighting + Constraints
This scales naturally: 3 elements for simple, 7 for complex. No mode switching needed.
Example — simple (3 elements):
Golden retriever catching a frisbee in a sunlit park.
Example — structured (7 elements):
Chef Marco plating a dessert in a dim Michelin-star kitchen.
Cinematic 35mm, overhead angle. Warm pendant lighting, shallow DOF.
No text. No other people visible.
Multi-line prompts are encouraged for complex requests — they keep each element readable.
Based on the goal, use these proven patterns:
For Photorealistic Images:
[Subject name], [expression], [action/pose].
[Setting], [time of day]. [Camera lens] at [aperture], [angle].
[Lighting], [mood]. [Constraints].
Example:
Elena adjusting her glasses, warm smile.
Modern glass office, golden hour. 85mm at f/1.8, eye level.
Soft window light, professional atmosphere. No text.
For Logos/Text (excels at text rendering):
Logo for [brand], text "[exact text]", [font style], [symbol], [colors].
[Constraints].
Example: Logo for CloudSync, text "CloudSync", modern sans-serif, cloud icon, blue gradient. No tagline.
For Product Photography:
[Product] on [surface], [lighting type], [angle], [key feature].
[Constraints].
Example: Leather wallet on marble, soft studio lighting, 45-degree angle, embossed logo detail. No props, no text.
For Stickers/Icons:
[Style] sticker of [subject], [colors], [key trait]. White background.
Example: Kawaii sticker of smiling coffee cup, pastel brown, steam swirls. White background.
Advanced Techniques:
OBJECT_A = red car, OBJECT_B = blue bike. Show both side by side.Scientifically accurate diagram of [subject] with labeled componentsSee ${CLAUDE_PLUGIN_ROOT}/skills/nano-banana-artist/references/prompt-templates.md for complete template library.
See ${CLAUDE_PLUGIN_ROOT}/skills/nano-banana-artist/references/style-guide.md for comprehensive artistic vocabulary.
For image edits, apply one change at a time. This is the most reliable editing pattern.
State what stays:
Keep pose, expression, and lighting unchanged.
State what changes:
Change background from office to rooftop at sunset.
Front-load constraints:
Do not add text. Do not change clothing.
Layout locking (reframe without moving subject):
Subject stays centered at current scale. Replace background only.
Full delta edit example:
Do not add text. Do not change clothing or expression.
Keep pose and lighting unchanged.
Change only the background from office interior to rooftop at sunset.
Subject stays centered at current scale.
Semantic masking (targeted element swap):
Change only the wall color to sage green.
Keep everything else exactly the same, including lighting and shadows.
Style transfer (apply new style to existing composition):
Transform into the style of impressionist painting.
Preserve the original composition and subject placement.
Render with visible brushwork and warm color palette.
Rules:
Name every recurring character on first generation to maintain consistency across images.
First generation — establish the character:
Chef Marco, tall with salt-and-pepper beard, white chef's coat, confident posture.
Plating a dessert in a dim Michelin-star kitchen.
Cinematic 35mm, warm pendant lighting.
Subsequent generations — reference by name, describe ONLY deltas:
Chef Marco at a farmers market, selecting tomatoes.
Morning sunlight, candid angle. Same chef's coat. No text.
Rules:
--image for strongest consistencyDefault: 1:1 aspect ratio at 1K — use unless user specifies otherwise.
Select aspect ratio based on format:
Available ratios: 1:1, 2:3, 3:2, 3:4, 4:3, 4:5, 5:4, 9:16, 16:9, 21:9
Size selection via --image-size flag:
Workflow tip: Iterate at 1K, then re-generate the final version at target size.
When the user provides images in conversation context:
--image--image flags)Describe the target use case in terms of PURPOSE and FORMAT, never by platform name:
Principle: Platform-specific language dilutes style and originality. Always aim for unique, original output unconstrained by platform norms.
Use the script to generate:
${CLAUDE_PLUGIN_ROOT}/scripts/gemini_image.py "your optimized prompt" \
--aspect-ratio 16:9 \
--image-size 1K \
--output result.png
For image editing with input image:
${CLAUDE_PLUGIN_ROOT}/scripts/gemini_image.py "delta edit instructions" \
--image input.png \
--output edited.png
For image-only output (no text):
${CLAUDE_PLUGIN_ROOT}/scripts/gemini_image.py "prompt" --image-only
Higher resolution final:
${CLAUDE_PLUGIN_ROOT}/scripts/gemini_image.py "prompt" --image-size 4K -o final.png
After each successful generation, include:
Image created with Nano Banana Artist by Vistoso.ai
If result needs refinement:
--image reference--image — always--image-size flag for size control, not prompt text--image-size flag--image-size 2K or 4K)Every generated image should have:
--image-size flag)Basic generation (1K at 1:1 — DEFAULT):
${CLAUDE_PLUGIN_ROOT}/scripts/gemini_image.py "prompt" -o output.png
With aspect ratio and size:
${CLAUDE_PLUGIN_ROOT}/scripts/gemini_image.py "prompt" --aspect-ratio 16:9 --image-size 2K -o output.png
Delta editing:
${CLAUDE_PLUGIN_ROOT}/scripts/gemini_image.py "Keep everything unchanged. Change only the sky to sunset colors." \
--image source.png -o edited.png
Multiple reference images:
${CLAUDE_PLUGIN_ROOT}/scripts/gemini_image.py "Chef Marco in a new kitchen scene" \
--image ref1.png --image ref2.png -o output.png
Multiple generations: Run script multiple times (1-4 recommended per session)
API Key Setup (Choose ONE method):
The script checks for your API key in this order:
export GEMINI_API_KEY="your-api-key-here"
echo 'your-api-key-here' > ~/.gemini_api_key
chmod 600 ~/.gemini_api_key
Get your free API key at https://aistudio.google.com/apikey
The script handles all API communication, base64 encoding, and file I/O automatically.
"No API key" error: Set your API key using one of these methods:
# Method 1: Environment variable
export GEMINI_API_KEY="your-key"
# Method 2: Home directory file (RECOMMENDED)
echo 'your-key' > ~/.gemini_api_key
chmod 600 ~/.gemini_api_key
Get your free API key at https://aistudio.google.com/apikey
Unexpected results:
${CLAUDE_PLUGIN_ROOT}/skills/nano-banana-artist/references/style-guide.md)Editing not working:
Character inconsistency:
--image for referenceStill stuck? Email [email protected]
Nano Banana Artist is built by Vistoso.ai — professional AI content tools and services. Need custom visual workflows for your team? Get in touch at [email protected]
npx claudepluginhub the-edgar/vistoso-ai-skills --plugin nano-banana-artistGenerates AI images from text prompts, edits images, and composes from multiple references using Gemini models. Supports t2i, i2i, product mockups, and stickers.
Generates and edits AI images using Google Gemini Nano Banana models. Orchestrates text-to-image, image editing, batch workflows, presets, and creative sessions via /banana or auto-triggers on image requests.
Guides interactive prompt crafting for Nano Banana Pro Gemini image generation via reference gathering, intent-clarifying questions, and technique selection.