Skill

imagen-prompt

Generate optimized prompts for Google Imagen 4 models. Teaches concrete visual language, composition techniques, and model-specific patterns. Use when creating images with Imagen, Google AI image generation, or Vertex AI imaging.

Invocation

How this skill is triggered — by the user, by Claude, or both

Slash command

/model-bridge:imagen-prompt

User invocable

Model invocable

Inline context

Default effort

Tool Access

This skill is limited to the following tools:

ReadGrepGlob

Context Preview

The summary Claude sees in its skill listing — used to decide when to auto-load this skill

Craft effective prompts for Google Imagen 4 (Ultra/Fast) that produce high-quality images. Teaches concrete visual language, composition patterns, and the difference between abstract concepts and specific imagery that Imagen understands.

SKILL.md

251 lines · ~2.1k tokens

Stats

Stars0

MaintenanceGood

Last CommitMar 16, 2026

Actions

View Source View Plugin View on GitHub View README

Stats

Actions

Imagen Prompt Generation Skill

Purpose

Pairs Well With

vertex-setup - For configuring Google Cloud and Vertex AI access
flux-prompt - Alternative for local image generation

When to Use

Use this skill when the user wants to:

Generate images with Google Imagen 4 (Ultra or Fast)
Create product shots, portraits, scenes, or illustrations
Compose images with multiple elements in specific positions
Understand what makes Imagen prompts effective
Troubleshoot why a prompt isn't working

Model Selection

Imagen 4 Ultra (imagen-4.0-ultra-generate-001):

Best quality, recommended for final output
Single-shot generation (no iterative editing)
Generate 4 variations to choose from
Cost: ~$0.06/image

Imagen 4 Fast (imagen-4.0-fast-generate-001):

Faster generation, good for iteration
Lower quality than Ultra
Good for testing prompt ideas

Critical Insight: Imagen 4 does not support inpainting or iterative editing. Invest in prompt quality upfront - single-shot generation with Ultra produces the best results.

Key Rules

DO:

Use concrete, specific language: "red sports car" not "nice car"
Specify positions when needed: "on the left", "in the foreground", "centered"
Include lighting details: "soft morning light", "dramatic side lighting"
Name specific things: "golden retriever puppy", "oak dining table"
Use ONE clear style anchor: "product photography", "oil painting style"
Describe textures and materials: "brushed steel", "worn leather"

DON'T:

Use abstract concepts: "feeling of joy", "essence of freedom"
Stack multiple conflicting styles
Overload with 10+ distinct elements (causes chaos)
Use vague descriptors: "beautiful", "amazing", "perfect"
Rely on iterative editing (not supported in Imagen 4)

Prompt Patterns

Simple Pattern (Single Subject)

For straightforward images with one main focus:

[SUBJECT with specific details], [SETTING/BACKGROUND], [LIGHTING], [STYLE]

Example - Product Shot:

Sleek wireless headphones in matte black finish, floating against a gradient
background transitioning from deep blue to purple, soft studio lighting with
subtle reflections, product photography style, clean and minimal

Example - Portrait:

Young woman with curly red hair and green eyes, wearing a cream knit sweater,
sitting by a rain-streaked window, soft natural light from the left,
candid portrait photography, warm color tones

Example - Food:

Fresh pasta dish with basil and cherry tomatoes in a white ceramic bowl,
rustic wooden table surface, steam rising, warm overhead lighting,
food photography style, shallow depth of field

Spatial Zone Pattern (Complex Compositions)

For images with multiple elements that need specific placement:

[POSITION]: [Element with details]
[POSITION]: [Element with details]
[POSITION]: [Element with details]
[Overall style and lighting]

Positions: LEFT, RIGHT, CENTER, FOREGROUND, BACKGROUND, TOP, BOTTOM

Example - Scene with Multiple Elements:

LEFT: Vintage red bicycle leaning against a brick wall, basket with
fresh flowers. CENTER: Cobblestone alley leading to a sunlit courtyard.
RIGHT: Small cafe with outdoor seating, striped awning.
Golden afternoon light, European street photography style

Example - Product in Context:

FOREGROUND: Leather messenger bag in cognac brown, buckles catching light.
CENTER: Person walking through busy city street, motion blur.
BACKGROUND: Glass storefronts reflecting afternoon sun.
Street photography style, shallow depth of field on bag

Text in Images

Imagen handles text well with specific formatting:

[Scene description] with the text "[YOUR TEXT]" in [font style/placement]

Example:

Vintage travel poster showing the Eiffel Tower at sunset, with the text
"PARIS" in bold Art Deco typography at the bottom, warm orange and gold
color palette, retro illustration style

Example Prompts

Product Photography

Premium watch with rose gold case and black leather strap, positioned at
45-degree angle on dark slate surface, dramatic side lighting creating
sharp reflections on metal, product photography, luxurious feel

Landscape

Misty mountain lake at dawn, snow-capped peaks reflected in still water,
pine forest silhouettes on the shore, soft pink and orange sky,
landscape photography, wide 16:9 composition

Fantasy/Illustration

Massive ancient tree with glowing amber crystals embedded in bark,
bioluminescent mushrooms at base, mystical forest clearing with
shafts of moonlight, fantasy illustration style, rich jewel tones

Architecture

Modern minimalist house with floor-to-ceiling windows, cantilevered
over rocky coastline, dramatic sunset behind, waves crashing below,
architectural photography, clean lines and geometric forms

Portrait with Environment

Chef in white coat standing in professional kitchen, arms crossed,
stainless steel equipment behind, warm overhead lighting,
environmental portrait photography, confident expression

Concrete Language Guide

Instead of → Write:

Abstract	Concrete
"beautiful sunset"	"orange and pink sky with scattered clouds"
"cozy room"	"warm lamp light, knit blanket on leather armchair"
"delicious food"	"steam rising, golden crust, fresh herbs visible"
"professional"	"crisp white shirt, clean workspace, natural light"
"scary atmosphere"	"deep shadows, single flickering light, fog"
"luxury"	"polished marble, gold accents, soft velvet"

Workflow

When user requests an Imagen prompt:

Clarify the subject - What's the main focus?
Determine complexity:
- Single subject → Simple Pattern
- Multiple elements needing placement → Spatial Zone Pattern
Add concrete details:
- Materials and textures
- Colors (specific: "burgundy" not "red")
- Lighting direction and quality
Choose ONE style anchor:
- "product photography"
- "portrait photography"
- "oil painting style"
- "digital illustration"
- "cinematic style"
Specify aspect ratio based on use:
- 16:9 for landscapes, headers
- 1:1 for social media, products
- 9:16 for mobile, stories
- 4:3 for standard photos

API configuration (for reference):

model='imagen-4.0-ultra-generate-001'
number_of_images=4  # Generate variations
aspect_ratio='16:9'  # Or '1:1', '9:16', '4:3'

Troubleshooting

Elements not appearing: Be more specific about placement. Use spatial positions explicitly.

Wrong style: Use only ONE style anchor. Multiple styles confuse the model.

Chaotic output: Too many elements. Simplify to 3-5 key elements maximum.

Vague results: Replace abstract words with concrete descriptions.

Text not rendering: Put text in quotes and specify font style/placement clearly.

Limitations

Does NOT execute API calls (user runs Python/gcloud commands)
Does NOT handle GCP authentication (see vertex-setup skill)
Does NOT support iterative editing (Imagen 4 limitation)
Does NOT make model choice decisions without user input

Tips for Success

Start Simple: Begin with subject + lighting + style. Add complexity only if needed.

Be Specific: "Labrador retriever" beats "dog". "Mahogany desk" beats "wooden desk".

Light Matters: Describe direction, quality, and color of light.

One Style: Pick one clear style anchor and commit to it.

Generate Multiple: Always request 4 variations - quality varies between outputs.

Aspect Ratio: Match ratio to your use case (social, web, print).

imagen-prompt

Invocation

Tool Access

Context Preview

SKILL.md

imagen-prompt

Invocation

Tool Access

Context Preview

SKILL.md

Imagen Prompt Generation Skill

Purpose

Pairs Well With

When to Use

Model Selection

Key Rules

DO:

DON'T:

Prompt Patterns

Simple Pattern (Single Subject)

Spatial Zone Pattern (Complex Compositions)

Text in Images

Example Prompts

Product Photography

Landscape

Fantasy/Illustration

Architecture

Portrait with Environment

Concrete Language Guide

Workflow

Troubleshooting

Limitations

Tips for Success

Similar Skills

Imagen Prompt Generation Skill

Purpose

Pairs Well With

When to Use

Model Selection

Key Rules

DO:

DON'T:

Prompt Patterns

Simple Pattern (Single Subject)

Spatial Zone Pattern (Complex Compositions)

Text in Images

Example Prompts

Product Photography

Landscape

Fantasy/Illustration

Architecture

Portrait with Environment

Concrete Language Guide

Workflow

Troubleshooting

Limitations

Tips for Success

Similar Skills