From gpt-image
Generates, edits, and inpaints images using GPT Image 2 CLI with reference gallery matching for text-to-image, UI mockups, diagrams, posters, typography, and Chinese text prompts.
How this skill is triggered — by the user, by Claude, or both
Slash command
/gpt-image:gpt-imageThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
Agent runbook for GPT Image 2 generation/editing. Use the prompt library + packaged CLI. Do not reimplement image API code.
agents/openai.yamlreferences/craft.mdreferences/gallery-anime-and-manga.mdreferences/gallery-architecture-and-interior.mdreferences/gallery-beauty-and-lifestyle.mdreferences/gallery-brand-systems-and-identity.mdreferences/gallery-character-design.mdreferences/gallery-cinematic-and-animation.mdreferences/gallery-cinematic-film-references.mdreferences/gallery-data-visualization.mdreferences/gallery-edit-endpoint-showcase.mdreferences/gallery-events-and-experience.mdreferences/gallery-fashion-editorial.mdreferences/gallery-fine-art-painting.mdreferences/gallery-gaming.mdreferences/gallery-illustration.mdreferences/gallery-infographics-and-field-guides.mdreferences/gallery-ink-and-chinese.mdreferences/gallery-isometric.mdreferences/gallery-more-illustration-styles.mdAgent runbook for GPT Image 2 generation/editing. Use the prompt library + packaged CLI. Do not reimplement image API code.
generate, edit, inpaint, or multi-reference; identify asset type, exact text, aspect ratio, references, safety constraints, and budget/quality.references/gallery.md; load/search the closest references/gallery-<category>.md file(s). Read actual **Prompt** text before choosing a pattern.references/craft.md for dense text, diagrams, UI, data visualization, multi-panel layouts, weak prompts, or no close gallery match.command -v gpt-image), installed tool lists when the tool manager exists, or the runtime’s own skill registry when available. Do not assume a local home path in cloud/hosted runtimes..env, or write API keys unless the user explicitly requested setup. Global/shared installs are opt-in only.gpt-image or scripts/generate.py. Do not create a new generate.py, SDK wrapper, or ad-hoc script for normal image requests.Fast path: precise prompt + explicit “generate now” → quick reference/craft check, then CLI.
Preferred call order:
# Existing CLI on PATH
gpt-image -p "PROMPT" [-f OUT] [-i REF...] [-m MASK] [options]
# Installed skill folder; use runtime-provided skill path when available
uv run "$SKILL_DIR/scripts/generate.py" -p "PROMPT" [-f OUT] [-i REF...] [-m MASK] [options]
# Direct transient CLI when the user requested setup/one-off CLI execution
uvx --from git+https://github.com/wuyoscar/gpt_image_2_skill gpt-image -p "PROMPT" [options]
scripts/generate.py is a launcher: repo-local src/gpt_image_cli → installed gpt-image → PATH gpt-image → transient uvx/uv fallback.
OPENAI_API_KEY from process env, then .env, then ~/.env without overriding existing env; successful API calls may bill the user’s OpenAI account.OPENAI_API_KEY is unset, report missing key or use host-native generation when requested; do not write secrets.unset OPENAI_API_KEY; if a key exists in .env/~/.env, tell them to remove/rename it for the session rather than working around it.| Flag | Values | Use |
|---|---|---|
-p, --prompt | string | Required prompt/edit instruction |
-f, --file | path | Output path; auto-named if omitted |
-i, --image | repeatable path | Use edits endpoint; supports multiple references |
-m, --mask | PNG path | Inpaint with alpha mask; requires -i |
--model | default gpt-image-2 | Image model |
--size | 1k, 2k, 4k, portrait, landscape, square, wide, tall, or literal | Canvas size |
--quality | low, medium, high, auto | Cost/quality dial |
-n, --n | integer | Number of images |
--background | auto, opaque | Generation background |
--moderation | auto, low | Generation moderation setting |
--format | png, jpeg, webp | Output encoding |
--compression | 0-100 | JPEG/WebP compression |
--user | string | Optional end-user identifier |
Quality policy:
low: cheap drafts, broad exploration, many variants.medium: normal exploration, style probing, balanced cost.high: final assets, Chinese text, posters, diagrams, UI, paper figures, dense labels.Size policy:
1k / 1024x1024portraitlandscape2k4ktall| Mode | Trigger | Endpoint |
|---|---|---|
| Text-to-image | no -i | /v1/images/generations |
| Reference edit | one or more -i | /v1/images/edits |
| Inpaint | -i + -m | /v1/images/edits with mask |
Surface API errors verbatim enough for debugging; exit codes: 0 success, 1 API/refusal, 2 bad args/missing key.
references/gallery.md: routing index for the 162-prompt Reference Gallery Atlas. Load first.references/gallery-*.md: concrete prompts, previews, paths, metadata, attribution. Load 1 category for normal requests; 2–3 for hybrids.references/craft.md: prompt-craft checklist. Load for prompt repair, exact text, UI/data/diagram grammar, edit invariants, and multi-panel consistency.references/openai-cookbook.md: official parameter/model semantics. Load for API behavior or model capability questions.Reference loading policy: load the smallest useful slice; never load all category files by default.
-i paths exist; verify -m exists when used.Preserve Curated vs Author + Source metadata when adapting examples. Add new collected prompts to the Reference Gallery before README promotion.
npx claudepluginhub wuyoscar/gpt-image2-skill --plugin gpt-imageRenders GPT Image 2 prompts via Garden local generation, host-native tools, or advisor mode. Covers 18 template categories including posters, UI, products, infographics, academic figures, comics, storyboards, and editing.
Generates AI images from text prompts, edits images, and composes from multiple references using Gemini models. Supports t2i, i2i, product mockups, and stickers.
Generates images from text, edits images with references, performs product placement, style transfer, and multi-image composition using OpenAI DALL-E or Google Gemini.