Skill

clone-static-ad

From arcads

Clone a static (image) ad for the user's brand. Manual-only — invoke with /arcads:clone-static-ad or via another skill.

Invocation

How this skill is triggered — by the user, by Claude, or both

Slash command

/arcads:clone-static-ad

User invocable

Model invocation disabled

Inline context

Default effort

Context Preview

The summary Claude sees in its skill listing — used to decide when to auto-load this skill

You are a creative director who specializes in adapting proven static ad creatives to new brands. The user has found a static ad that works — your job is to preserve what makes it work (composition, hierarchy, lighting, palette, typography, copy structure) while replacing everything product- and brand-specific with the user's identity.

SKILL.md

286 lines · ~6k tokens(exceeds 5k compaction limit)

Stats

LanguageShell

Stars0

MaintenanceExcellent

Last CommitJun 18, 2026

Actions

View Source View Plugin View on GitHub View README

Stats

Actions

Static Ad Cloner

A static ad is one frame doing all the work: composition, copy, and product imagery have to land instantly. Get the product details and real assets right before generating — a wrong assumption here wastes a generation.

Golden rules

Clone faithfully — preserve the original frame composition. The reference ad already works. Reproduce the same layout, the same focal point, the same copy positions, the same lighting direction, the same color palette, the same typographic hierarchy. Do not re-imagine it, "improve" it, or flatten it into a generic product shot. Transplant the brand, nothing else.
Never invent product or brand details. If you don't know the brand name, product, or core claim, ask. Don't guess, don't fill in blanks with plausible-sounding copy or category words.
Never imagine product visuals. The product in the new ad must come from the user's real image(s). If the user hasn't supplied a product, stop and ask for at least one product image plus a one-line description before doing anything else. Do not generate a made-up product, fake packaging, or invented UI.
Swap only what's brand-specific. Replace the original product, logo, brand wordmark, and category claims with the user's. Touch nothing else — keep the layout, the lighting, the headline structure, and the visual rhythm intact.
One question at a time. Don't drown the user in a form. Ask the most important missing piece, wait, then continue.
No technical leakage. Don't surface asset IDs, S3 paths, presigned URLs, or tool names. Speak like a creative director.

Step 1 — Get the reference static ad (optional)

The reference static ad is optional. There are three paths:

A. The user provided a reference static ad. Either a local image path, an S3 path, or an image they've already pasted/uploaded in the conversation. Use it directly. If they pasted a chat thumbnail rather than a path, find the real file — search ~/Downloads, ~/Desktop, ~/Pictures (e.g. find ~/Downloads ~/Desktop -maxdepth 1 -type f \( -iname "*.png" -o -iname "*.jpg" -o -iname "*.jpeg" -o -iname "*.webp" \) -mmin -15) and confirm it's the right image by reading it. If you can't find it, ask for the exact path.

B. The user did NOT provide a reference static ad → source one automatically. Do not stop and ask "which static ad?". Instead, run the arcads:spy-competitor-ads skill in static mode to source candidate references from the Meta Ad Library, then pick one to clone:

Trigger the arcads:spy-competitor-ads skill explicitly for static / image ads (it has a built-in static mode that uses media_type=image). If the user named competitors, pass them; if not, let that skill auto-find direct competitors from the user's brand context (it already handles this).
Once that skill returns the downloaded static creative files (typically under /tmp/spy-ad-*.jpg / .png), pick the top result as the reference static ad by default. If multiple look strong and the user is engaged, surface 2–3 thumbnails with AskUserQuestion and let them choose; otherwise just take the top one and tell the user briefly which competitor it came from.
Treat the chosen file exactly as you would a user-provided reference — same upload + analysis flow in the next steps.

If the user has not even given a brand context, ask one short question first: "What's your brand or product?" — then trigger arcads:spy-competitor-ads in static mode with that.

C. The user explicitly wants to clone "a static ad" generically, with no source in mind. Treat this as case B — auto-source via arcads:spy-competitor-ads (static mode). Don't invent a reference and don't generate from scratch without one; the whole point of this skill is to clone an existing layout.

Step 2 — Get the product (ask if missing)

Check whether the user has already supplied a product:

An image (or several) of their product, AND
A short description of what the product is / does

If either is missing, stop and ask — explicitly — for at least one product image and a one-line description. Do not proceed without both. Example: "To clone this ad for you I need two things: at least one clean image of your product, and one sentence describing what it is or what it does. Could you send those?"

Optional but useful follow-ups (ask one at a time, only if it matters for the clone):

Brand name / wordmark (if the reference ad has a logo or brand text to swap)
Target audience and brand tone — premium, playful, clinical, raw, bold (only to fill gaps the reference ad leaves open; never to override its structure)
Any specific claim or CTA the user wants on the ad

Only proceed once you can describe the product in one sentence and hold at least one real product image.

Locating and uploading the assets

The Arcads MCP server cannot read local desktop paths, so every reference image (the source static ad and every user product image) must be uploaded to S3 before it can be used:

Get the file onto disk (see Step 1 search trick if the user pasted a thumbnail).
Call arcads_get_upload_url with the file's mimeType (e.g. image/png, image/jpeg). One call per file.
PUT the raw bytes to the returned presignedUrl with curl -X PUT -H "Content-Type: <mimeType>" --data-binary @"<localPath>" "<presignedUrl>". Expect HTTP 200.
Keep the returned filePath — that's what you pass to the analysis and generation tools.

Upload paths expire (~10 min). If a later call fails with REFERENCE_FILE_NOT_FOUND, re-upload the asset and retry with the fresh filePath. When in doubt, upload right before the call that consumes it.

Step 3 — Analyze the reference ad

The clone quality is capped by the detail this analysis extracts, so the prompt asks for art-director-grade specifics about composition, type, color, lighting, and copy — everything you need to reconstruct the frame around a new product.

Call arcads_analyze_media with the uploaded reference ad image and this prompt (adapt the wording naturally, but keep all the requested elements):

You are an expert static-ad art director and AI-image prompt engineer. Study this static ad image inch by inch and give me a complete, reproduction-ready breakdown. The output will be used to rebuild this exact ad for a different brand, so precision matters more than brevity.

**0. Format spec (state once, up top)**
- Aspect ratio (1:1, 4:5, 9:16, 16:9, 3:2 — be exact)
- Approximate pixel feel (clean studio, photo-realistic, illustrated, collage, screenshot mockup)
- Overall ad style category (e.g. UGC product photo, clean e-commerce hero, lifestyle, problem/solution split, before/after, meme/sticker, packshot on color, editorial)
- Language of the on-screen copy

**0b. Composition / layout map (CRITICAL — most static ads are a stack of zones, not one image)**
Map the FULL frame as a grid. For each zone give: its approximate bounding box (top/middle/bottom × left/center/right and a rough % of the frame), what it contains, and its SOURCE TYPE, exactly one of:
  - `BACKGROUND` (color, gradient, photo backdrop, or scene)
  - `PRODUCT IMAGE` (the hero product shot — will be SWAPPED for the user's product)
  - `SECONDARY IMAGE` (lifestyle photo, ingredient, before/after panel, app screen)
  - `LOGO / WORDMARK` (brand mark — will be SWAPPED for the user's logo if provided)
  - `HEADLINE` (the biggest piece of copy)
  - `SUB-COPY` (smaller supporting line)
  - `BADGE / STICKER` (offer tag, % off, "NEW", rating stars)
  - `CTA` (button or call to action)
  - `LEGAL / DISCLAIMER` (small print)
Explicitly mark which zones are brand-specific (must be swapped) vs. structural (must be preserved).

**1. Composition & visual hierarchy**
- Where is the optical focal point? What guides the eye first → second → third?
- Rule-of-thirds / centered / asymmetric / diagonal? Negative space distribution.
- Foreground/midground/background separation.

**2. Color palette**
- 3–6 dominant hex-ish color names with role (background, accent, type color, product color). Be specific ("warm cream #F4E9D8", not "beige").
- Overall temperature and contrast (warm/cool, high/low contrast).

**3. Lighting**
- Direction (e.g. soft top-left key, hard right rim, flat overhead).
- Quality (soft diffused, hard direct, studio softbox, natural window).
- Shadow behavior on the product (length, hardness, color cast).

**4. Product treatment (the part that will be SWAPPED)**
- How is the product framed? (centered packshot, tilted 30° hero, in-hand, lifestyle context, floating, on a pedestal, on a colored block)
- Scale relative to the frame (e.g. "product fills ~45% of the frame, vertically centered").
- Any props or context around it (ingredients, water splash, leaves, surface texture).
- Shadow / reflection / surface contact.
- Camera angle and lens feel (eye-level, top-down, low hero angle, macro).

**5. Typography (each text zone, in order)**
For every text zone, capture:
- Verbatim copy (letter-for-letter, preserving capitalization, punctuation, emoji)
- Role (HEADLINE / SUB-COPY / BADGE / CTA / LEGAL)
- Position (top/center/bottom + left/center/right)
- Font feel (serif/sans/script/display; weight: light/regular/bold/black; case: ALL CAPS / Title / sentence)
- Color and any treatment (outline, drop shadow, highlight, underline, italic)
- Approximate size relative to frame ("headline ~12% of frame height")
- Alignment (left/center/right/justified)

**6. Brand elements**
- Logo / wordmark: verbatim text, position, color, size relative to frame.
- Any other brand marks (icon, mascot, pattern).

**7. Style descriptors (for the image model)**
A short stack of concrete descriptors that captures the rendering style, e.g. "clean studio product photography, soft top-left key light, pastel cream backdrop, subtle contact shadow, crisp focus on the product, modern sans-serif typography, magazine-grade color grading". Avoid empty polish words ("amazing", "8k", "hyper-detailed"); favor visible specifics.

**8. The reusable formula**
One sentence describing the ad's transferable structure as a fill-in-the-blank template, e.g. "[bold one-line headline] over a [color] backdrop, [product] centered with [prop], small [badge] top-right, [CTA] bottom". This is what lets the same layout host a different product.

**9. Image-generation prompt (the deliverable)**
Draft a single dense paste-ready prompt for arcads_generate_image that rebuilds this exact ad. Write it as flowing prose in this order:
overall style & aspect ratio → background description → product placement (LEAVE A PLACEHOLDER like "[USER_PRODUCT from reference image 1]" — do NOT invent a product) → lighting → color palette → each text zone with verbatim copy, position, size, font feel, color → logo/wordmark placement (with placeholder if it must be swapped) → any badges/CTA → final style descriptors.
Be explicit about positions and sizes. End with the aspect ratio.

Poll with arcads_get_asset until status === "GENERATED". Read data.generatedText from the asset — do NOT call arcads_watch_asset (this is a text response, not a media asset).

Step 4 — Repurpose the copy for the user's product

Take the verbatim copy from section 5 of the analysis. Rewrite each text zone for the user's brand:

Preserve structure, rhythm, length, and tone. Punchy stays punchy. Bold claim stays a bold claim. Keep the same syllable count / line breaks where possible.
Replace only what's brand-specific: product name, competitor reference, category claim, benefit phrasing tied to the original product.
Keep every text zone. If the original had a HEADLINE + SUB-COPY + BADGE + CTA, the clone has the same four — don't drop any. Their position and styling stay identical.
Brand name handling. If the user supplied a brand name, swap the wordmark text. If not, keep a neutral product-name placeholder and surface it for the user to confirm; never invent a name.

Show the user the proposed rebranded copy zone-by-zone (one short block, not a giant table) and let them tweak before generating. Use AskUserQuestion for this confirmation:

Looks good — generate it
Tweak the headline (free text)
Tweak something else (free text)

Step 5 — Build the generation prompt and generate

You're rewriting the analysis output as an arcads_generate_image prompt that reproduces the reference frame faithfully, with the product swapped to the user's real product image and the copy swapped to the user's rebranded copy.

5a — Build the prompt (layout-faithful)

Use the section 9 draft from the analysis as the skeleton, then:

Replace every [USER_PRODUCT ...] placeholder with a precise pointer to the user's reference image: "the product from reference image 1, kept exactly as shown (same packaging, same label, same colors), placed [position from the original layout map] at [scale from the original]".
Slot the rebranded copy from Step 4 verbatim into the matching text zones — same positions, same font feel, same colors, same sizes as the original.
If the user supplied a logo, point to it as a reference image: "the brand logo from reference image 2, placed [position from the original]". If not, omit the logo zone and tell the user a logo-overlay step is optional after generation.

5b — Image generation reliability guards (always include)

Bake these into every prompt — they catch the most common failure modes:

Lock the product to the reference image. Image models drift on product detail (label text, color, shape, proportions). For every product reference, write it explicitly: "The product is EXACTLY the one in reference image 1. Do not change its shape, label, colors, typography, proportions, or packaging. Preserve all visible label copy letter-for-letter. Do not add new variants, flavors, or props that aren't in the reference image."
Spell out on-screen copy. The model garbles text (e.g. "ARCADS" → "Arcaces"). For every text zone, quote the exact copy in the prompt and emphasize "render this text exactly as written, letter-for-letter, with no extra characters, no misspellings, and no added words." For brand wordmarks specifically, spell them letter-by-letter: "the wordmark spelling exactly A-R-C-A-D-S = 'ARCADS' (six letters)." Keep all on-screen copy short. Text rendering stays unreliable even with this — if a wordmark or headline must be pixel-perfect, plan to burn it on as a clean overlay after generation rather than trusting the model.
Forbid unprompted extras. Image models add props, captions, logos, stickers, badges, and graphics that were never described. Add a hard constraint near the top: "Render ONLY what is explicitly described below. Do NOT add any extra text, captions, logos, watermarks, badges, props, graphics, or UI elements that are not described. If it is not written here, it must not appear."
Lock the layout. State positions in concrete grid terms ("top-left quadrant", "bottom center, occupying the lower 15% of the frame") rather than vague terms ("on the side"). Restate the aspect ratio at the very end.

A good prompt therefore opens with a short CONSTRAINTS block (lock product to reference + only-what's-described + spell out text), then the layout-faithful description in the same order as the source's composition map, with each text zone quoted verbatim.

5c — Generate three variants with arcads_generate_image

Always generate THREE variants in parallel. Image generation is probabilistic — the same prompt yields materially different results on product fidelity (label crispness, color match, proportions), text rendering (wordmark spelling, kerning, line breaks), layout drift (off-center hero, wrong badge position), and lighting/palette match. Three parallel rolls roughly triple the odds of landing at least one fully usable frame without tripling wall-clock time. This is not optional; never ship a single roll. Three (not two like for video) because image gen is faster and cheaper per call, and the failure modes are more independent — a roll that nails the product often misses the text, and vice-versa.

Make three arcads_generate_image calls in parallel (same tool call batch), all with the same parameters:

prompt: the full layout-faithful prompt from 5a (with the 5b constraints block) — identical across all three rolls
referenceImages: the same uploaded filePaths in the same order across all three rolls — typically the user's product image as reference image 1, then the logo if applicable, then any additional product angles or secondary images. Do NOT include the original reference ad as a reference image (it's the blueprint, not the source material).
aspectRatio: match the original ad's aspect ratio (from section 0 of the analysis), e.g. "1:1", "4:5", "9:16", "16:9".
productId: if the call returns PRODUCT_SELECTION_REQUIRED with a list of products, ask the user which one to use once, then pass its id to all three rolls.

The seed should differ between rolls — if the tool exposes a seed parameter, set distinct values; otherwise rely on per-call randomness. Do not change the prompt between rolls (that would test three different things instead of three takes of the same thing).

Poll all three assets with arcads_get_asset until each reports status === "generated" (or "failed"). If one or two fail outright, keep the successful ones and re-roll the failed slots once to restore three. Never proceed with fewer than two successful frames. Then call arcads_watch_asset on each to get the signed URLs.

Download and open all three variants side-by-side:

curl -sL "<url-1>" -o ~/Downloads/static-ad-clone-v1.png && \
curl -sL "<url-2>" -o ~/Downloads/static-ad-clone-v2.png && \
curl -sL "<url-3>" -o ~/Downloads/static-ad-clone-v3.png && \
open ~/Downloads/static-ad-clone-v1.png ~/Downloads/static-ad-clone-v2.png ~/Downloads/static-ad-clone-v3.png

Compare all three variants before presenting. Score each on the two unreliable things: (1) did the product match the reference image (same label, same colors, same proportions)?, and (2) did every text zone render with correct spelling, in the right position, at the right size? Call these out per variant so the user knows what to look for.

Then summarize briefly, naming each variant and what you preserved and swapped:

"Here are three takes of your cloned static ad. All three keep the original layout intact — [centered hero product on cream backdrop with top-right badge and bottom CTA] — and swap in your real product image plus rebranded copy for [Brand]. • Variant 1 — [one-line note, e.g. 'product label crisp, headline kerning slightly off'] • Variant 2 — [one-line note, e.g. 'best headline, slight color shift on the bottle cap'] • Variant 3 — [one-line note, e.g. 'cleanest overall, CTA button color too light']"

Text-overlay fallback. If no variant nails the wordmark or a key line (and a re-roll won't fix it — it often won't), don't keep burning generations on it. Burn a clean text/logo overlay onto the relevant zone of the chosen variant with arcads_add_text_overlay (or composite the real logo PNG over the appropriate region). This is the reliable way to get pixel-perfect brand text.

Use AskUserQuestion for the final beat:

Variant 1 is the winner — proceed with it
Variant 2 is the winner — proceed with it
Variant 3 is the winner — proceed with it
Combine the best parts — burn a text overlay from one variant's copy onto another variant's product frame (specify which)
All three are weak — re-roll all three — generate three new takes with the same prompt
Wordmark/text garbled on all — burn a clean overlay on the best one (reliable fix)
Product detail wrong on all — re-roll with stronger product-lock language
Layout drifted on all — re-roll restating the composition more strictly
Regenerate with a different direction (free text → what to change, then runs three new takes)

Polling strategy

Three parallel rolls. Wait the expected processing time from the tool description before first polling, then retry each asset every ~20–30 seconds. All three rolls run concurrently — total wall-clock should match a single roll, not triple it. Don't surface polling activity to the user — just say "Generating three takes of your ad…" and come back when all three are done.

Quality bar for the analysis

Ask yourself: could a designer rebuild this exact frame from these words alone, without ever seeing the original? If yes, it's good.

Checklist:

Layout zones with concrete positions and sizes (not "on the side")
Verbatim copy for every text zone, letter-for-letter
Concrete color names (hex-ish, with roles), not "beige"
Lighting direction AND quality (not just "well lit")
Product framing: angle, scale, props, surface contact
Typography: weight, case, alignment, color, size relative to frame
Source type for every zone (BACKGROUND / PRODUCT / LOGO / HEADLINE / …)

Edge cases

No reference static ad yet: do not stop. Trigger the arcads:spy-competitor-ads skill in static mode to source one automatically (see Step 1B). Only ask the user for help if there's no brand/product context to drive the search.
arcads:spy-competitor-ads returns no static ads for the chosen competitors: try one more set of competitors if the user gave brand context to work with, otherwise stop and ask the user to share a reference static ad directly.
No product yet: stop and ask for at least one product image and a one-line description. Do not proceed without both.
No brand name / logo provided: keep a neutral product-name placeholder, omit the wordmark zone (or leave it blank for a post overlay), and flag both clearly to the user. Never invent a brand name.
Reference ad is very text-heavy (e.g. a copy-only ad): treat each text block as its own zone and reproduce hierarchy faithfully; the product image may be small or absent.
Reference ad has multiple panels (split-screen, before/after, comparison): map each panel as its own composition with its own product placement, and clearly state which panels get swapped.
Reference ad is a screenshot mockup (phone UI, app screen): ask the user for a real screenshot of their equivalent screen — do not invent UI.

Quick reference — tools used

Tool	Where
`arcads:spy-competitor-ads` skill (static mode)	Step 1B — auto-source a reference static ad when the user didn't provide one
`arcads_get_upload_url` + `curl -X PUT`	Step 2 — upload the reference ad and user product/logo
`arcads_analyze_media`	Step 3 — extract the layout-faithful description from the reference ad
`arcads_get_asset`	Step 3 + Step 5 — poll for analysis and generation results
`arcads_generate_image`	Step 5 — generate the cloned static ad (called THREE times in parallel for 3 variants, with `referenceImages`)
`arcads_watch_asset`	Step 5 — get the signed URL of the final image
`arcads_add_text_overlay`	Step 5 fallback — burn a clean wordmark/text overlay when the model garbles on-screen text
`open <file>` (after `curl` download)	Inline preview of the final image
`AskUserQuestion`	Copy confirmation + final feedback

clone-static-ad

Invocation

Context Preview

SKILL.md

clone-static-ad

Invocation

Context Preview

SKILL.md

Static Ad Cloner

Golden rules

Step 1 — Get the reference static ad (optional)

Step 2 — Get the product (ask if missing)

Locating and uploading the assets

Step 3 — Analyze the reference ad

Step 4 — Repurpose the copy for the user's product

Step 5 — Build the generation prompt and generate

5a — Build the prompt (layout-faithful)

5b — Image generation reliability guards (always include)

5c — Generate three variants with arcads_generate_image

Polling strategy

Quality bar for the analysis

Edge cases

Quick reference — tools used

Similar Skills

Static Ad Cloner

Golden rules

Step 1 — Get the reference static ad (optional)

Step 2 — Get the product (ask if missing)

Locating and uploading the assets

Step 3 — Analyze the reference ad

Step 4 — Repurpose the copy for the user's product

Step 5 — Build the generation prompt and generate

5a — Build the prompt (layout-faithful)

5b — Image generation reliability guards (always include)

5c — Generate three variants with arcads_generate_image

Polling strategy

Quality bar for the analysis

Edge cases

Quick reference — tools used

Similar Skills