From renoise
CLI for Renoise AI platform: generate videos/images, upload materials, register assets, browse characters, poll results. Tool layer for direct platform access.
How this skill is triggered — by the user, by Claude, or both
Slash command
/renoise:renoise-genThis skill is limited to the following tools:
The summary Claude sees in its skill listing — used to decide when to auto-load this skill
CLI for the Renoise AI video/image generation platform. This skill covers **how to use the tools**. For creative decisions (what to generate, how to write prompts, anchoring strategy), see the **director** skill.
CLI for the Renoise AI video/image generation platform. This skill covers how to use the tools. For creative decisions (what to generate, how to write prompts, anchoring strategy), see the director skill.
Platform URL: https://www.renoise.ai — NEVER renoise.com.
# Text-to-Video — 15s finished cut
node ${CLAUDE_SKILL_DIR}/renoise-cli.mjs task generate \
--prompt "[0-5s] Close-up of a cat on the moon, slow push in. [5-12s] The cat dances under twinkling stars, smooth orbit. [12-15s] Wide pull back revealing the full lunar landscape, frame holds steady." \
--duration 15 --ratio 16:9
# Image-to-Video — upload a reference image, then generate
node ${CLAUDE_SKILL_DIR}/renoise-cli.mjs material upload /path/to/photo.jpg
node ${CLAUDE_SKILL_DIR}/renoise-cli.mjs task generate \
--prompt "The product rotates slowly on a white pedestal, soft studio lighting, cinematic." \
--materials "42:ref_image" --duration 10 --ratio 16:9
# Generate Image
node ${CLAUDE_SKILL_DIR}/renoise-cli.mjs task generate \
--prompt "A cute cat sitting on a crescent moon, watercolor style, dreamy atmosphere" \
--model nano-banana-2 --resolution 2k --ratio 1:1
# Generate Image with Text/Typography (use gpt-image-2 for best prompt-following on text/logos)
node ${CLAUDE_SKILL_DIR}/renoise-cli.mjs task generate \
--prompt "Hero product poster with bold headline 'MIDNIGHT BLOOM' in serif type, centered, minimalist layout" \
--model gpt-image-2 --resolution 2k --ratio 16:9
| Model | Type | Duration / Resolution | Aspect Ratios | Notes |
|---|---|---|---|---|
renoise-2.0 / sd-2.0 | Video | 4–15s, 720p/1080p | 9:16, 16:9, 1:1, 4:3, 3:4, 21:9 | Aliases: seedance-2.0, youmeng-2.0; refs image ≤9, video ≤3, audio ≤3; supports watermark/audio generation |
renoise-2.0-fast / sd-2.0-fast | Video | 4–15s, 720p only | 9:16, 16:9, 1:1, 4:3, 3:4, 21:9 | Aliases: seedance-2.0-fast, youmeng-2.0-fast; refs image ≤9, video ≤3, audio ≤3 |
happyhorse-1.0 | Video | 3–15s, 720p/1080p | 16:9, 9:16, 1:1, 4:3, 3:4 | Alias typo accepted: happyhourse-1.0; refs image ≤9, video 0; no last_frame |
kling-3.0-omni | Video | 3/5/10/15s, 720p/1080p | 16:9, 9:16, 1:1, 4:3, 3:4 | refs image ≤7, video ≤1, audio unsupported; prompt ≤2500 chars; ref video 3–10s, dimensions 720–2160px |
nano-banana-2 | Image | 1k, 2k, 4k | 1:1, 2:3, 3:2, 3:4, 4:3, 4:5, 5:4, 9:16, 16:9, 21:9, 1:4, 4:1, 1:8, 8:1 | Google Vertex; widest ratio set |
nano-banana-pro | Image | 1k, 2k, 4k | 1:1, 2:3, 3:2, 3:4, 4:3, 4:5, 5:4, 9:16, 16:9, 21:9 | Google Vertex; higher quality tier |
midjourney-v7 | Image | (no --resolution) | 1:1, 4:3, 3:4, 16:9, 9:16, 3:2, 2:3 | Max 4 ref images; alias midjourney |
gpt-image-2 | Image | 1k, 2k, 4k | 1:1, 3:2, 2:3, 3:4, 4:3, 16:9, 9:16, 21:9 | Max 16 ref images; strongest at text/typography |
Choosing an image model:
nano-banana-2 — default, cheapest, flexible ratios (incl. extreme 8:1 / 1:8 banners).nano-banana-pro — higher fidelity; use for hero / final frames.midjourney-v7 — strongest stylization; pass --ratio only (no resolution).gpt-image-2 — best prompt-following for text/logos/typography in image.node ${CLAUDE_SKILL_DIR}/renoise-cli.mjs <domain> <action> [options]
Domains: task, material, asset, character, credit
node ${CLAUDE_SKILL_DIR}/renoise-cli.mjs credit me # User info + balance
node ${CLAUDE_SKILL_DIR}/renoise-cli.mjs credit estimate --duration 15 # Estimate cost
node ${CLAUDE_SKILL_DIR}/renoise-cli.mjs credit history # Transaction history
# Generate (create + wait + return result)
node ${CLAUDE_SKILL_DIR}/renoise-cli.mjs task generate \
--prompt "..." --duration 15 --ratio 16:9 \
[--materials "..."] [--characters "..."] [--tags "..."]
# Create only (returns immediately)
node ${CLAUDE_SKILL_DIR}/renoise-cli.mjs task create \
--prompt "..." --duration 15 --ratio 16:9
# Management
node ${CLAUDE_SKILL_DIR}/renoise-cli.mjs task list [--status completed] [--tag project-x]
node ${CLAUDE_SKILL_DIR}/renoise-cli.mjs task get <id>
node ${CLAUDE_SKILL_DIR}/renoise-cli.mjs task result <id>
node ${CLAUDE_SKILL_DIR}/renoise-cli.mjs task wait <id> [--interval 10] [--timeout 600]
node ${CLAUDE_SKILL_DIR}/renoise-cli.mjs task cancel <id> # Pending only
# Chaining — download result → upload as material (one step)
node ${CLAUDE_SKILL_DIR}/renoise-cli.mjs task chain <id>
# → prints new material ID, ready for --materials "ID:ref_video"
# Tags
node ${CLAUDE_SKILL_DIR}/renoise-cli.mjs task tags
node ${CLAUDE_SKILL_DIR}/renoise-cli.mjs task tag <id> --tags a,b,c
Parameters for generate / create:
| Parameter | Description | Default |
|---|---|---|
--prompt | (required) English narrative prompt | — |
--model | Model name | renoise-2.0 / sd-2.0 (video) or nano-banana-2 (image) |
--type | Optional video / image; must match model | inferred from model |
--duration | Video duration, model-specific | 5 |
--ratio | Aspect ratio — supported set varies per model (see Supported Models) | 1:1 |
--resolution | Image 1k/2k/4k; video 720p/1080p; omit for midjourney-v7 | model default |
--watermark | Add video watermark and apply 10% credit discount | off |
| `--audio-generation 0 | 1, --no-audio-generation` | Toggle generated audio when model supports it |
--template-id | Create from template | — |
--tags | Comma-separated tags | — |
--materials | Material refs, comma-separated (see Material Roles); role required | — |
--characters | Character refs: id1,id2 or id1:role,id2:role | — |
Task statuses: user-facing statuses are pending, assigned, running, completed, failed, cancelled. Internal states such as submitted, queued, assigning, archiving, and preparing_facepass are collapsed to running by the public API.
node ${CLAUDE_SKILL_DIR}/renoise-cli.mjs material upload /path/to/file.jpg
node ${CLAUDE_SKILL_DIR}/renoise-cli.mjs material upload /path/to/clip.mp4 --type video
node ${CLAUDE_SKILL_DIR}/renoise-cli.mjs material upload /path/to/audio.mp3 --type audio
node ${CLAUDE_SKILL_DIR}/renoise-cli.mjs material list [--type image|video|audio] [--search cat] [--ids 1,2] [--mine]
Register uploaded images as Ark assets so they bypass privacy detection when used as reference_image.
# One-step: create + wait until active (~30-60s)
node ${CLAUDE_SKILL_DIR}/renoise-cli.mjs asset register <material_id> --name "Character Name"
# Step by step
node ${CLAUDE_SKILL_DIR}/renoise-cli.mjs asset create <material_id> --name "Character Name"
node ${CLAUDE_SKILL_DIR}/renoise-cli.mjs asset wait <id>
# Manage
node ${CLAUDE_SKILL_DIR}/renoise-cli.mjs asset list
node ${CLAUDE_SKILL_DIR}/renoise-cli.mjs asset get <id>
node ${CLAUDE_SKILL_DIR}/renoise-cli.mjs asset delete <id>
Asset statuses: pending → processing → active / failed
node ${CLAUDE_SKILL_DIR}/renoise-cli.mjs character list [--category female] [--search Jasmine]
node ${CLAUDE_SKILL_DIR}/renoise-cli.mjs character get <id>
Three mutually exclusive modes for visual input. Do NOT mix modes.
| Mode | --materials value | Description |
|---|---|---|
| First frame only | ID:first_frame | Pin the first frame; prompt drives the rest |
| First + last frame | ID1:first_frame,ID2:last_frame | Pin start and end; model generates transition |
| Multimodal reference | ID:ref_image, ID:ref_video, asset:ID:reference_image | References for style/content guidance |
Mode-level exclusion:
first_framecannot be mixed withref_image.Within multimodal reference mode,
ref_image+ref_video+asset:ID:reference_imagecombine freely — they use different API fields (idvsuser_asset_id).
| Role | Syntax | What it does | Limit |
|---|---|---|---|
| Character asset | asset:ID:reference_image | Locks face/body identity. Bypasses privacy detection. | Multiple OK |
| Reference video | ID:ref_video | Continues motion/scene from previous segment | Up to 3 |
| Reference image | ID:ref_image | Style/environment/palette guidance. No human faces. | Up to 9 (gpt-image-2: 16, midjourney-v7: 4, kling-3.0-omni: 7) |
| Reference audio | ID:ref_audio | Voice/audio reference where supported. Must be paired with image or video. | sd-2.0: up to 3; unsupported on Kling |
| Character library | --characters "ID" | Platform character. Bypasses privacy detection. | Multiple OK |
Combine as needed per segment:
# Character + continuity + environment
--materials "asset:27:reference_image,42:ref_video,99:ref_image"
# Character only
--materials "asset:27:reference_image"
# Environment only (B-roll)
--materials "99:ref_image"
API representation:
"materials": [
{ "user_asset_id": 27, "role": "reference_image" },
{ "id": 42, "role": "ref_video" },
{ "id": 99, "role": "ref_image" }
]
Images with human faces passed as ref_image will be blocked with PrivacyInformation error. Two ways to bypass:
asset register → asset:ID:reference_image--characters "ID"Never pass raw face images as ref_image.
Scan a folder, upload, analyze with Gemini, output structured material-pool.json:
node ${CLAUDE_SKILL_DIR}/scripts/material-ingest.mjs ./materials/
Auto-match materials to shots:
node ${CLAUDE_SKILL_DIR}/scripts/match-materials.mjs --pool material-pool.json --shots project.json
Face detection: materials with has_face: true are automatically excluded from ref_image matching.
The renoise-2.0 model responds best to:
Structure: Subject (appearance) + Action (what happens) + Camera (movement) + Scene (environment) + Style (visual style)
Shot density for time-annotated prompts:
For the full prompt writing guide, see the director skill's prompt-craft.md.
| Mode | Duration | When to use |
|---|---|---|
| Finished Cut | 15s with [0-3s]...[3-10s]...[10-15s]... time annotations | Default — complete scenes with camera changes |
| Clip Stock | 3–5s, single action + camera, no time annotations | Atomic clips for post-production assembly |
Finished Cut is the default. Use Clip Stock when the user needs individual shots to combine in an editor.
When a video exceeds 15s, split into segments:
3, 5, 10, 15.ID:first_frametask chain <id> to turn a completed segment into ref_video material for the nextffmpeg -sseof -0.2 -i generated/shots/S1.mp4 -frames:v 1 -q:v 2 -y generated/keyframes/S1-end.jpg
node ${CLAUDE_SKILL_DIR}/renoise-cli.mjs material upload generated/keyframes/S1-end.jpg
# Use returned material ID as S2 first_frame
node ${CLAUDE_SKILL_DIR}/renoise-cli.mjs task generate \
--prompt "Continuing from the previous shot: <S2 prompt>" \
--duration 15 --ratio 16:9 --materials "ID:first_frame"
node ${CLAUDE_SKILL_DIR}/renoise-cli.mjs task chain <id>
# Use returned material ID as the next segment's ref_video
node ${CLAUDE_SKILL_DIR}/renoise-cli.mjs task generate \
--prompt "Continuing from the previous shot: <S2 prompt>" \
--duration 15 --ratio 16:9 --materials "ID:ref_video"
| Scenario | Approach |
|---|---|
| Character in platform library | --characters "ID" |
| Custom character, have/can generate face image | asset register → --materials "asset:ID:reference_image" |
| Quick one-off, no face image | Text-only description in prompt |
| Product/landscape (no faces) | --materials "ID:ref_image" |
Never pass raw face images as ref_image — privacy detection will block them.
Pick the model by job:
nano-banana-2for drafts / banners / wide aspect ratios,nano-banana-profor hero / final frames,gpt-image-2for anything with text/logos/typography,midjourney-v7for stylized illustration (no--resolution).
node ${CLAUDE_SKILL_DIR}/renoise-cli.mjs task generate \
--prompt "Professional product design sheet showing a sleek wireless headphone from 6 angles: front view, side view, back view, top view, 3/4 perspective, detail close-up. Clean white background, studio lighting, consistent shadow direction." \
--model nano-banana-2 --resolution 2k --ratio 1:1
node ${CLAUDE_SKILL_DIR}/renoise-cli.mjs task generate \
--prompt "A modern minimalist living room at golden hour, floor-to-ceiling windows overlooking a city skyline, warm sunlight, photorealistic, 8K detail." \
--model nano-banana-2 --resolution 2k --ratio 16:9
node ${CLAUDE_SKILL_DIR}/renoise-cli.mjs task generate \
--prompt "Cinematic hero shot: a lone astronaut standing on a red Martian plain, Earth visible in the dusty sky, volumetric sunlight, photorealistic, highly detailed, shallow depth of field." \
--model nano-banana-pro --resolution 2k --ratio 16:9
node ${CLAUDE_SKILL_DIR}/renoise-cli.mjs task generate \
--prompt "Minimalist product poster with bold serif headline 'MIDNIGHT BLOOM' centered, subtitle 'Eau de Parfum' in small caps, dark navy background, gold foil accents." \
--model gpt-image-2 --resolution 2k --ratio 16:9
node ${CLAUDE_SKILL_DIR}/renoise-cli.mjs task generate \
--prompt "Stylized fantasy portrait of an elven archer, moody forest lighting, painterly, intricate details." \
--model midjourney-v7 --ratio 3:4
| Error | Cause | Fix |
|---|---|---|
PrivacyInformation | Face in ref_image | Use User Asset or Character Library instead |
Insufficient credits (402) | Balance too low | credit me, top up at https://www.renoise.ai |
Task failed | Generation failed | task get <id> to check error. Adjust prompt and retry |
Auth Error (401) | Invalid API Key | Check RENOISE_API_KEY env var |
wait timeout | Generation took too long | Multi-anchor tasks take 8–12 min. Use --timeout 900 or task create + task wait --timeout 900 |
| Material upload fails | File too large / wrong format | < 30 MB, supported format (jpg, png, webp, mp4, mov, etc.) |
npx claudepluginhub arcocodes/renoise-plugins-official --plugin renoiseProvides Runway ML public API reference: models, endpoints, costs, limits, types for video generation, image generation, audio, and characters.
Acts as AI creative director for video production including product ads, short films, montages, TikTok e-commerce. Analyzes inputs, writes English prompts, generates assets, submits tasks.
Generates finished multi-shot videos (5–120s) from text, images, URLs, scripts, or audio with auto model selection across 10+ video generation models. Includes AI music, lip sync, and subtitles.