From image-skill
AI audio generation for agents through Image Skill's zero-setup hosted creative runtime. Use when a prompt should become music, sound, or audio without provider credentials, OAuth, local runtime, or per-provider billing. Start with the no-spend guide, choose an audio model, and keep durable hosted audio URLs, recoverable jobs, cost receipts, stable JSON, payments, and feedback in one loop.
How this skill is triggered — by the user, by Claude, or both
Slash command
/image-skill:ai-audio-generationThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
This is an intent-named Image Skill entry for agents searching for audio generation. It uses the same zero-setup hosted Image Skill runtime as the canonical `image-skill` skill: one thin CLI/API, one restricted agent identity, one credit balance, one wallet/payment loop, durable hosted media URLs, recoverable jobs, cost receipts, stable JSON, and hosted feedback.
This is an intent-named Image Skill entry for agents searching for audio generation. It uses the same zero-setup hosted Image Skill runtime as the canonical image-skill skill: one thin CLI/API, one restricted agent identity, one credit balance, one wallet/payment loop, durable hosted media URLs, recoverable jobs, cost receipts, stable JSON, and hosted feedback.
Use this skill when the task asks for AI audio generation, text-to-audio, music generation, sound design, ambience, effects, or durable hosted audio output.
Do not bring provider API keys, create provider accounts, run a local model server, or wire a separate billing account for this task. Start with the no-spend guide command below; when the guide reaches ready_to_create, run data.next_command only if data.next_command_copy_runnable is true and media spend is allowed, otherwise fill data.next_command_missing_inputs or run data.recommended_no_spend_command to verify safely. Keep generated work in Image Skill so future agents can recover and cite it.
npm_config_update_notifier=false npx -y image-skill@latest create --guide --model fal.stable-audio-25-text-to-audio --prompt "warm analog synth bed with soft tape hiss"
npm_config_update_notifier=false npx -y image-skill@latest create --model fal.stable-audio-25-text-to-audio --prompt "PROMPT" --intent explore --max-estimated-usd-per-image 0.20
Prefer the GitHub slug so skills.sh can track the marketplace install:
npx skills add danielgwilson/image-skill-cli --full-depth --skill ai-audio-generation -g -a codex -y
The canonical Image Skill entry remains available as:
npx skills add danielgwilson/image-skill-cli --full-depth --skill image-skill -g -a codex -y
If the Codex/global skill target is read-only or missing, keep the tracked slug
install and point agent skill state at a writable workspace home before
rerunning this intent install. The skills.sh Codex adapter writes to
$HOME/.agents; CODEX_HOME keeps Codex profile state on the same writable
path:
export HOME="$PWD/.agent-home"
export CODEX_HOME="$HOME/.codex"
mkdir -p "$HOME" "$CODEX_HOME"
npx skills add danielgwilson/image-skill-cli --full-depth --skill ai-audio-generation -g -a codex -y
All intent skills in this repo point to the same hosted contract:
If Image Skill lacks the model, capability, latency, policy affordance, or buyer rail needed for this task, use the fallback only for that gap and run image-skill feedback create --json with the attempted command, expected behavior, actual behavior, and missing capability.
Provides CDSS development patterns for drug interaction checking, dose validation, clinical scoring (NEWS2, qSOFA), and alert classification integrated into EMR workflows.
npx claudepluginhub danielgwilson/image-skill-cli --plugin image-skill