From business-intelligence-skills
Generates talking photo videos from images and creates videos using Flyworks/HiFly digital human avatars with TTS or voice cloning.
How this skill is triggered — by the user, by Claude, or both
Slash command
/business-intelligence-skills:flyworks-avatar-video-1.0.0The summary Claude sees in its skill listing — used to decide when to auto-load this skill
This skill allows you to generate videos using Flyworks (a.k.a HiFly 飞影数字人) Digital Humans. Available features:
This skill allows you to generate videos using Flyworks (a.k.a HiFly 飞影数字人) Digital Humans. Available features:
For detailed documentation, see the references/ folder:
This skill works with a default free-tier token, but it has limitations:
To remove limitations:
export HIFLY_API_TOKEN="your_token_here"scripts/hifly_client.pyThe main entry point for all operations.
# List available public avatars
python scripts/hifly_client.py list_public_avatars
# List available public voices
python scripts/hifly_client.py list_public_voices
# Create a video with a public avatar (TTS)
python scripts/hifly_client.py create_video --type tts --text "Hello world" --avatar "avatar_id_or_alias" --voice "voice_id_or_alias"
# Create a video with a public avatar (Audio URL or File)
python scripts/hifly_client.py create_video --audio "https://... or path/to/audio.mp3" --avatar "avatar_id_or_alias"
# Create a talked photo video using bundled assets
python scripts/hifly_client.py create_talking_photo --image assets/avatar.png --title "Bundled Avatar"
# Clone a voice using bundled assets
python scripts/hifly_client.py clone_voice --audio assets/voice.MP3 --title "Bundled Voice"
# Check status of generated tasks
python scripts/hifly_client.py check_task --id "TASK_ID"
# Manage local aliases (saved in memory.json)
python scripts/hifly_client.py manage_memory add my_avatar "av_12345"
python scripts/hifly_client.py manage_memory list
# First find a voice and avatar
python scripts/hifly_client.py list_public_avatars
python scripts/hifly_client.py list_public_voices
# Generate
python scripts/hifly_client.py create_video --type tts --text "Welcome to our service." --avatar "av_public_01" --voice "voice_public_01"
# Create the avatar from an image URL
python scripts/hifly_client.py create_talking_photo --image "https://mysite.com/photo.jpg" --title "CEO Photo"
# Output will give you an Avatar ID, e.g., av_custom_99
# Save it to memory
python scripts/hifly_client.py manage_memory add ceo av_custom_99
# Generate video using the new avatar
python scripts/hifly_client.py create_video --type tts --text "Here is the quarterly report." --avatar ceo --voice "voice_public_01"
When assisting users with video generation, follow these guidelines:
Video generation requires both text AND a voice. If the user provides text but no voice:
Check local memory first: Run manage_memory list to see if the user has saved any voice aliases.
Ask the user to choose:
Help them select:
list_public_voicesclone_voice --audio [file] --title [name]For a prompt like "Create a talking photo video from my photo saying 'this is my AI twin'":
clone_voicepython scripts/hifly_client.py create_talking_photo \
--image user_photo.jpg \
--text "this is my AI twin" \
--voice SELECTED_VOICE_ID \
--title "My AI Twin"
After creating avatars or cloning voices, offer to save them:
python scripts/hifly_client.py manage_memory add my_avatar AVATAR_ID --kind avatar
python scripts/hifly_client.py manage_memory add my_voice VOICE_ID --kind voice
npx claudepluginhub freestylefly/canghe-skills --plugin content-skillsProvides Runway ML public API reference: models, endpoints, costs, limits, types for video generation, image generation, audio, and characters.
Pipeline for AI avatar video production: single avatar, translation, batch, and hybrid real+AI workflows. Tools: HeyGen, Synthesia, ElevenLabs, Captions, Rask AI, Vbee. Includes voice cloning, anti-detection, and VN ethics compliance.
Generates videos from text prompts or images, animates still images, and creates talking avatars from photos with audio using Kling AI models (VIDEO 3.0, Avatar 2.0, etc.). Handles multi-shot storyboards, character consistency, and prompt engineering.