From klingai-pack
Catalogs Kling AI models for video (T2V/I2V), image generation, lip sync, and effects with versions, speeds, qualities, resolutions, and costs for selection.
Slash command: /klingai-pack:klingai-model-catalog
Kling AI offers multiple model versions across video generation, image generation, lip sync, virtual try-on, and effects. Each version trades off quality, speed, and cost. This skill is the reference for choosing the right model.
| Model ID | Supports | Max Duration | Resolution | Speed | Quality |
|---|---|---|---|---|---|
| kling-v1 | T2V, I2V | 10s | 720p | Fast | Good |
| kling-v1-5 | I2V only | 10s | 1080p | Fast | Better |
| kling-v1-6 | T2V, I2V | 10s | 1080p | Medium | Better+ |
| kling-v2-master | T2V, I2V | 10s | 1080p | Medium | High |
| kling-v2-1 | I2V only | 10s | 1080p | Medium | High |
| kling-v2-1-master | T2V, I2V | 10s | 1080p | Medium | High |
| kling-v2-5-turbo | T2V, I2V | 10s | 1080p 30fps | Fast | High |
| kling-v2-6 | T2V, I2V | 10s | 1080p 30-48fps | Medium | Highest |
T2V = text-to-video, I2V = image-to-video
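
The video model table can be encoded as data so a script can filter models by capability. This is a minimal sketch: `VIDEO_MODELS` and `models_supporting` are hypothetical names, not part of the Kling AI API, and the values are copied from the table above.

```python
# Hypothetical in-memory copy of the video model table (not fetched from the API).
VIDEO_MODELS = {
    "kling-v1": {"supports": {"T2V", "I2V"}, "resolution": "720p"},
    "kling-v1-5": {"supports": {"I2V"}, "resolution": "1080p"},
    "kling-v1-6": {"supports": {"T2V", "I2V"}, "resolution": "1080p"},
    "kling-v2-master": {"supports": {"T2V", "I2V"}, "resolution": "1080p"},
    "kling-v2-1": {"supports": {"I2V"}, "resolution": "1080p"},
    "kling-v2-1-master": {"supports": {"T2V", "I2V"}, "resolution": "1080p"},
    "kling-v2-5-turbo": {"supports": {"T2V", "I2V"}, "resolution": "1080p 30fps"},
    "kling-v2-6": {"supports": {"T2V", "I2V"}, "resolution": "1080p 30-48fps"},
}


def models_supporting(task, resolution=None):
    """Return model IDs that support `task` ("T2V" or "I2V").

    If `resolution` is given (e.g. "1080p"), keep only models whose
    resolution entry starts with that string.
    """
    return [
        model_id
        for model_id, spec in VIDEO_MODELS.items()
        if task in spec["supports"]
        and (resolution is None or spec["resolution"].startswith(resolution))
    ]


# Text-to-video models at 1080p, in table order.
print(models_supporting("T2V", "1080p"))
```

Keeping the table as a plain dict means the I2V-only models (kling-v1-5, kling-v2-1) drop out of any T2V query automatically.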
kling-v2-6 accepts motion_has_audio: true for synchronized audio.

| Model ID | Purpose | Resolution |
|---|---|---|
| kolors-v1-5 | Face/subject reference | Up to 2048x2048 |
| kolors-v2-0 | Image restyle | Up to 2048x2048 |
| kolors-v2-1 | Text-to-image | Up to 2048x2048 |
| Feature | Endpoint | Model Versions |
|---|---|---|
| Lip Sync | /v1/videos/lip-sync | v1.6+ |
| Virtual Try-On | /v1/images/kolors-virtual-try-on | v1.5 |
| Video Extension | /v1/videos/video-extend | All video models |
| Effects | /v1/videos/effects | v1.6+ |
| Motion Control | T2V/I2V with camera_control | v1.6+ |
Every video generation request accepts a mode parameter:
| Mode | Credits (5s) | Credits (10s) | Use Case |
|---|---|---|---|
| standard | 10 | 20 | Drafts, previews, iteration |
| professional | 35 | 70 | Final output, client delivery |
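
The credit figures in the mode table lend themselves to a quick budget estimate before submitting a batch. The sketch below is illustrative only: `CREDITS_PER_5S` and `estimate_credits` are hypothetical names, and it assumes the only durations are 5s and 10s, priced exactly as the table lists.

```python
# Credits per 5-second clip, copied from the mode table.
CREDITS_PER_5S = {"standard": 10, "professional": 35}


def estimate_credits(mode, duration_s, clips=1):
    """Estimate total credits for `clips` videos of `duration_s` seconds each."""
    if mode not in CREDITS_PER_5S:
        raise ValueError(f"unknown mode: {mode!r}")
    if duration_s not in (5, 10):
        raise ValueError("the table only lists 5s and 10s durations")
    # A 10s clip costs twice the 5s price in both modes.
    return CREDITS_PER_5S[mode] * (duration_s // 5) * clips


# Three 10-second finals in professional mode: 35 * 2 * 3 = 210 credits.
print(estimate_credits("professional", 10, clips=3))
```

Iterating in standard mode and rendering only the final take in professional mode keeps most of a budget for drafts, since a professional clip costs 3.5 times a standard one.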
Need fastest generation?
→ kling-v2-5-turbo + standard mode
Need highest quality?
→ kling-v2-6 + professional mode
Need audio in the video?
→ kling-v2-6 with motion_has_audio: true
Image-to-video only?
→ kling-v2-1 (optimized for I2V)
Budget-conscious production?
→ kling-v2-5-turbo + standard mode (10 credits/5s)
Legacy compatibility?
→ kling-v1-6 (stable, well-documented)
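
The decision guide above can be written as a small function. This is a sketch under stated assumptions: `pick_model` is a hypothetical helper, the order in which the questions are checked is an arbitrary choice (the guide does not rank them), and placing `motion_has_audio` alongside `model_name` is assumed rather than confirmed by the guide.

```python
def pick_model(need_audio=False, highest_quality=False, i2v_only=False, legacy=False):
    """Map the decision guide's questions to suggested request settings.

    The priority order (audio, quality, I2V-only, legacy, then speed/budget)
    is an assumption made for this sketch.
    """
    if need_audio:
        # Field placement of motion_has_audio is assumed, not confirmed.
        return {"model_name": "kling-v2-6", "motion_has_audio": True}
    if highest_quality:
        return {"model_name": "kling-v2-6", "mode": "professional"}
    if i2v_only:
        return {"model_name": "kling-v2-1"}
    if legacy:
        return {"model_name": "kling-v1-6"}
    # Fastest generation and budget-conscious production share one answer.
    return {"model_name": "kling-v2-5-turbo", "mode": "standard"}


print(pick_model(highest_quality=True))
```

The returned dict can be merged into a request payload next to `prompt`, `duration`, and `aspect_ratio`.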
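
    import requests

    # Assumed placeholders: substitute your own API base URL and auth token.
    BASE = "https://api.klingai.com/v1"
    headers = {"Authorization": "Bearer <your-api-token>"}

    # Specify model in any video generation request
    response = requests.post(f"{BASE}/videos/text2video", headers=headers, json={
        "model_name": "kling-v2-6",   # model version
        "mode": "professional",       # standard or professional
        "prompt": "A futuristic city at sunset with flying cars",
        "duration": "5",              # seconds, passed as a string
        "aspect_ratio": "16:9",
    })
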
| Ratio | Use Case |
|---|---|
| 16:9 | Landscape, YouTube, presentations |
| 9:16 | Vertical, TikTok, Reels, Stories |
| 1:1 | Square, Instagram, thumbnails |
| 4:3 | Classic TV, presentations |
| 3:4 | Portrait photos |
| 3:2 | Standard photography |
| 2:3 | Tall portrait |
| 21:9 | Ultra-wide, cinematic |
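
When the target dimensions are known in pixels, the closest listed ratio can be picked automatically. A minimal sketch: `SUPPORTED_RATIOS` and `nearest_aspect_ratio` are hypothetical names, and the list simply mirrors the table above; whether every model accepts every ratio is not something this page states.

```python
import math

# Ratios copied from the aspect ratio table.
SUPPORTED_RATIOS = ["16:9", "9:16", "1:1", "4:3", "3:4", "3:2", "2:3", "21:9"]


def nearest_aspect_ratio(width, height):
    """Return the listed ratio string closest to width/height.

    Distance is measured on a log scale so that wide and tall shapes
    are treated symmetrically.
    """
    target = width / height

    def distance(ratio):
        w, h = ratio.split(":")
        return abs(math.log((int(w) / int(h)) / target))

    return min(SUPPORTED_RATIOS, key=distance)


print(nearest_aspect_ratio(1920, 1080))  # 16:9
print(nearest_aspect_ratio(1080, 1920))  # 9:16
```

The result can be passed directly as the `aspect_ratio` value in a generation request.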
npx claudepluginhub jeremylongshore/claude-code-plugins-plus-skills --plugin klingai-pack