From runpod
Download HuggingFace models on Colab and upload to RunPod Network Volume via S3-compatible API. Pre-deploy models before GPU instance startup to save billing time. Use when user mentions "runpod model upload", "network volume model", "colab to runpod", "HuggingFace model to runpod", "S3 sync runpod".
How this skill is triggered — by the user, by Claude, or both
Slash command
/runpod:prepare-model-uploadThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
Downloading large models during GPU instance runtime wastes billing time.
Downloading large models during GPU instance runtime wastes billing time. Use Colab to download from HuggingFace and upload to RunPod Network Volume via S3-compatible API.
HF_TOKEN: HuggingFace access tokenRUNPOD_STORAGE_ACCESS_KEY_ID: RunPod Storage Access Key IDRUNPOD_STORAGE_SECRET_ACCESS_KEY: RunPod Storage Secret Access KeyGet from https://console.runpod.io/user/storage "S3 Compatible API Commands" Example:
aws s3 ls --region xxx --endpoint-url https://s3api-xxx.runpod.io s3://your-volume-id/
aws s3 sync may fail on large volumes:
fatal error: Error during pagination: The same next token was received twice: ...
Use aws s3 cp --recursive instead (no delta transfer).
If upload failed occurs, retry with --checksum-algorithm=CRC32C.
The notebook has separate cells:
When user provides model names or aws cli command examples, output code snippets that can be directly copy-pasted into Colab cells.
Output a ready-to-paste Settings cell:
# Settings
# HuggingFace models (USER/REPOSITORY format, multiple allowed)
HF_MODELS = [
"USER_PROVIDED_MODEL", # parsed from user input
]
# RunPod Storage (copy from https://console.runpod.io/user/storage)
REGION = "" # @param {type:"string"}
ENDPOINT_URL = "" # @param {type:"string"}
BUCKET = "" # @param {type:"string"}
Parse the command and output a ready-to-paste Settings cell with values filled:
Example input:
aws s3 ls --region us-east-1 --endpoint-url https://s3api-xxxxxx.runpod.io s3://abc123def456/
Output:
# Settings
# HuggingFace models (USER/REPOSITORY format, multiple allowed)
HF_MODELS = [
"", # Add your model here, e.g., "Qwen/Qwen3-8B"
]
# RunPod Storage (parsed from aws cli command)
REGION = "us-east-1" # @param {type:"string"}
ENDPOINT_URL = "https://s3api-xxxxxx.runpod.io" # @param {type:"string"}
BUCKET = "abc123def456" # @param {type:"string"}
Combine both into a complete Settings cell ready to run.
User: "I want to put Qwen3-8B on my runpod network volume"
HF_MODELS = [
"Qwen/Qwen3-8B",
]
User: "aws s3 ls --region us-east-1 --endpoint-url https://s3api-xxx.runpod.io s3://my-bucket/"
Parse and output Settings cell with storage values filled.
npx claudepluginhub pokutuna/claude-plugins --plugin runpodPackages and builds custom AI models with Cog for deployment on Replicate. Covers cog.yaml, predict.py, GPU/CUDA setup, and Docker image creation.
CLI for Hugging Face Hub: download/upload models, datasets, spaces; manage repos, buckets, jobs, endpoints, and auth.
Manage Hugging Face Hub resources via the `hf` CLI: download/upload models, datasets, and Spaces, authenticate, and sync files.