From NVIDIA
Installs, deploys, validates, and stops NVIDIA AI-Q Blueprint infrastructure via Docker Compose, local process, or Kubernetes.
How this skill is triggered — by the user, by Claude, or both
Slash command
/nvidia-skills:aiq-deployThis skill is limited to the following tools:
The summary Claude sees in its skill listing — used to decide when to auto-load this skill
Use this skill to get a local or self-hosted NVIDIA AI-Q Blueprint server running and verified for use by
BENCHMARK.mdevals/evals.jsonreferences/configs.mdreferences/docker-compose.mdreferences/end-to-end-validation.mdreferences/env-and-secrets.mdreferences/frag.mdreferences/kubernetes-helm.mdreferences/local-web.mdreferences/locate-or-clone.mdreferences/shutdown.mdreferences/skill-backend.mdreferences/terminal-cli.mdreferences/troubleshooting.mdreferences/validation.mdskill-card.mdskill.oms.sigUse this skill to get a local or self-hosted NVIDIA AI-Q Blueprint server running and verified for use by
aiq-research.
This skill owns setup, deployment, operational checks, troubleshooting, and shutdown. It does not run deep
research itself. After deployment is healthy, hand off the verified server URL to aiq-research.
The workflow stays explicit so deployment validation and handoff are repeatable across supported agent clients.
Users need:
https://github.com/NVIDIA-AI-Blueprints/aiq.uv for local process or CLI mode.npm for local browser UI development mode.kubectl 1.28+, Helm 3.12+, and access to a Kubernetes cluster for Helm mode.NVIDIA_API_KEY; web research requires at least
one supported search provider key such as TAVILY_API_KEY, SERPER_API_KEY, or EXA_API_KEY.3000. Self-hosted model or RAG deployments may require GPU resources.Before writing secrets, verify deploy/.env is ignored:
git check-ignore deploy/.env
Expected output: deploy/.env or a matching ignore rule. If it is not ignored, stop and fix the ignore rule before
placing credentials in the file.
deploy/.env without overwriting user secrets.AIQ_SERVER_URL for aiq-research.If no AI-Q checkout exists, read references/locate-or-clone.md before cloning. In an existing checkout, confirm the
required files:
pwd
test -f pyproject.toml
test -f deploy/.env.example
test -d configs
Expected output: pwd prints the AI-Q repository path; the test commands exit with status 0 and no output.
If the user asks to install, deploy, set up, or run AI-Q without naming a mode, ask:
How do you want to run AI-Q?
1. Skill backend - backend-only service for aiq-research w/o browser UI.
2. CLI - interactive terminal AI-Q.
3. UI - browser AI-Q app with backend and frontend.
4. Custom - choose an existing AI-Q config or review advanced customization docs before deployment.
Wait for the user's answer before starting services.
Do not ask this question when the user already specified a mode, such as Docker Compose, Helm, UI, CLI, or Agent Skill
backend. Do not ask the full mode question when aiq-research routed here because a deep research request needs a
backend. In that case, prefer Agent Skill backend and ask only for permission to start it if needed.
Read references/env-and-secrets.md before changing deploy/.env.
if [ ! -f deploy/.env ]; then
cp deploy/.env.example deploy/.env
echo "created deploy/.env from deploy/.env.example"
fi
Expected output when the file is missing: created deploy/.env from deploy/.env.example. Expected output when the file
already exists: no output, and the existing file is preserved.
Never print secret values. If credentials are missing, ask the user to update deploy/.env; do not ask them to paste
secret values into chat.
Match the user request, then read the referenced file before acting:
| User Intent | Reference |
|---|---|
| No AI-Q checkout exists, install AIQ, clone AIQ, locate repo | references/locate-or-clone.md |
Configure environment, check API keys, inspect .env | references/env-and-secrets.md |
Choose an AI-Q workflow config, understand config files, set BACKEND_CONFIG or CONFIG_FILE | references/configs.md |
Backend-only local server for aiq-research, AIQ as an Agent Skill | references/skill-backend.md |
| Terminal assistant, CLI-only run, no web UI | references/terminal-cli.md |
| Quick local development run, start UI/backend without containers | references/local-web.md |
| Default durable local deployment, Docker Compose, containers, PostgreSQL | references/docker-compose.md |
| Kubernetes, Helm, cluster deployment | references/kubernetes-helm.md |
| Foundational RAG / FRAG integration | references/frag.md |
Basic health checks, shallow smoke checks, handoff to aiq-research | references/validation.md |
| Optional deep research completion validation | references/end-to-end-validation.md |
| Logs, unhealthy services, port conflicts, config failures | references/troubleshooting.md |
| Stop services, restart, rebuild, safe cleanup | references/shutdown.md |
After startup, read references/validation.md and run the appropriate checks for the selected mode. For the default
local backend, verify health:
curl -sf http://localhost:8000/health
Expected output: a successful JSON health response or an empty successful response depending on the server build. If the
command fails, read references/troubleshooting.md and diagnose before claiming the backend is ready.
aiq-research needs a reachable AI-Q server URL. If the backend is on the default port, no extra configuration is
needed:
AIQ_SERVER_URL=http://localhost:8000
If the backend runs elsewhere, tell the user to set:
export AIQ_SERVER_URL="http://localhost:<PORT>"
Do not continue into deep research or deep research completion validation unless the user asks for it or confirms the post-deploy validation prompt. This skill's success criterion is a deployed and basically validated server, not report generation quality.
IMPORTANT: This skill is designed for NVIDIA AI-Q Blueprint version 2.1.0.
Semantic Versioning Compatibility Rules:
Skill version: X.Y.Z
Blueprint version: A.B.C
Compatible IF:
1. A == X (Major versions MUST match)
2. B >= Y (Minor version must be equal or greater)
3. C can be anything (Patch version does not affect compatibility)
Examples:
If your Blueprint version is not compatible:
deploy/.env or environment variables, not in chat transcripts, shell history, committed files,
or example commands.deploy/.env when it already exists.down -v.RAG_SERVER_URL and RAG_INGEST_URL are configured and reachable.test -f deploy/.env || cp deploy/.env.example deploy/.env
git check-ignore deploy/.env
cd deploy/compose
BUILD_TARGET=release docker compose --env-file ../.env -f docker-compose.yaml config --quiet
BUILD_TARGET=release docker compose --env-file ../.env -f docker-compose.yaml up -d --build aiq-agent
curl -sf http://localhost:8000/health
Expected output:
deploy/.env
<docker compose starts aiq-agent and dependencies>
<health endpoint returns a successful response>
If Docker, ports, credentials, or health checks fail, read references/troubleshooting.md before retrying.
export AIQ_SERVER_URL="http://localhost:8100"
curl -sf "$AIQ_SERVER_URL/health"
Expected output: a successful health response. Then tell the user to keep AIQ_SERVER_URL set before invoking
aiq-research.
| Topic | Documentation |
|---|---|
| Locate or clone AI-Q | references/locate-or-clone.md |
| Environment and secrets | references/env-and-secrets.md |
| Workflow configs | references/configs.md |
| Agent Skill backend | references/skill-backend.md |
| CLI deployment | references/terminal-cli.md |
| Local web deployment | references/local-web.md |
| Docker Compose deployment | references/docker-compose.md |
| Kubernetes and Helm deployment | references/kubernetes-helm.md |
| FRAG integration | references/frag.md |
| Basic validation | references/validation.md |
| End-to-end validation | references/end-to-end-validation.md |
| Troubleshooting | references/troubleshooting.md |
| Shutdown and cleanup | references/shutdown.md |
Symptoms:
8000.curl -sf http://localhost:8000/health reaches an unexpected service or fails.Causes:
PORT in deploy/.env conflicts with an existing process.Solutions:
lsof -nP -iTCP:8000 -sTCP:LISTEN
deploy/.env, such as
PORT=8100.curl -sf http://localhost:8100/health
Symptoms:
Causes:
NVIDIA_API_KEY is missing or empty.Solutions:
references/env-and-secrets.md.deploy/.env; do not ask them to paste secrets into chat.references/validation.md after the user updates credentials.Symptoms:
/health succeeds, but /chat or /v1/jobs/async/agents fails.aiq-research reports that async agents are unavailable.Causes:
BACKEND_CONFIG or CONFIG_FILE points at the wrong AI-Q config.Solutions:
references/configs.md and confirm the selected config is API-enabled.configs/config_web_default_llamaindex.yml.references/validation.md.Symptoms:
docker compose down -v.Causes:
down -v removes Docker volumes.Solutions:
references/shutdown.md.npx claudepluginhub nvidia/skills --plugin nvidia-skillsRuns deep research queries through a local NVIDIA AI-Q Blueprint backend. Use for research-shaped requests like 'deep research on...' or 'ask AI-Q about...'.
Builds, deploys, and debugs Hugging Face Spaces for ML applications using Gradio, Docker, or Static SDKs. Covers ZeroGPU and dedicated hardware setup, model loading, inference providers, and troubleshooting failing builds.
Generates validated, runnable implementation plans for ML pipelines, architecture designs, and multi-step projects grounded in official framework documentation.