From envoy-ai-gateway-adopters
Install Envoy AI Gateway and Envoy Gateway with production-ready configuration for AI workloads
How this skill is triggered — by the user, by Claude, or both
Slash command
/envoy-ai-gateway-adopters:aigw-installThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
Install Envoy AI Gateway into a Kubernetes cluster. Envoy AI Gateway is built on top of Envoy Gateway and requires Envoy Gateway to be installed first. This skill generates the Helm install commands, verifies deployments, and confirms the setup is ready for AI workloads.
Install Envoy AI Gateway into a Kubernetes cluster. Envoy AI Gateway is built on top of Envoy Gateway and requires Envoy Gateway to be installed first. This skill generates the Helm install commands, verifies deployments, and confirms the setup is ready for AI workloads.
kubectl and helm installedDetermine versions and namespace. If the user did not provide values, use these defaults:
v0.5.0 (latest stable)v1.7.0 (Envoy Gateway latest stable; AI Gateway requires v1.6.x+)envoy-ai-gateway-systemEnvoy AI Gateway extends Envoy Gateway. Install Envoy Gateway first with the AI Gateway-specific values file that enables the Backend API and extension manager:
helm upgrade -i eg oci://docker.io/envoyproxy/gateway-helm \
--version ${EnvoyGatewayVersion} \
--namespace envoy-gateway-system \
--create-namespace \
-f https://raw.githubusercontent.com/envoyproxy/ai-gateway/main/manifests/envoy-gateway-values.yaml
kubectl wait --timeout=5m -n envoy-gateway-system deployment/envoy-gateway --for=condition=Available
Note: The
envoy-gateway-values.yamlfile enablesenableBackend: true,enableEnvoyPatchPolicy: true, and configures the extension manager to connect to the AI Gateway controller. If you cannot fetch from GitHub, download the file locally and use-f ./envoy-gateway-values.yaml.
Install the CRD Helm chart first:
helm upgrade -i aieg-crd oci://docker.io/envoyproxy/ai-gateway-crds-helm \
--version ${AIGatewayVersion} \
--namespace ${Namespace} \
--create-namespace
Install the AI Gateway Helm chart:
helm upgrade -i aieg oci://docker.io/envoyproxy/ai-gateway-helm \
--version ${AIGatewayVersion} \
--namespace ${Namespace} \
--create-namespace
kubectl wait --timeout=5m -n ${Namespace} deployment/ai-gateway-controller --for=condition=Available
The envoy-gateway-values.yaml references the AI Gateway controller service. Ensure the FQDN matches your setup. Default:
ai-gateway-controller.envoy-ai-gateway-system.svc.cluster.local:1063
If you use a different namespace for AI Gateway, update the config.envoyGateway.extensionManager.service.fqdn.hostname in the Envoy Gateway values before or during install.
kubectl get pods -n envoy-gateway-system
kubectl get pods -n ${Namespace}
All pods should be in Running state with Ready status.
For token-based rate limiting or InferencePool (self-hosted model routing), add the corresponding values when installing Envoy Gateway:
# With rate limiting and InferencePool addons
helm upgrade -i eg oci://docker.io/envoyproxy/gateway-helm \
--version ${EnvoyGatewayVersion} \
-n envoy-gateway-system \
--create-namespace \
-f https://raw.githubusercontent.com/envoyproxy/ai-gateway/main/manifests/envoy-gateway-values.yaml \
-f https://raw.githubusercontent.com/envoyproxy/ai-gateway/main/examples/token_ratelimit/envoy-gateway-values-addon.yaml \
-f https://raw.githubusercontent.com/envoyproxy/ai-gateway/main/examples/inference-pool/envoy-gateway-values-addon.yaml
For production, consider these overrides:
# values-production.yaml
controller:
replicas: 2
resources:
requests:
cpu: 100m
memory: 256Mi
limits:
cpu: 500m
memory: 1024Mi
extProc:
resources:
requests:
cpu: 100m
memory: 128Mi
limits:
cpu: 500m
memory: 512Mi
For per-Gateway ExtProc customization (resources, env vars), use GatewayConfig and annotate the Gateway:
apiVersion: aigateway.envoyproxy.io/v1alpha1
kind: GatewayConfig
metadata:
name: my-gateway-config
namespace: default # Same as Gateway
spec:
extProc:
kubernetes:
resources:
requests:
cpu: 200m
memory: 256Mi
limits:
cpu: 1000m
memory: 1Gi
---
# Annotate your Gateway
apiVersion: gateway.networking.k8s.io/v1
kind: Gateway
metadata:
name: my-gateway
namespace: default
annotations:
aigateway.envoyproxy.io/gateway-config: my-gateway-config
spec:
# ...
Install with production values:
helm upgrade -i aieg oci://docker.io/envoyproxy/ai-gateway-helm \
--version ${AIGatewayVersion} \
-n ${Namespace} \
--create-namespace \
-f values-production.yaml
To upgrade AI Gateway:
# Upgrade CRDs first
helm upgrade -i aieg-crd oci://docker.io/envoyproxy/ai-gateway-crds-helm \
--version ${AIGatewayVersion} \
-n ${Namespace}
# Then upgrade controller
helm upgrade -i aieg oci://docker.io/envoyproxy/ai-gateway-helm \
--version ${AIGatewayVersion} \
-n ${Namespace}
Important: If you previously installed only
ai-gateway-helm(without separate CRDs), first install the CRD chart with--take-ownershipto transfer CRD ownership before upgrading.
connection.bufferLimit (e.g., 50Mi) on your Gateway. See /aigw-route for an example.envoy-gateway-values.yamlnpx claudepluginhub missberg/envoy-skills --plugin envoy-ai-gateway-adoptersDeploys AI inference services on CoreWeave Kubernetes using Helm charts and Kustomize for GPU scaling and multi-model setups.
Walks through setting up AI Runway on an existing AKS cluster: cluster verification, controller install, GPU assessment, provider setup, and first model deployment.
Deploys vLLM OpenAI-compatible server to Kubernetes with GPU support, health probes, and services via YAML templates. Checks HF token secret and existing deployments before applying.