dgx-spark

Name: dgx-spark
Author: jeremyeder

NVIDIA DGX Spark integration for Claude Code — local model serving, GPU monitoring, VM management, and hybrid AI workflows

npx claudepluginhub jeremyeder/dgx-agentskills --plugin dgx-spark

Popularity

Stars: Med 0 · Avg 285
Installs: Med 0 · Avg 1

What's Inside

Slash Commands (3)

/spark-models

Manage AI models on the DGX Spark.

/spark-status

Quick health check for the DGX Spark. Reports system status, GPU utilization, running models, and VPN connectivity.

/spark-switch

Toggle Claude Code's model backend between Anthropic API and DGX Spark.

Agents (1)

spark-monitor

/spark-monitor

Background health monitoring agent for the DGX Spark. Periodically checks system status and alerts on state changes. Use when you want ongoing awareness of Spark health during a long session.

Skills (5)

spark-hybrid

/spark-hybrid

Configure Claude Code to use the DGX Spark as a model backend — full local, hybrid (Opus primary + Spark for subagents), or failover mode. Use when switching between local and cloud inference, pointing Claude Code at Spark, or setting up hybrid workflows. Triggers on: "use local model", "switch to Spark", "switch to Anthropic", "hybrid mode", "point Claude Code at Spark", "use Spark for subagents".

spark-models

/spark-models

Manage AI models on the DGX Spark — list, pull, serve, stop, and recommend models across Ollama and vLLM backends. Use when deploying models, checking what's running, pulling new models, or getting recommendations for a use case. Triggers on: model names (Qwen, Llama, DeepSeek, Gemma), "serve model", "pull model", "what models are running", "deploy model on Spark".

spark-setup

/spark-setup

Set up and provision an NVIDIA DGX Spark from scratch or after factory reset. Use when configuring a new Spark, recovering from reset, or verifying system state. Triggers on: "set up DGX Spark", "configure Spark", "provision Spark", "factory reset".

spark-vms

/spark-vms

Manage KVM/QEMU virtual machines on the DGX Spark. Create, start, stop, and snapshot VMs on the ARM64 hypervisor. Use when running VMs, creating virtual machines, or managing virtualization on the Spark. Triggers on: "create VM on Spark", "virtual machine", "KVM", "run Windows on Spark".

spark-vpn

/spark-vpn

Set up and manage Tailscale VPN on the DGX Spark for remote access. Use when configuring remote access, setting up Tailscale, or troubleshooting VPN connectivity. Triggers on: "Tailscale", "VPN", "remote access to Spark", "access Spark from outside".

MCP Servers (1)

dgx-spark (External)

Stats

Version: 0.1.0
Stars: 0
Maintenance: Good
License: MIT
Last Commit: Mar 15, 2026
Added: Mar 24, 2026

Available In

dgx-agentskills

Safety Signals

Caution

External network access: connects to servers outside your machine
Uses power tools: uses Bash, Write, or Edit tools

README

DGX AgentSkills

A Claude Code plugin for integrating the NVIDIA DGX Spark into AI development workflows. Provides local model serving, GPU monitoring, VM management, and hybrid local+cloud inference — all accessible through skills, commands, and MCP tools within Claude Code.

Prerequisites

NVIDIA DGX Spark with SSH access configured
Node.js 20+ (for development)
Claude Code installed
Docker on the DGX Spark (pre-installed with DGX OS)

Quickstart

1. Configure

cp .env.example .env
# Edit .env with your Spark's hostname and SSH user

2. Deploy MCP Server to Spark

./deploy/install.sh

This rsyncs the project to your Spark, builds the MCP server's Docker image, and starts the container.

3. Connect to MCP Server

Update .mcp.json with your Spark's hostname:

{
  "mcpServers": {
    "dgx-spark": {
      "type": "http",
      "url": "http://YOUR-SPARK-HOSTNAME.local:3100/mcp"
    }
  }
}

Replace YOUR-SPARK-HOSTNAME with your Spark's actual hostname (e.g., jeder-spark). If using Tailscale for remote access, use the Tailscale hostname instead (e.g., http://jeder-spark:3100/mcp).

Claude Code reads this file to discover the MCP server. Without it, commands like /spark-status and all spark_* MCP tools will be unavailable.
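As a minimal sketch of the hostname substitution described above, the following Python snippet builds the same `.mcp.json` structure from a hostname. The helper name `build_mcp_config` and its `tailscale` flag are illustrative assumptions, not part of the plugin; the port (3100) and the `/mcp` path are taken from the example above.

```python
import json


def build_mcp_config(hostname: str, tailscale: bool = False, port: int = 3100) -> dict:
    """Return the .mcp.json structure for a given Spark hostname.

    Hypothetical helper: on the local network the README example uses the
    mDNS name (<hostname>.local); over Tailscale it uses the bare hostname.
    """
    host = hostname if tailscale else f"{hostname}.local"
    return {
        "mcpServers": {
            "dgx-spark": {
                "type": "http",
                "url": f"http://{host}:{port}/mcp",
            }
        }
    }


def render_mcp_config(hostname: str, tailscale: bool = False) -> str:
    """Serialize the config as indented JSON, ready to write to .mcp.json."""
    return json.dumps(build_mcp_config(hostname, tailscale), indent=2)
```

Calling `render_mcp_config("jeder-spark")` yields the local-network variant; passing `tailscale=True` drops the `.local` suffix.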

4. Install Plugin

# Add the marketplace (one-time)
claude plugin marketplace add jeremyeder/dgx-agentskills

# Install the plugin
claude plugin install dgx-spark@dgx-agentskills --scope user

Or from within a Claude Code session:

/plugin marketplace add jeremyeder/dgx-agentskills
/plugin install dgx-spark@dgx-agentskills

5. Verify

/spark-status

6. Pull a Model

/spark-models pull qwen3.5:32b

7. Serve via vLLM (for Claude Code integration)

/spark-models serve Qwen/Qwen3-Coder-Next --vllm
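vLLM exposes an OpenAI-compatible HTTP API, so one way to confirm that the model is being served is to query its `/v1/models` route. The sketch below assumes the default vLLM endpoint from the Configuration section (`http://your-spark.local:8000`); the function names are illustrative and not part of the plugin.

```python
import json
import urllib.request
from typing import List


def models_url(vllm_endpoint: str) -> str:
    """Build the URL of vLLM's OpenAI-compatible model listing route."""
    return vllm_endpoint.rstrip("/") + "/v1/models"


def parse_model_ids(body: str) -> List[str]:
    """Extract model IDs from an OpenAI-style {"data": [{"id": ...}]} response."""
    return [entry["id"] for entry in json.loads(body).get("data", [])]


def list_served_models(vllm_endpoint: str, timeout: float = 5.0) -> List[str]:
    """Fetch and parse the model list (needs a reachable vLLM server)."""
    with urllib.request.urlopen(models_url(vllm_endpoint), timeout=timeout) as resp:
        return parse_model_ids(resp.read().decode("utf-8"))
```

If the served model's ID appears in the returned list, the vLLM container is up and answering requests.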

8. Switch Claude Code to Local Backend

/spark-switch local

Skills

| Skill | Description |
| --- | --- |
| `spark-setup` | Reproducible provisioning from scratch or after factory reset |
| `spark-models` | Model lifecycle management across Ollama and vLLM |
| `spark-hybrid` | Configure Claude Code to use Spark as model backend |
| `spark-vpn` | Tailscale VPN setup for remote access |
| `spark-vms` | KVM/QEMU virtual machine management |

Commands

| Command | Description |
| --- | --- |
| `/spark-status` | Quick health check: system, GPU, models, VPN |
| `/spark-models [action] [model]` | List, pull, serve, stop, or recommend models |
| `/spark-switch [mode]` | Toggle between local, cloud, and hybrid backends |

MCP Tools

| Tool | Description |
| --- | --- |
| `spark_get_status` | System overview: uptime, CPU, memory, disk |
| `spark_gpu_utilization` | GPU memory, compute %, temperature, power |
| `spark_list_models` | All models across Ollama and vLLM |
| `spark_pull_model` | Pull a model via Ollama |
| `spark_start_model` | Start a vLLM container with tool-calling support |
| `spark_stop_model` | Stop a model container |
| `spark_list_containers` | All Docker containers on Spark |
| `spark_container_logs` | Tail container logs |
| `spark_vpn_status` | Tailscale connection state and peers |
| `spark_health_check` | MCP server health with latency |
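MCP clients invoke tools with JSON-RPC 2.0 `tools/call` requests. As a rough illustration, the snippet below builds the request body for one of the tools listed above. It assumes the tool takes no arguments, which the README does not specify, and it omits the `initialize` handshake that an MCP client performs before calling tools. `tool_call_request` is a hypothetical helper name.

```python
import json
from typing import Optional


def tool_call_request(tool_name: str, arguments: Optional[dict] = None, request_id: int = 1) -> dict:
    """Build a JSON-RPC 2.0 body for an MCP tools/call request."""
    return {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {"name": tool_name, "arguments": arguments or {}},
    }


# Assumption: spark_gpu_utilization accepts an empty argument object.
body = json.dumps(tool_call_request("spark_gpu_utilization"))
```

In normal use Claude Code sends these requests itself; building one by hand is mainly useful when debugging the server.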

Configuration

Mac-side `.env` (repo root)

| Variable | Description | Default |
| --- | --- | --- |
| `SPARK_MCP_URL` | MCP server URL | `http://your-spark.local:3100` |
| `SPARK_MCP_URL_TAILSCALE` | MCP URL via Tailscale | `http://your-spark:3100` |
| `SPARK_HOST` | Spark hostname for SSH | `your-spark.local` |
| `SPARK_USER` | SSH username | `jeder` |
| `SPARK_VLLM_ENDPOINT` | vLLM API endpoint | `http://your-spark.local:8000` |
| `SPARK_OLLAMA_ENDPOINT` | Ollama API endpoint | `http://your-spark.local:11434` |
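The two MCP URLs suggest a simple fallback: try the local-network address first and use the Tailscale address if it is unreachable. The sketch below illustrates that idea with a TCP connect probe. Whether the plugin itself selects URLs this way is not stated in the README, and `pick_mcp_url` and `tcp_reachable` are hypothetical names.

```python
import socket
from typing import Callable, Mapping, Optional
from urllib.parse import urlparse


def tcp_reachable(url: str, timeout: float = 2.0) -> bool:
    """Return True if a TCP connection to the URL's host and port succeeds."""
    parsed = urlparse(url)
    port = parsed.port or (443 if parsed.scheme == "https" else 80)
    try:
        with socket.create_connection((parsed.hostname, port), timeout=timeout):
            return True
    except OSError:
        # Covers DNS failures, refused connections, and timeouts.
        return False


def pick_mcp_url(
    env: Mapping[str, str],
    probe: Callable[[str], bool] = tcp_reachable,
) -> Optional[str]:
    """Prefer the LAN URL, fall back to the Tailscale URL, else None."""
    for key in ("SPARK_MCP_URL", "SPARK_MCP_URL_TAILSCALE"):
        url = env.get(key)
        if url and probe(url):
            return url
    return None
```

The `probe` parameter is injectable so the selection logic can be exercised without network access.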

Spark-side `.env` (deployed to `~/dgx-agentskills/.env`)

| Variable | Description | Default |
| --- | --- | --- |
| `MCP_PORT` | MCP server port | `3100` |
| `OLLAMA_HOST` | Ollama API address | `localhost:11434` |
| `VLLM_IMAGE` | vLLM container image | `nvcr.io/nvidia/vllm:latest` |
| `VLLM_PORT` | vLLM serving port | `8000` |
| `VLLM_GPU_MEMORY_UTILIZATION` | GPU memory fraction for vLLM | `0.7` |
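To make the defaults above concrete, here is a small sketch of how a `.env` file could be merged over them. This is not the MCP server's actual loader, which the README does not show; `SPARK_DEFAULTS` and `load_env` are illustrative names, and the parser handles only plain `KEY=VALUE` lines and `#` comments.

```python
# Defaults copied from the Spark-side table above.
SPARK_DEFAULTS = {
    "MCP_PORT": "3100",
    "OLLAMA_HOST": "localhost:11434",
    "VLLM_IMAGE": "nvcr.io/nvidia/vllm:latest",
    "VLLM_PORT": "8000",
    "VLLM_GPU_MEMORY_UTILIZATION": "0.7",
}


def load_env(text: str, defaults: dict = SPARK_DEFAULTS) -> dict:
    """Parse KEY=VALUE lines and overlay them on the documented defaults."""
    settings = dict(defaults)  # copy so the defaults are never mutated
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue  # skip blanks, comments, and malformed lines
        key, _, value = line.partition("=")
        settings[key.strip()] = value.strip().strip('"').strip("'")
    return settings
```

Any variable missing from the file keeps its documented default, so a `.env` that only overrides `VLLM_PORT` still yields a complete configuration.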

Architecture

Mac (Claude Code)
  │
  ├── Plugin (skills, commands, hooks)
  │     └── .mcp.json → HTTP → DGX Spark MCP Server (:3100)
  │
  └── Claude Code session
        └── ANTHROPIC_BASE_URL → DGX Spark vLLM (:8000)

DGX Spark (your-spark.local)
  ├── MCP Server (Docker container, port 3100)
  │     ├── nvidia-smi (GPU metrics)
  │     ├── docker CLI (container management)
  │     ├── ollama CLI (model management)
  │     └── tailscale CLI (VPN status)
  ├── Ollama (host, port 11434)
  ├── vLLM (Docker container, port 8000)
  └── Tailscale (mesh VPN)
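
The diagram's second path, where a Claude Code session reaches vLLM through `ANTHROPIC_BASE_URL`, can be illustrated as an environment override. The sketch below only constructs the environment for a child process. It is an assumption about what switching backends amounts to, not a description of how `/spark-switch` is implemented, and `local_backend_env` is a hypothetical name.

```python
import os
from typing import Mapping, Optional


def local_backend_env(
    spark_host: str,
    vllm_port: int = 8000,
    base: Optional[Mapping[str, str]] = None,
) -> dict:
    """Return a copy of the environment with ANTHROPIC_BASE_URL pointed at vLLM."""
    env = dict(os.environ if base is None else base)
    # Port 8000 matches the vLLM port shown in the diagram above.
    env["ANTHROPIC_BASE_URL"] = f"http://{spark_host}:{vllm_port}"
    return env
```

The resulting mapping could be passed as `env=` to `subprocess.run` when launching a session; leaving the variable unset keeps the default Anthropic API endpoint.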

Development

# Bootstrap dev environment
./scripts/setup-dev.sh

# Run tests
cd mcp-server && npm test

# Run linting
./scripts/lint.sh
