By NVIDIA-TAO
NVIDIA TAO skill bank with generated capability discovery for model training, data processing, workflow orchestration, AutoML, and platform execution.
Performs deep Root Cause Analysis (RCA) on NVIDIA TAO Visual ChangeNet classification experiments with image-evidence-driven investigation. Use when analyzing ChangeNet model failures, investigating poor recall / FAR / PASS-NO_PASS metrics, auditing visual inspection pipeline quality, or running an RCA report for an AOI defect-detection model. Trigger phrases include "RCA on my ChangeNet model", "why is my AOI model failing", "audit ChangeNet predictions", "investigate FAR regressions", "root cause analysis on visual-changenet".
Fine-tune any HuggingFace CV / VLM / LLM model on local NVIDIA GPUs inside an NGC PyTorch container. Use when the user wants to fine-tune a HuggingFace model (full or LoRA), train a vision / VLM / LLM model end-to-end, generate a reproducible HF training pipeline, smoke-test a HuggingFace model locally before scale-up, push a fine-tuned model to the HF Hub with a model card, or emit a self-contained rerun skill for an existing HuggingFace finetune. Supports image classification, object detection, semantic / instance / panoptic segmentation, depth estimation, image-text-to-text VLM (SFT / LoRA), and LLM SFT / DPO / GRPO. Six-step workflow: inspect and qualify, hardware and NGC image, research, generate and smoke, train + eval + infer, push and emit rerun skill.
Integrate a HuggingFace Computer Vision model into the NVIDIA TAO Toolkit ecosystem (tao-core config, tao-pytorch trainer, tao-deploy TensorRT pipeline). Use when the user asks to "integrate a HuggingFace model into TAO", "add an HF model to TAO Toolkit", "wire a HuggingFace ViT/DETR/ SegFormer into tao-pytorch", "build a TAO trainer + deploy pipeline for an HF CV model", or pastes a HuggingFace model URL/ID and wants it turned into a TAO model. Covers the full 7-phase loop: prerequisites check, HuggingFace inspection and validation, codebase exploration, tao-core configuration and native trainer implementation, ONNX export plus TensorRT deploy integration, packaging and L0 testing, container-based end-to-end validation, and (conditional) accuracy/latency tuning. Supports classification, object detection, semantic / instance / panoptic segmentation, zero-shot detection, and depth estimation.
Run the canonical NVIDIA AOI three-phase training pipeline — Phase 1 AutoML baseline (HPO), Phase 2 DEFT loop (RCA → SDG → mining → plain-train retrain), Phase 3 AutoML refinement on the DEFT-augmented dataset. This is the default entry point for any "run the AOI workflow", "fine-tune my PCB AOI model end-to-end", "improve my AOI ChangeNet model", or "AOI workflow with AutoML" request — route here instead of tao-run-deft-aoi directly unless the user explicitly asks for the DEFT loop ONLY (e.g. "run JUST the DEFT loop", "skip AutoML, only DEFT"). Also handles the same three-phase pattern for non-AOI DEFT applications — AutoML baseline then DEFT loop warm-started from AutoML's winning HPs then post-DEFT AutoML refinement on the iteration-augmented dataset. Trigger phrases include "run the AOI workflow", "AOI end-to-end", "AutoML + DEFT", "AutoML then DEFT", "tune hyperparameters then DEFT", "DEFT with AutoML at both ends", "warm-start DEFT", "improve my AOI model".
Run AutoML / hyperparameter optimization (HPO) for NVIDIA TAO networks using AutoMLRunner. Handles algorithm selection (bayesian, hyperband, asha, bohb, llm, hybrid, autoresearch), WandB experiment tracking, job execution on any TAO SDK platform, result interpretation, and per-rec custom evaluation hooks. Use when the user mentions TAO AutoML, hyperparameter optimization, HPO, automl, automl_settings, AutoMLRunner, tao_automl, bayesian search, hyperband, ASHA, LLM-guided search, autoresearch, or wants to tune training hyperparameters for any TAO network. Platform-agnostic — runs on any SDK (Lepton, Brev, SLURM, Kubernetes, Docker).
Own this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimOwn this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimBased on adoption, maintenance, documentation, and repository signals. Not a security audit or endorsement.
Portable agent skills for training, evaluating, and running inference on NVIDIA TAO models. Works with Claude Code, Codex, Gemini CLI, or any coding agent that speaks the Agent Skills open standard. Zero Python required for local docker workflows — install the plugin, install docker + nvidia-container-toolkit, and an agent can run every skill by constructing docker run commands directly. For advanced features (job tracking, multi-node, Lepton access, S3 I/O wrapping), an optional Python layer — the TAO Execution SDK — sits on top.
The skill bank works with both Claude Code and Codex. Pick the runtime you use.
In a Claude Code session, add the marketplace and install the plugin:
/plugin marketplace add [email protected]:NVIDIA-TAO/tao-skills-bank.git
/plugin install tao-skills@tao-skill-bank
That's it — no git clone, no pip install. The tao-skills plugin bundles all 56 skills (every model, data, platform, and application). The plugin's SessionStart hook loads the AGENTS.md identity at the start of every session.
Codex setup has two independent pieces — the plugin (which surfaces the skills to Codex) and AGENTS.md (which loads the agent identity). You need both for parity with Claude Code.
Option A — VS Code Codex extension (recommended for VS Code users). Open the extension's plugin UI, add the marketplace URL, and install tao-skill-bank — all from the UI. Most discoverable, one click.
Option B — CLI + TUI. Add the marketplace from the shell, then install the plugin from inside the Codex TUI (no CLI install subcommand exists yet — openai/codex#17431):
codex plugin marketplace add [email protected]:NVIDIA-TAO/tao-skills-bank.git
codex # opens TUI
/plugins # then: select tao-skill-bank → Install plugin
Either path installs the bundle to ~/.codex/plugins/cache/<marketplace>/tao-skill-bank/<version>/ (the <marketplace> segment comes from the name field in .agents/plugins/marketplace.json).
AGENTS.md)The plugin install does not auto-load AGENTS.md — Codex's AGENTS.md discovery walks down from the project root, not into the plugin cache (see openai/codex#16430 for why plugin-bundled SessionStart hooks don't fix this yet). Pick one:
git clone this repo and launch codex from inside the clone. Codex auto-loads AGENTS.md from the project root per the agents.md cross-runtime spec.cp ~/.codex/plugins/cache/<marketplace>/tao-skill-bank/<version>/AGENTS.md ~/.codex/AGENTS.md. The identity then loads in every Codex session, anywhere.Once Codex starts honoring plugin-bundled hooks, the identity will install automatically alongside the plugin — until then, this manual step is needed.
On first session start, the plugin looks for ~/.config/tao/.env and auto-loads it. To set up:
mkdir -p ~/.config/tao
cp "${CLAUDE_PLUGIN_ROOT}/.env.example" ~/.config/tao/.env # template ships in the plugin
# Edit ~/.config/tao/.env and fill in NGC_KEY, LEPTON_*, S3 keys, etc.
The .env.example is also at the repo root for direct reference. The agent never reads credential values — it only checks presence.
The TAO SDK is opt-in and installed lazily. Most skills (any model or data skill) run with just docker run and need no Python. Only skills/platform/tao-run-on-lepton (tao-run-on-lepton), skills/platform/tao-run-platform (tao-run-platform), the managed-platform skills (slurm/kubernetes/docker), and skills/applications/tao-run-automl (tao-run-automl) require the SDK; their Preflight blocks tell the agent to pip install the right extra the first time the skill is invoked. The SDK is on public PyPI; the exact pinned version lives in versions.yaml and each Preflight resolves it via scripts/resolve_versions_key.py.
Claude Code:
/plugin marketplace update tao-skill-bank
/reload-plugins
If skills look stale (cached contents):
rm -rf ~/.claude/plugins/cache/tao-skill-bank
then re-run /plugin install.
Codex:
codex plugin marketplace upgrade tao-skill-bank
If you copied AGENTS.md to ~/.codex/AGENTS.md, re-copy from the upgraded plugin cache to pick up identity changes.
The quickest way to verify your setup: run a Visual ChangeNet inference on a sample image.
npx claudepluginhub nvidia-tao/tao-skills-bank --plugin tao-skillsA growing collection of Claude-compatible academic workflow bundles. Covers scientific figures, manuscript writing and polishing, reviewer assessment, citation retrieval, data availability, paper reading, literature search, response letters, paper-to-PPTX conversion, and evidence-grounded Chinese invention patent drafting. Rules are organized as reusable skill folders with explicit workflows and quality checks.
Persistent file-based planning for AI coding agents. Crash-proof markdown plans (task_plan.md, findings.md, progress.md) that survive context loss and /clear, with an opt-in completion gate and multi-agent shared state. Manus-style. Works with Claude Code, Codex CLI, Cursor, Kiro, OpenCode and 60+ agents via the SKILL.md standard. Includes Arabic, German, Spanish, and Chinese (Simplified and Traditional).
Core skills library for Claude Code: TDD, debugging, collaboration patterns, and proven techniques
Harness-native ECC operator layer - 67 agents, 271 skills, 92 legacy command shims, reusable hooks, rules, selective install profiles, and production-ready workflows for Claude Code, Codex, OpenCode, Cursor, and related agent harnesses
Superpowers Plus core skills library for Claude Code: planning, execution routing, TDD, debugging, and collaboration workflows