Skill

nvidia-tensorrt-llm-deployment-review

Static review of TensorRT/TensorRT-LLM deployment pipelines: ONNX/PyTorch export, precision selection, calibration cache, dynamic shapes, plugin loading, engine provenance, and runtime memory sizing.

PyTorch

ai-ml

performance

Invocation

How this skill is triggered — by the user, by Claude, or both

Slash command

/vanguard-frontier-agentic:nvidia-tensorrt-llm-deployment-review

User invocable

Model invocable

Inline context

Default effort

Tool Access

This skill is limited to the following tools:

Read, Grep, Glob

Context Preview

The summary Claude sees in its skill listing — used to decide when to auto-load this skill

Static review of TensorRT and TensorRT-LLM deployment pipelines against NVIDIA's TensorRT Developer Guide — ONNX/PyTorch export, FP16/INT8/FP8/INT4 precision, calibration data integrity, dynamic shape profiles, plugin trust boundaries, engine cache provenance. This skill is doc-anchored: it grounds review findings in NVIDIA's published documentation rather than in a certification blueprint, bec...

Supporting Files

metadata.json

SKILL.md

36 lines · ~744 tokens

Stats

Language: Python

Stars: 18

Forks: 2

Maintenance: Excellent

Last Commit: Jun 15, 2026
