Plugins listed here are tagged for this technology stack and auto-indexed from public GitHub repositories.
Plugins listed here are tagged for this technology stack and auto-indexed from public GitHub repositories.
Claude Code plugins tagged for PyTorch development. Browse commands, agents, skills, and more.
Manage the full ML lifecycle on Hugging Face Hub: search and select models, train or fine-tune with TRL/Unsloth, evaluate locally, build and deploy Gradio demos on Spaces, publish research papers, and monitor training metrics — all from the command line or agent.
Look up Python code examples and enforce Pythonic style — fetch syntax, concurrency, ML, and HPC references from pythonsheets.com while writing, debugging, or optimizing code, and get linting guidance for readable, idiomatic Python.
Work with Mooncake Python APIs to perform distributed storage operations, RDMA/TCP data transfers, and PyTorch tensor processing.
Autonomously optimize LLM serving infrastructure — profile torch traces, benchmark SGLang/vLLM/TensorRT-LLM, simulate capacity and compute, and run RLCR loops that patch code to match or beat competitor performance. Also includes human-like PR review and incident triage for production serving.
Build, train, and deploy AI models on AWS SageMaker with deep ML expertise: validate datasets, fine-tune models (SFT, DPO, RLVR), generate Jupyter notebooks, evaluate model quality, and diagnose HyperPod cluster issues (NCCL, GPU, Slurm, EKS) — all from your coding assistant.
Implement and review nearly every Apple Kit framework across iOS, macOS, watchOS, and visionOS — from AccessorySetupKit and WidgetKit to CarPlay, covering AR, HealthKit, StoreKit, CloudKit, SwiftData, Core ML, and system-level capabilities like networking, security, testing, performance profiling, and accessibility.
Run Claude Code workflows across software lifecycle phases: architect, review, test, deploy, and document with 70+ modular skills for code quality, security auditing, automated releases, manuscript preparation, system diagrams, and agent orchestration.
Provides 197 computational skills for scientific AI agents to perform life sciences research, covering genomics, proteomics, drug discovery, medical imaging, biostatistics, and scientific writing via integrations with databases, analysis tools, and ML frameworks.
Author, optimize, and deploy PyTorch models for on-device execution on Apple silicon using Core AI. Covers op compatibility rules, weight quantization/palettization for accuracy-size tradeoffs, and the full export-compile-run pipeline on Neural Engine and GPU.
Configure Claude Code agents with architectural principles, safety hooks, and skills for multi-session coordination, code review, pixel art generation, video production, and AI model engineering.
Automates the full lifecycle of migrating and optimizing AI models for Huawei Ascend NPUs: environment setup, code analysis and adaptation, operator development (AscendC/Triton), distributed training with MindSpeed/Megatron, performance profiling and tuning, precision verification, and deployment as vLLM inference services.
Automates the full academic research workflow: literature search, data processing, idea validation, experimental design, paper drafting and polishing, publication-ready figure generation, peer review simulation, rebuttal crafting, compliance auditing, patent/software registration, and presentation creation, with integrity checks and multi-format typesetting.
Draft, analyze, and file USPTO, EPO, and PCT patent applications from invention disclosures. Conduct prior art searches across 100M+ patents via BigQuery, assess patentability and compliance (35 USC, EPC, MPEP), generate patent-style technical diagrams, and prepare IDS documents—all within Claude Code.
Run LLM post-training workflows including SFT, OSFT, LoRA fine-tuning, and GRPO reinforcement learning through a unified interface with automatic GPU memory estimation and environment setup.
Automate end-to-end academic research: write scientific papers with LaTeX/Markdown, search and cite literature, analyze data with Python libraries (pandas, PyTorch), run bioinformatics pipelines, generate publication-quality figures, create posters and presentations, and manage citations and references. Includes tools for grant writing, peer review, and clinical decision support.
Automate end-to-end ML performance investigations: research SOTA papers and architectures, generate phased plans, judge experimental methodologies, profile bottlenecks, run metric-improvement campaigns with atomic git commits, auto-rollback on regressions, and leverage specialist agents for data lifecycle and deep paper analysis.
Orchestrate an end-to-end academic research workflow inside Claude Code: from literature search and citation verification, through figure design and code-backed implementation, to manuscript drafting, revision, and rebuttal letter assembly — all coordinated by a supervisor agent that tracks bottlenecks and safety gates.
Run a structured empirical research pipeline for ML/AI claims: transform ideas into falsifiable hypotheses, preregister experiments, reproduce baselines, execute studies, run adversarial falsification, apply statistical rigor, and force a kill-or-ship decision using repository evidence.
Develop, review, and deploy Go projects with conventions for architecture, testing, and git workflow, while also building interactive web UIs with Datastar, performing security reviews, fine-tuning AI models, and maintaining living documentation and experimental optimization loops.
Guardrail your AI/ML research workflow with an AI collaborator that searches literature using query variations, analyzes codebases and logs, designs minimal falsification experiments, records predictions, and audits bugs.
Automate multi-chip GPU AI inference workflows on the FlagOS platform: kernel generation and review, model migration from upstream vLLM, containerized stack installation and environment verification, and end-to-end performance benchmarking across NVIDIA, AMD, Ascend, and other hardware backends.
Accelerate GPU kernel development with an integrated workflow: query a knowledge base of CUDA, Triton, and CUTLASS patterns, benchmark custom kernels against PyTorch baselines, profile with Nsight Compute, and run iterative optimization loops with correctness checks
Bootstrap Claude Code with 17 specialized agents, skills, and hooks to audit/evolve .claude/ configs, engineer/refactor Python code via TDD, profile/optimize ML workloads, generate docs/tests, design systems, diagnose issues, and manage workflows professionally.
Extend video diffusion models with LVSA (Long Video Sparse Attention) support by implementing a ModelAdapter for geometry, QKV extraction, RoPE, and output projection in single-stream, dual-stream, or joint-attention DiTs.
Installs LVSA and generates long videos with block-sparse attention, automatically selecting SDPA vs FlashInfer backend and configuring reference latent frames per model while verifying sparse path engagement.
Diagnose NVIDIA LongVidio Sparse Attention (LVSA) failures: identify silent dense fallback, out-of-memory at long sequences, missing MP4 outputs in Docker, quality regressions from training references, and environment variable misconfigurations.
Delegate expert-level AI/ML workflows to specialized agents: engineer optimized prompts with evaluation and A/B testing, architect scalable LLM systems with RAG/LoRA fine-tuning, build production NLP pipelines for NER/classification/QA, and deploy optimized models via vLLM/Triton/Docker/K8s for reliability, performance, and cost control.
Run an autonomous optimize-measure-keep/discard loop on any metric (LLM loss, test speed, bundle size) powered by git: edit code, benchmark, auto-revert regressions, repeat until target is met.
Prefix terminal commands with 'gpu' to run ML training, LLM inference, ComfyUI workflows, and media processing on remote NVIDIA GPUs (A100, H100, RTX 4090) from your Mac. Automatically provisions pods, syncs files bidirectionally, streams logs, debugs interactively, selects optimal GPUs, and optimizes costs.
Streamline end-to-end data science and ML workflows: frame business problems into ML tasks, preprocess and validate data with quality checks, perform EDA on diverse formats, design and execute experiments with hyperparameter tuning via Optuna and interpretability via SHAP, audit reproducibility and leakage, evaluate model performance and readiness for deployment, generate model cards, and extract structured learnings into docs.
Turn Claude Code into a professional media production workstation: transcode, stream, package, QC, color-grade, and deliver video/audio across broadcast, OTT, and AI-enhanced pipelines using FFmpeg, OBS, GStreamer, WebRTC, and 90+ open-source media tools.
Equip AI agents with 9 engineering skills to architect scalable backends and distributed systems, secure apps and pipelines, prototype MVPs, build mobile and ML apps, guide frontend development, automate DevOps infrastructure, and plan senior-level software delivery.
Train and run inference on machine learning models using Hugging Face Transformers and PEFT with PyTorch on cloud GPUs from Modal, Lambda Labs, or RunPod—no local GPU required.
Trace PyTorch operator implementations across Python, C++, and CUDA layers, analyze nn.Module-to-native binding chains, map code changes to affected tests, and query dispatch mechanisms.
Apply 97 structured reasoning patterns from history's greatest thinkers to any problem — debug, design, research, or write — using specialized agents that analyze, critique, and synthesize across domains.
Write idiomatic MLX code for machine learning on Apple Silicon, implementing arrays, neural networks, training loops, lazy evaluation, unified memory, Metal GPU acceleration, and PyTorch migrations.