Plugins listed here are tagged for this technology stack and auto-indexed from public GitHub repositories.
Plugins listed here are tagged for this technology stack and auto-indexed from public GitHub repositories.
Claude Code plugins tagged for Hugging Face development. Browse commands, agents, skills, and more.
Build and deploy production-grade LLM applications with LangGraph for agent orchestration, advanced RAG pipelines leveraging vector and hybrid search, prompt engineering patterns, and automated evaluation. Covers embedding model selection, vector index optimization, and multi-agent architectures for document Q&A, chatbots, and semantic search over proprietary data.
Manage the full ML lifecycle on Hugging Face Hub: search and select models, train or fine-tune with TRL/Unsloth, evaluate locally, build and deploy Gradio demos on Spaces, publish research papers, and monitor training metrics — all from the command line or agent.
Optimize Python deep learning models using Adam, SGD optimizers, learning rate schedulers, and regularization to improve accuracy and reduce training time. Generate production-ready AI/ML code from context analysis, including validation, error handling, performance metrics, insights, artifacts, and documentation.
Fine-tune pre-trained ML models like ResNet, BERT, and GPT on custom datasets using transfer learning. Generates production-ready Python code with validation, error handling, performance metrics, documentation, and saves artifacts for deployment.
Build recommendation engines by generating Python code for collaborative, content-based, or hybrid filtering using scikit-learn, TensorFlow, or PyTorch to personalize movies, products, or content. Analyze context to produce complete AI/ML tasks with validation, error handling, performance metrics, insights, artifacts, and documentation.
Explain machine learning model predictions using SHAP, LIME, and feature importance to identify influential features and debug behavior. Generate production-ready AI/ML code from context, including validation, error handling, performance metrics, insights, artifacts, and documentation.
Deploy, debug, optimize, monitor, and secure GPU-accelerated ML inference and training workloads on CoreWeave Kubernetes clusters, including cost tuning, data handling, migrations from AWS/GCP/Azure, CI/CD automation, and production checklists.
Evaluate single ML models or compare multiple ones on test datasets across classification, regression, NLP, and generative tasks. Compute metrics, statistical significance, inference performance, costs, robustness, bias checks; generate visualized reports with confusion matrices, performance profiles, tables, rankings, and recommendations.
Rapidly implement production-ready AI/ML features in apps: integrate LLMs with prompt engineering and response handling, build ML pipelines for recommendation systems, add computer vision for visual search, and enable intelligent automation using OpenAI, Anthropic, LangChain, Hugging Face, or Ollama.
Store, search, and recall persistent cross-session memories for Claude Code using Ebbinghaus decay, enabling the AI to retain user facts, preferences, and project decisions across conversations.
Rapidly implement production-ready AI/ML features in apps: integrate LLMs via prompt engineering and response handling, build ML pipelines for user behavior-based recommendations, add computer vision for photo-based product search, and deploy intelligent automations.
Delegate image analysis, OCR text extraction, barcode/QR detection, and document processing to a vision expert agent using latest models like GPT-4V, Claude Vision, Mistral-OCR, Tesseract, and EasyOCR for efficient visual AI workflows.
Build RAG pipelines for document Q&A and chatbots by chunking large docs, generating embeddings, storing in vector DBs, and retrieving context to reduce hallucinations. Engineer and optimize LLM prompts using chain-of-thought, few-shot examples, constitutional AI, meta-prompting, and validation workflows.
Rapidly implement production-ready AI/ML features in apps, including LLM integrations with prompt engineering, ML pipelines for recommendations, computer vision for visual search, and intelligent automation, using a specialized agent.
Delegate advanced image analysis workflows to expert vision AI subagents that perform OCR with Tesseract/EasyOCR, barcode/QR detection, document processing, and optimization using cutting-edge models like GPT-4V, Claude Vision, and Mistral-OCR.
Configure Claude Code agents with architectural principles, safety hooks, and skills for multi-session coordination, code review, pixel art generation, video production, and AI model engineering.
Automates the full lifecycle of migrating and optimizing AI models for Huawei Ascend NPUs: environment setup, code analysis and adaptation, operator development (AscendC/Triton), distributed training with MindSpeed/Megatron, performance profiling and tuning, precision verification, and deployment as vLLM inference services.
Delegate end-to-end ML engineering workflows to specialized agents that construct data preparation and training pipelines with feature engineering and hyperparameter tuning, optimize inference through quantization, pruning, batching, and edge deployment, and manage MLOps for model versioning, monitoring, A/B testing, and production orchestration.
Consult a virtual CTO team of specialized agents to design scalable architectures, generate phased roadmaps with effort estimates, recommend tech stacks, validate plans with ruthless reports identifying flaws and risks, challenge assumptions, estimate costs, and decide build-vs-buy for web, mobile, or AI/ML projects.
Deploy vLLM OpenAI-compatible inference servers locally with hardware detection, via Docker images, or Kubernetes YAML manifests with GPU support, then benchmark throughput, TTFT, TPOT, inter-token latency, and prefix caching using synthetic data, ShareGPT, or fixed prompts.
Diagnose and fix ML training failures (OOM, NaN, divergence), generate citation-grounded implementation plans for fine-tuning and inference pipelines, and verify code/configs against official framework docs before running GPU jobs.
Run GGUF models locally with Mozilla Llamafile, launching OpenAI-compatible API servers configurable for GPU/CPU inference, SDK integrations, installations, startups, and connection troubleshooting in offline setups.
Find, compare, run, and prompt AI models hosted on Replicate directly from the editor, including building custom models with Cog, searching by task, and deploying via CI/CD
Generate images and videos inside Claude Code by building, running, and debugging ComfyUI workflows for models like Flux, WAN, Qwen, and LTX-V2. Manage custom nodes, sweep parameters, diff workflows, and orchestrate multi-step pipelines.
Build high-quality computer vision datasets and models with FiftyOne: import/export datasets, run inference, evaluate predictions, find duplicates, visualize embeddings, and develop custom plugins — all from within Claude.
Routes OpenAI-compatible clients through FreeRide, a local gateway that distributes inference requests across free-tier AI providers (OpenRouter, Groq, NVIDIA NIM, Cloudflare Workers AI, HuggingFace). Detects a running FreeRide instance and wires any OpenAI-shaped client against it for cost-free model access.
Automate end-to-end ML performance investigations: research SOTA papers and architectures, generate phased plans, judge experimental methodologies, profile bottlenecks, run metric-improvement campaigns with atomic git commits, auto-rollback on regressions, and leverage specialist agents for data lifecycle and deep paper analysis.
Run a structured empirical research pipeline for ML/AI claims: transform ideas into falsifiable hypotheses, preregister experiments, reproduce baselines, execute studies, run adversarial falsification, apply statistical rigor, and force a kill-or-ship decision using repository evidence.
Automate image processing, OCR, barcode/QR detection, and document analysis with expert vision AI. Proactively select optimal models like GPT-4V, Claude Vision, or Mistral-OCR; engineer prompts, preprocess images, benchmark performance, and integrate APIs from OpenAI, Anthropic, and Hugging Face.
Build full-stack dApps on Ritual Chain using skills and agents that generate Solidity smart contracts with async precompiles for on-chain ML inference, HTTP/LLM calls, scheduling, secrets, X402 payments; create React/Next.js frontends with wagmi hooks; set up backends, testing, debugging, and deployment workflows from idea to verification.
Build production-grade LLM apps in Python: implement RAG pipelines with embeddings and hybrid search, design LangChain/LangGraph agents, optimize prompts, tune vector indexes, and evaluate performance using AI agents, skills, and commands for architecture, code gen, and benchmarking.
Transcribe handwritten historical Swedish documents using HTR tools with interactive viewers and JSON exports, search Riksarkivet archives via metadata and transcriptions, browse and zoom PDFs/IIIF images, annotate pages in Label Studio, map topics to record types, and upload files to Hugging Face Spaces for remote access.
Index PDFs, markdown, and source code into dual Qdrant (semantic) and MeiliSearch (full-text) indexes via the arc CLI, then search across corpora using either vector or keyword queries with AST-aware chunking and git metadata.
Orchestrate a polyglot extreme programming agent framework that manages cloud deployments with AWS, enforces DRY code quality via duplication detection, configures per-project environment secrets with direnv, scrapes web content, and coordinates agent swarms for task decomposition and release gating
Reproduce Long Video Sparse Attention (LVSA) paper headline numbers using bundled benchmarks scripts, including SotA comparison, latency scaling, and scoring with VQeval and VBench-Long, plus figure regeneration.
Build dynamic neural networks that grow, prune, and adapt topology during training in Python with Hugging Face. Design state machines for module lifecycles, diagnose issues in growth decisions, gradient isolation, and integration, plus consult an expert advisor for continual learning, PEFT/LoRA, and modular composition.
Delegate expert-level AI/ML workflows to specialized agents: engineer optimized prompts with evaluation and A/B testing, architect scalable LLM systems with RAG/LoRA fine-tuning, build production NLP pipelines for NER/classification/QA, and deploy optimized models via vLLM/Triton/Docker/K8s for reliability, performance, and cost control.
Train and run inference on machine learning models using Hugging Face Transformers and PEFT with PyTorch on cloud GPUs from Modal, Lambda Labs, or RunPod—no local GPU required.
Discover, evaluate (quality, licensing, provenance), and acquire datasets from Kaggle, HuggingFace, IPFS, arXiv, DBLP for AI model training and fine-tuning. Download free or paid data via local stdio MCP server—no API keys needed.
Build and manage robotic dataflow applications with dora-rs: develop nodes in Rust and Python, configure YAML pipelines, integrate ML, vision, audio processing, robot control, and data collection for imitation learning, with AI agents for generating and debugging dataflows.