Plugins listed here are tagged for this technology stack and auto-indexed from public GitHub repositories.
Plugins listed here are tagged for this technology stack and auto-indexed from public GitHub repositories.
Claude Code plugins tagged for Ollama development. Browse commands, agents, skills, and more.
Build and deploy production-grade LLM applications with LangGraph for agent orchestration, advanced RAG pipelines leveraging vector and hybrid search, prompt engineering patterns, and automated evaluation. Covers embedding model selection, vector index optimization, and multi-agent architectures for document Q&A, chatbots, and semantic search over proprietary data.
Optimize Python deep learning models using Adam, SGD optimizers, learning rate schedulers, and regularization to improve accuracy and reduce training time. Generate production-ready AI/ML code from context analysis, including validation, error handling, performance metrics, insights, artifacts, and documentation.
Explain machine learning model predictions using SHAP, LIME, and feature importance to identify influential features and debug behavior. Generate production-ready AI/ML code from context, including validation, error handling, performance metrics, insights, artifacts, and documentation.
Explore and analyze codebases using semantic search, dependency graphs, and context artifacts to understand architecture, find functions and types, trace features, and inspect schemas or specs.
Generate test fixtures that mock LLM responses, tool calls, errors, multi-turn loops, embeddings, and structured output across multiple AI providers for use with Copilot Kit's aimock library.
Evaluate single ML models or compare multiple ones on test datasets across classification, regression, NLP, and generative tasks. Compute metrics, statistical significance, inference performance, costs, robustness, bias checks; generate visualized reports with confusion matrices, performance profiles, tables, rankings, and recommendations.
Rapidly implement production-ready AI/ML features in apps: integrate LLMs with prompt engineering and response handling, build ML pipelines for recommendation systems, add computer vision for visual search, and enable intelligent automation using OpenAI, Anthropic, LangChain, Hugging Face, or Ollama.
Rapidly implement production-ready AI/ML features in apps: integrate LLMs via prompt engineering and response handling, build ML pipelines for user behavior-based recommendations, add computer vision for photo-based product search, and deploy intelligent automations.
Rapidly implement production-ready AI/ML features in apps, including LLM integrations with prompt engineering, ML pipelines for recommendations, computer vision for visual search, and intelligent automation, using a specialized agent.
Query, search, manage, and explore Weaviate vector databases using natural language, semantic/hybrid/keyword methods; import data from PDF/CSV/JSON files; generate example datasets; get AI answers with source citations; bootstrap clusters—all integrated into your coding workflow for rapid AI app prototyping.
Hardens AI coding agent sessions with quality gates, security guards, and continuous QA — blocks dangerous operations, enforces structured development modes, and scores session work with handoff artifacts.
Index project documentation, codebases, and knowledge graphs for hybrid retrieval: BM25 keywords, semantic similarity, GraphRAG relationships, or fused multi-mode search. Retrieve cited chunks with scores to research dependencies, errors, and concepts in seconds using Ollama, OpenAI, or Anthropic.
Build provider-agnostic, type-safe streaming LLM chats with tools, agent loops, and multimodal support directly in React and Next.js apps using hooks like useChat, compatible with OpenAI, Anthropic, Gemini, Ollama.
Orchestrate self-hosted AI agents via Station CLI to create agents, run tasks with 55+ MCP tools, manage environments, deploy teams, configure providers and backends through browser UI or CLI, and monitor executions with OpenTelemetry telemetry.
Delegate end-to-end ML engineering workflows to specialized agents that construct data preparation and training pipelines with feature engineering and hyperparameter tuning, optimize inference through quantization, pruning, batching, and edge deployment, and manage MLOps for model versioning, monitoring, A/B testing, and production orchestration.
Intelligently route AI tasks across 20+ LLM providers (OpenAI, Anthropic, Gemini, Ollama, etc.) using complexity-first model selection to minimize costs while preserving output quality. Automatically classify prompts and dispatch to the cheapest capable model, track cross-session savings, and receive weekly cost digests via Slack/Discord.
Run multi-agent AI collaboration workflows in Claude via TeamMCP: connect agents to shared channels, DMs, tasks, project state, and inbox; automate standups, reviews, state updates, context search; bridge to WeChat; route to other AI providers like Ollama.
Unify Python LLM API calls across 100+ providers like OpenAI, Anthropic, Ollama, and llamafile servers using OpenAI format, with built-in retries, fallbacks, exception handling, and cost tracking.
Semantically search codebases using natural language queries and dependency graphs, then perform tasks like root cause analysis, safe refactoring, PR review, codebase onboarding, and commit message generation with blast radius awareness.
Routes OpenAI-compatible clients through FreeRide, a local gateway that distributes inference requests across free-tier AI providers (OpenRouter, Groq, NVIDIA NIM, Cloudflare Workers AI, HuggingFace). Detects a running FreeRide instance and wires any OpenAI-shaped client against it for cost-free model access.
Delegate coding tasks to OpenAI Codex, Google Gemini, or Ollama models for generation, review, refactoring, and fixes; compare responses across LLMs; get proactive second opinions on architecture, design, and security; run iterative review loops with build/test verification until clean; generate AI images via Gemini.
Develop, review, and deploy Go projects with conventions for architecture, testing, and git workflow, while also building interactive web UIs with Datastar, performing security reviews, fine-tuning AI models, and maintaining living documentation and experimental optimization loops.
Deploy full Home Assistant smart home platforms on Ubuntu servers via Docker Compose, configure Lovelace dashboards, automations, energy monitoring, cameras, sensors, and local LLM voice assistants with Ollama; manage entities, troubleshoot issues, audit security, and optimize via natural language commands, YAML generation, and agentic workflows.
Search, read, and reason across markdown notes with hybrid search (FTS5 + semantic), backlinks, and context-aware reads, plus write, edit, and Git sync capabilities via a local MCP server with optional Ollama embeddings.
Install modular skills to add messaging channels (Discord, Slack, Telegram, WhatsApp, Gmail, Emacs), AI integrations (Ollama, Parallel, OpenAI Whisper), multimodal tools (image vision, voice transcription, PDF extraction), code review automation (Qodo), CLI access, menu bar status, and runtime switches to NanoClaw agents via guided git merges, setups, and configurations.
Build production-grade LLM apps in Python: implement RAG pipelines with embeddings and hybrid search, design LangChain/LangGraph agents, optimize prompts, tune vector indexes, and evaluate performance using AI agents, skills, and commands for architecture, code gen, and benchmarking.
Route complex development tasks to the optimal LLM model based on complexity and cost, automatically switching between OpenAI, Anthropic, Ollama, and 20+ providers to minimize Claude API spending while maintaining output quality.
Search, manage, and embed project documentation with zero-config RAG — ingest docs from files, git history, or web crawls, run semantic/keyword search, benchmark retrieval quality, and keep knowledge bases synced with code.
Equip Claude Code and compatible AI agents with biomimetic long-term memory via local MCP server. Recall relevant repo facts, user preferences, prior fixes, and architecture decisions before coding tasks; curate and store high-salience context during workflows; consolidate episodic data into compact semantic memory for persistent continuity across sessions.
Automate end-to-end network infrastructure delivery on Itential Platform using AI agents and CLI commands: start from use case specs or reverse-engineer projects to generate requirements, feasibility assessments, designs, builds via FlowAgent/IAG services/workflows, device/golden config management, compliance checks, and as-built documentation.
Enables Claude Code to collaborate with multiple AI backends (Codex, Gemini, Ollama, Claude) through structured Q&A rallies, second-opinion calls, and iterative refinement rounds. Developers can delegate tasks to backend agents, get feedback loops, and synthesize results across models.
Dispatch parallel AI sub-agents through CLI to automate code maintenance: fix lints, repair tests, handle migrations, refactor, and improve code quality. Operate in VM-isolated environments with tapes-driven learning from Claude, OpenAI, or Ollama models.
Swiss legal intelligence for precedent research, case strategy, document drafting, and citation verification across all 26 cantons, with multi-lingual support and attorney-client privilege protection.
Generate professional PPTX presentations with custom themes, bullet slides, images, tables, and shapes using python-pptx; create customizable QR codes in SVG, PNG, EPS, or PDF from URLs, text, WiFi, or contacts; extract clean markdown from PDFs by rendering pages to images and running local GLM-OCR via Ollama.
Iterative AI-driven code refinement workflows: submit code to ralph-o-matic for automated refinement loops, or orchestrate full development from idea to submission including brainstorming, planning, and parallel execution.
Provides a comprehensive collection of DSPy skills for building, optimizing, debugging, and deploying AI systems—covering signatures, modules, pipelines, retrieval-augmented generation, agents, evaluations, guardrails, monitoring, and deployment patterns
Set up a personal knowledge base with semantic search using PostgreSQL/pgvector and vector embeddings. Capture thoughts and retrieve them via semantic similarity, deployable with Docker or AWS RDS, and integrated with a local MCP server.
Streamlines Claude Code development with git automation, documentation audits, command optimization, and implementation plan tracking. Includes agents for verifying completion claims and auditing code quality against project requirements.
Coordinates a sovereign AI fleet across Claude Code, Cursor, and other CLIs with persistent memory, cross-repo context synchronization, multi-agent orchestration, and policy-enforced security—all running locally with forkable, auditable infrastructure.
Build production-ready LLM applications by delegating to expert AI agents that engineer prompts, manage dynamic contexts with vector DBs and knowledge graphs, optimize single and multi-agent performance, and orchestrate RAG, multimodal, and enterprise AI workflows.
Delegate tasks to subagents powered by external OpenAI-compatible chat endpoints or LM Studio native APIs. Manage endpoint aliases by saving, listing, and removing them, with built-in support for MCP tools in LM Studio.
Store, semantically search, and graph-link persistent project knowledge like decisions, gotchas, patterns, and learnings across Claude sessions. Capture memories from conversations or git changes, check recent work against stored warnings, create multi-agent thinking docs, and monitor memory health with agents and CLI commands.