By qdrant
Helps Qdrant users optimize and manage vector search deployments through tuning performance, diagnosing search quality, scaling clusters, deploying infrastructure, upgrading versions, and using client SDKs.
Qdrant provides client SDKs for various programming languages, allowing easy integration with Qdrant deployments.
Guides Qdrant deployment selection. Use when someone asks 'how to deploy Qdrant', 'Docker vs Cloud', 'local mode', 'embedded Qdrant', 'Qdrant EDGE', 'which deployment option', 'self-hosted vs cloud', or 'need lowest latency deployment'. Also use when choosing between deployment types for a new project.
Guides embedding model migration in Qdrant without downtime. Use when someone asks 'how to switch embedding models', 'how to migrate vectors', 'how to update to a new model', 'zero-downtime model change', 'how to re-embed my data', or 'can I use two models at once'. Also use when upgrading model dimensions, switching providers, or A/B testing models.
Guides Qdrant monitoring and observability setup. Use when someone asks 'how to monitor Qdrant', 'what metrics to track', 'is Qdrant healthy', 'optimizer stuck', 'why is memory growing', 'requests are slow', or needs to set up Prometheus, Grafana, or health checks. Also use when debugging production issues that require metric analysis.
Different techniques to optimize the performance of Qdrant, including indexing strategies, query optimization, and hardware considerations. Use when you want to improve the speed and efficiency of your Qdrant deployment.
Own this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimOwn this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimBased on adoption, maintenance, documentation, and repository signals. Not a security audit or endorsement.
Agent skills for building with Qdrant vector search
Skills encode deep Qdrant knowledge so coding agents can make the engineering decisions that determine whether vector search works well: quantization, sharding, tenant isolation, hybrid search, model migration, and more.
Skills are not documentation. Qdrant already has docs in markdown. Skills answer "when?" and "why?", not "how?"
They are structured as the handbook of a Solutions Architect working on Qdrant: given a problem, navigate to the exact place in the documentation where the answer lives. No tutorials, no concept explanations. Only references and minimal snippets where absolutely necessary.
These skills are under active development. Skill content and structure may change between versions as Qdrant evolves.
Skills are hosted at skills.qdrant.tech. Pass the URL of a skill directly to your agent — no installation required:
Use skills.qdrant.tech
This keeps your agent's context focused: it fetches only the skill relevant to your current problem rather than loading everything upfront.
If you prefer skills to be available offline or without passing URLs manually, you can install them locally. Refer to the Installation section.
The recommended way is the URL method: ask your agent about Qdrant and pass skills.qdrant.tech in your prompt.
"I have 50M vectors on a single node and search is slow, should I add more nodes? Use skills.qdrant.tech"
"My search results are returning irrelevant matches. Use skills.qdrant.tech"
If you use the installation method, just ask your agent about Qdrant. Skills are triggered automatically when your question matches their description.
"I have 50M vectors on a single node and search is slow, should I add more nodes?"
→ qdrant-scaling skill activates, recommends quantization and vertical scaling before adding nodes
"My search results are returning irrelevant matches"
→ qdrant-search-quality skill activates, walks through diagnosis and search strategy options
"How do I switch from OpenAI embeddings to Cohere without downtime?"
→ qdrant-model-migration skill activates, guides zero-downtime migration with dual vectors
| Skill | Useful for |
|---|---|
| qdrant-clients-sdk | SDK setup, code examples, snippet search across Python, TypeScript, Rust, Go, .NET, Java |
| qdrant-scaling | Scaling decisions: data volume, QPS, latency, query volume, horizontal vs vertical |
| qdrant-performance-optimization | Search speed, memory usage, indexing performance |
| qdrant-search-quality | Diagnosing bad results, search strategies, hybrid search |
| qdrant-monitoring | Metrics, health checks, debugging optimizer and cluster issues |
| qdrant-deployment-options | Choosing between local, self-hosted, cloud, and hybrid |
| qdrant-model-migration | Switching embedding models without downtime |
| qdrant-version-upgrade | Safe upgrade paths, compatibility guarantees, rolling upgrades |
Install using the npx skills CLI:
npx skills add qdrant/skills
Add the marketplace, then install all Qdrant skills:
/plugin marketplace add qdrant/skills
/plugin install qdrant@qdrant
Install from the Cursor Marketplace or add manually via Settings > Rules > Add Rule > Remote Rule (GitHub) with qdrant/skills.
Clone this repo and copy the skill folders into the appropriate directory for your agent:
| Agent | Skill Directory | Docs |
|---|---|---|
| Claude Code | ~/.claude/skills/ | docs |
| Cursor | .cursor/skills/ | docs |
| OpenCode | ~/.config/opencode/skill/ | docs |
| OpenAI Codex | ~/.codex/skills/ | docs |
| Pi | ~/.pi/agent/skills/ | docs |
For additional Qdrant context, pair skills with these MCP servers:
npx claudepluginhub qdrant/skills --plugin qdrantPinecone vector database integration. Streamline your Pinecone development with powerful tools for managing vector indexes, querying data, and rapid prototyping. Use slash commands like /quickstart to generate AGENTS.md files and initialize Python projects and /query to quickly explore indexes. Access the Pinecone MCP server for creating, describing, upserting and querying indexes with Claude. Perfect for developers building semantic search, RAG applications, recommendation systems, and other vector-based applications with Pinecone.
Manage vector embeddings and similarity search
Cloudflare Vectorize vector database for semantic search and RAG. Use for vector indexes, embeddings, similarity search, or encountering dimension mismatches, filter errors.
Weaviate plugin for Claude Coding
OpenSearch skills to help set up and deploy OpenSearch for a variety of use cases: build search applications with semantic, hybrid, neural sparse, and agentic search strategies; analyze observability data with log analytics (PPL and Query DSL) and distributed traces (OpenTelemetry); deploy to Amazon OpenSearch Service or OpenSearch Serverless with Bedrock integration for embeddings and RAG. Just ask your AI assistant to set up a search app, query logs, investigate traces, or deploy to AWS.
Official Claude plugin for Azure Cosmos DB (NoSQL). Bundles skills for data modeling, partition key design, query optimization, SDK best practices, indexing, vector search, full-text search, global distribution, security, and more.