By zilliztech
RAG (Retrieval Augmented Generation) solutions: basic RAG, RAG with reranking, agentic RAG, and multi-hop RAG for complex reasoning
Use when user needs an autonomous RAG agent that decides when and what to retrieve dynamically. Triggers on: agentic RAG, agent, autonomous retrieval, tool use, function calling, research agent, conversational RAG, dynamic retrieval, self-directed search, RAG with tools, intelligent assistant, adaptive retrieval.
Use when user needs multi-step reasoning with iterative retrieval for complex questions. Triggers on: multi-hop, multi-step RAG, complex questions, chain of retrieval, iterative retrieval, complex reasoning, cross-document reasoning, question decomposition, research questions, fact synthesis, connecting information across documents.
Use when user needs high-precision RAG with reranking for domains where accuracy is critical. Triggers on: rag rerank, precise RAG, cross-encoder, reranking RAG, legal QA, medical QA, high-precision QA, two-stage retrieval, semantic reranking, improve RAG accuracy, relevance scoring, document ranking.
Use when user wants to build RAG, Q&A system, or knowledge base with documents. Triggers on: RAG, retrieval augmented generation, Q&A system, knowledge base, document Q&A, chat with docs, ChatGPT for docs, LLM + retrieval, semantic search over documents, ground LLM with facts, reduce hallucination, enterprise search.
Own this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimOwn this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimBased on adoption, maintenance, documentation, and repository signals. Not a security audit or endorsement.
A data retrieval development assistant based on Claude Code Skills.
We specialize in the data retrieval vertical:
┌─────────────────────────────────────────────────────────┐
│ Scenario Plugins (6 plugins) │
│ rag-toolkit, retrieval-system, multimodal-retrieval, │
│ rec-system, memory-system, data-analytics │
│ = Pre-built solutions = AI era caching mechanism │
└─────────────────────────────────────────────────────────┘
↑
Match / Combine
↑
┌─────────────────────────────────────────────────────────┐
│ core plugin │
│ Methodology (pilot) + Atomic operators │
│ (embedding, chunking, ...) │
└─────────────────────────────────────────────────────────┘
Core Ideas:
Scenarios are classified by architectural differences, not by industry or model:
plugins/retrieval-system/skills/
├── semantic-search/ # Category: architecture definition
│ ├── SKILL.md # Generic workflow + model selection table
│ └── verticals/ # Subcategory: vertical application guides
│ ├── legal.md # Legal search
│ ├── academic.md # Academic papers
│ └── ecommerce.md # E-commerce search
User describes requirement
│
▼
pilot activates
│
├─→ Clarify data and query
│
├─→ Can match a scenario?
│ ├─ Yes → Use pre-built solution
│ └─ No → Combine core operators
│
├─→ Generate code → User tests
│
└─→ Collect feedback → Iterate
/plugin marketplace add zilliztech/milvus-marketplace
# Core tools (required)
/plugin install core@milvus-marketplace
# Install scenario plugins as needed
/plugin install rag-toolkit@milvus-marketplace # RAG solutions
/plugin install retrieval-system@milvus-marketplace # Text search
/plugin install multimodal-retrieval@milvus-marketplace # Image/video/multimodal
/plugin install rec-system@milvus-marketplace # Recommendations
/plugin install memory-system@milvus-marketplace # Chat memory
/plugin install data-analytics@milvus-marketplace # Duplicate detection, clustering
Simply describe what you want to build:
"Help me build a document Q&A system"
"I want to implement semantic search"
"Build an image search application"
The pilot will automatically activate, clarify requirements, and help you orchestrate the toolchain and generate code.
| Type | Skill | Purpose |
|---|---|---|
| Controller | pilot | AI application navigator - understands requirements, orchestrates tools, delivers code |
| Operator | embedding | Text/image vectorization |
| Operator | chunking | Document chunking |
| Operator | indexing | Milvus index management |
| Operator | data-ingestion | Batch data import |
| Operator | rerank | Search result reranking |
| Operator | pdf-extract | PDF text extraction |
| Operator | vlm-caption | Image captioning (VLM) |
| Environment | local-setup | Local Milvus deployment |
| Skill | Architecture | Vertical Applications |
|---|---|---|
| semantic-search | embedding → vector search | Legal, academic, news, e-commerce, code, patents |
| hybrid-search | vector + BM25 keyword + score fusion | E-commerce, legal, academic |
| filtered-search | vector search + scalar filtering | E-commerce, recruitment, real estate |
| multi-vector-search | multi-vector field joint search | Products, papers, resumes |
npx claudepluginhub zilliztech/milvus-marketplace --plugin rag-toolkitAutomatic semantic memory for Claude Code — remembers what you worked on across sessions
Long-term memory solutions for chatbots and AI assistants: conversation history retrieval, user profiling, and persistent memory across sessions
Multimodal retrieval solutions: image-to-image search, text-to-image search, video content retrieval, and multimodal RAG for documents with images
Data analytics solutions: duplicate detection (deduplication, plagiarism detection) and clustering (topic modeling, user segmentation)
Core tools for vector database development: pilot (main controller + routing), ray (data processing orchestration), embedding, chunking, indexing, rerank, and local deployment
Build Retrieval-Augmented Generation pipelines
Pinecone vector database integration. Streamline your Pinecone development with powerful tools for managing vector indexes, querying data, and rapid prototyping. Use slash commands like /quickstart to generate AGENTS.md files and initialize Python projects and /query to quickly explore indexes. Access the Pinecone MCP server for creating, describing, upserting and querying indexes with Claude. Perfect for developers building semantic search, RAG applications, recommendation systems, and other vector-based applications with Pinecone.
OpenRAG agent skills: guided installation and SDK integration helpers.
Google File Search API powered RAG pipeline - managed retrieval-augmented generation with document processing
Agent skills for Qdrant vector search: scaling, performance optimization, search quality, monitoring, deployment, model migration, version upgrades, and SDK usage
Weaviate plugin for Claude Coding