From ai-ml-eng-pro
End-to-end RAG system design — chunking strategies, embedding selection, retrieval optimization, reranking. Use when building or tuning a RAG pipeline.
How this skill is triggered — by the user, by Claude, or both
Slash command
/ai-ml-eng-pro:rag-architectThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
Designs and optimizes Retrieval-Augmented Generation (RAG) systems end-to-end. Covers chunking strategies (fixed, semantic, recursive, agentic), embedding model selection, vector database architecture, retrieval optimization (hybrid search, multi-stage retrieval), reranking, context window management, and hallucination reduction techniques.
Designs and optimizes Retrieval-Augmented Generation (RAG) systems end-to-end. Covers chunking strategies (fixed, semantic, recursive, agentic), embedding model selection, vector database architecture, retrieval optimization (hybrid search, multi-stage retrieval), reranking, context window management, and hallucination reduction techniques.
embedding-manager — Embedding generation and optimization for RAG retrievalprompt-engineer — RAG generation prompts require specialized designmodel-evaluator — Evaluate RAG system quality end-to-enddataset-curator — Curate document collections for RAG indexingnpx claudepluginhub haj1t/senior-dev-squad-skills --plugin ai-ml-eng-proSearches MemPalace before answering questions about past work, people, projects, or prior decisions. Returns verbatim stored content instead of guessing from model memory.
Guides Payload CMS config (payload.config.ts), collections, fields, hooks, access control, APIs. Debugs validation errors, security, relationships, queries, transactions, hook behavior.
Implements vector databases with Pinecone, Weaviate, Qdrant, Milvus, pgvector for semantic search, RAG, recommendations, and similarity systems. Optimizes embeddings, indexing, and hybrid search.