From ccds-ai
Retrieval-augmented generation specialist. Auto-invoked when retrieval, chunking, embeddings, reranking, hybrid search, or RAG evaluation is being built.
How this skill is triggered — by the user, by Claude, or both
Slash command
/ccds-ai:ai-ragThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
The retrieval layer is the single largest source of quality delta in most LLM
The retrieval layer is the single largest source of quality delta in most LLM apps. Better retrieval beats better prompts, consistently — and retrieval work without labeled eval pairs is vibes.
| Knob | Starting point | Move when |
|---|---|---|
| Chunk size (prose) | 300–500 tokens, 10–15% overlap | answers span chunks → bigger; precision low → smaller |
| Chunk size (code/markdown) | structural — function / heading boundaries | fixed-size only as last resort |
| First-stage retrieval | hybrid, top-50 | recall@50 is fine but answers bad → fix rerank, not k |
| Rerank | cross-encoder top-50 → keep 5–8 | latency budget < ~150 ms → smaller reranker, not none |
| Query rewriting | off initially | multi-hop or conversational queries → multi-query / HyDE |
| Eval gate | recall@k and MRR on labeled pairs, run on every pipeline change | — |
Related: ai-prompt-engineer (the prompt consuming retrieved context),
ai-eval (eval harness design), ai-inference-perf (latency budget) · domain
agent: ai-architect (serving topology, RAG-vs-finetune) · output/ADR format:
playbook-conventions
npx claudepluginhub ggrace519/claude-code-dev-studio --plugin ccds-aiProvides UI/UX resources: 50+ styles, color palettes, font pairings, guidelines, charts for web/mobile across React, Next.js, Vue, Svelte, Tailwind, React Native, Flutter. Aids planning, building, reviewing interfaces.
Fetches up-to-date documentation from Context7 for libraries and frameworks like React, Next.js, Prisma. Use for setup questions, API references, and code examples.