From llm-application-dev
Selects and optimizes embedding models for semantic search and RAG. Covers model comparison, chunking strategies, and pipeline design.
How this skill is triggered — by the user, by Claude, or both
Slash command
/llm-application-dev:embedding-strategiesThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
Guide to selecting and optimizing embedding models for vector search applications.
Guide to selecting and optimizing embedding models for vector search applications.
| Model | Dimensions | Max Tokens | Best For |
|---|---|---|---|
| voyage-3-large | 1024 | 32000 | Claude apps (Anthropic recommended) |
| voyage-3 | 1024 | 32000 | Claude apps, cost-effective |
| voyage-code-3 | 1024 | 32000 | Code search |
| voyage-finance-2 | 1024 | 32000 | Financial documents |
| voyage-law-2 | 1024 | 32000 | Legal documents |
| text-embedding-3-large | 3072 | 8191 | OpenAI apps, high accuracy |
| text-embedding-3-small | 1536 | 8191 | OpenAI apps, cost-effective |
| bge-large-en-v1.5 | 1024 | 512 | Open source, local deployment |
| all-MiniLM-L6-v2 | 384 | 256 | Fast, lightweight |
| multilingual-e5-large | 1024 | 512 | Multi-language |
Document → Chunking → Preprocessing → Embedding Model → Vector
↓
[Overlap, Size] [Clean, Normalize] [API/Local]
Full template library and detailed worked examples live in references/details.md. Read that file when you need the concrete templates.
npx claudepluginhub wshobson/agents --plugin llm-application-devSelects and optimizes embedding models like Voyage AI and OpenAI for RAG/semantic search. Covers comparisons, chunking strategies, domain models, and Python templates.
Guides selection and optimization of embedding models for vector search and RAG, including model comparisons, chunking strategies, dimension reduction, and Python templates for OpenAI and local models.