By the-focus-ai
Hybrid document search using embeddings and grep for semantic + literal matching with metadata extraction and filtering
A Claude Code plugin that provides hybrid document search combining grep (literal matching) with semantic embeddings for comprehensive search across project documents.
Part of The-Focus-AI/claude-marketplace.
.embeddings/index.json file for fast loadingInstall via Claude Code plugin marketplace or add directly:
claude --plugin-dir /path/to/embeddings-search-skill
The skill activates automatically when searching for documents by meaning:
# Index a directory (default: sliding window strategy)
npx tsx src/cli.ts index .
# Index with specific strategy
npx tsx src/cli.ts index . --strategy paragraph
npx tsx src/cli.ts index . --strategy sentence
npx tsx src/cli.ts index . --strategy sliding --window 500 --overlap 150
# Search documents
npx tsx src/cli.ts search "budget concerns"
# Search with filters
npx tsx src/cli.ts search "love" --collection "jane austin"
npx tsx src/cli.ts search "meeting notes" --type report
npx tsx src/cli.ts search "quarterly review" --after 2024-01-01 --before 2024-04-01
# View discovered taxonomy
npx tsx src/cli.ts taxonomy .
# List files matching filters
npx tsx src/cli.ts files --collection "jane austin" --type novel
# View index statistics
npx tsx src/cli.ts stats .
The indexer automatically extracts metadata from:
--collection <name> - Filter by collection (derived from directory names)--type <type> - Filter by document type (novel, letter, report, essay, documentation, notes)--author <name> - Filter by author--topic <topic> - Filter by topic (love, marriage, society, money, etc.)--path <pattern> - Filter by file path pattern--after <date> - Only documents modified after date--before <date> - Only documents modified before dateDirectly indexed:
.txt - Plain text.md - MarkdownRequires text extraction first:
.pdf - PDF documents.doc, .docx - Word documents.ppt, .pptx - PowerPoint presentationsFor binary files, create a .txt sidecar with extracted text (e.g., report.pdf.txt).
Index is stored in .embeddings/index.json containing:
@xenova/transformers - Local embedding modelfast-glob - File discoverytsx - TypeScript executionMIT
Own this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimOwn this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimBased on adoption, maintenance, documentation, and repository signals. Not a security audit or endorsement.
Generate images and videos using Google Gemini and Veo models. Provides skills for AI image generation, image editing, text-to-video, and image-to-video workflows.
QuickBooks Online read-only access for Claude Code using OAuth 2.0
Pocket Casts integration - access starred episodes, listening history, show notes, and podcast metadata
Claude Code skills for managing Hetzner servers, Cloudflare DNS, and Docker deployments
Google services integration for Gmail, Calendar, Sheets, Docs, and YouTube using OAuth 2.0
npx claudepluginhub the-focus-ai/claude-marketplace --plugin embeddings-search-skillSearch your local documents with natural language. Index folders of PDFs, Word docs, markdown, and more — then ask questions in English or German and get answers with source citations.
Open source document intelligence and review plugin for Claude Code: ingest, search, review, enrich, and export local document workspaces.
Fast local documentation search with llms.txt indexing. Search 12K+ line docs in 6ms with line-accurate citations. One command (/blz) and one agent (@blz:blazer) for all documentation operations.
dahatake のエージェントスキル集。ローカル Markdown 横断クエリなど、コーディングエージェント向けのスキルを提供します。
LightRAG-based PDF document indexing and semantic search for research workflows
Zero-config knowledge base MCP server — search, manage, and embed documentation (SQLite default, PostgreSQL optional)