By cwensel
Index PDFs, markdown, and source code into dual Qdrant (semantic) and MeiliSearch (full-text) indexes via the arc CLI, then search across corpora using either vector or keyword queries with AST-aware chunking and git metadata.
Manage Qdrant collections (semantic search only)
Configuration and cache management
Manage container services (Qdrant, MeiliSearch)
Manage dual-index corpora (recommended)
Check that all Arcaneum prerequisites are met and the system is ready for use.
Qdrant collection management for semantic search. Use when user mentions managing collections, creating collection, listing collections, verifying collection integrity, exporting or importing collections, or viewing indexed items in Qdrant.
Docker container management for Qdrant and MeiliSearch services. Use when user mentions starting services, stopping services, checking container status, viewing logs, restarting containers, or resetting database services.
Dual-index corpus management for combined semantic and full-text search. Use when user mentions corpus, dual indexing, syncing content to both Qdrant and MeiliSearch, checking parity between systems, deleting corpora, or managing content that needs both search types.
MeiliSearch index management for full-text search. Use when user mentions managing indexes, creating index, listing indexes, verifying index health, exporting or importing indexes, or viewing indexed items in MeiliSearch.
Search indexed corpora using semantic (vector) OR full-text (keyword) search via the arc CLI. Use when the user asks to search, find, look up, or query a corpus, collection, knowledge base, codebase, docs, PDFs, or markdown. Covers both conceptual queries and exact-term lookups.
Own this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimOwn this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimBased on adoption, maintenance, documentation, and repository signals. Not a security audit or endorsement.
CLI tools and Claude Code plugins for semantic and full-text search.
pipx install "https://github.com/cwensel/arcaneum/releases/download/v0.8.0/arcaneum-0.8.0-py3-none-any.whl"
Arcaneum helps you discover and understand project dependencies, documentation, and reference implementations. By indexing libraries, frameworks, and technical papers, you can semantically search for patterns, APIs, and concepts when building new projects. Works especially well with the RDR (Recommendation Data Record) model for AI-assisted development planning.
The system supports PDF documents and source code with git-aware, AST-based chunking.
Currently Available:
arc corpus sync --max-embedding-batch, --text-workers, --cpu-workers, and
single-system indexing flags such as --embedding-batch-size and --process-priorityarcaneum[sentence-transformers]arcaneum[sentence-transformers]arcaneum[sentence-transformers]arcaneum[sentence-transformers]arcaneum[sentence-transformers]See arc models list for complete model information and recommendations.
Use arc models list --json for the LLM-readable catalog: it includes backend,
recommended and default corpus uses, support/risk tier, prompt policy, context
limit, hardware support, runtime-aligned batch guidance, and reindex warnings.
Arcaneum records each collection's embedding prompt policy when it is indexed. Reindex a corpus after changing model query/document prompts, tasks, or prompt-aware model defaults; semantic search rejects collections whose stored prompt policy no longer matches the current model registry.
--gpu to opt into accelerator embeddingARC_EXPERIMENTAL_COREML=1/arc:search, /arc:index, /arc:collection, etc.)/help or /commands in Claude CodeGet started with Arcaneum in just a few commands:
# 1. Install
pipx install "https://github.com/cwensel/arcaneum/releases/download/v0.8.0/arcaneum-0.8.0-py3-none-any.whl"
# 2. Install Claude Code plugin (optional, in Claude Code)
# /plugin install cwensel/arcaneum
# 3. Verify and start services
arc doctor
arc container start
# 4. Create a corpus and sync content (indexes to both Qdrant and MeiliSearch)
arc corpus create Frameworks --type code
arc corpus sync Frameworks ~/libs/fastapi ~/libs/sqlalchemy
npx claudepluginhub cwensel/arcaneum --plugin arcPersistent file-based planning for AI coding agents. Crash-proof markdown plans (task_plan.md, findings.md, progress.md) that survive context loss and /clear, with an opt-in completion gate and multi-agent shared state. Manus-style. Works with Claude Code, Codex CLI, Cursor, Kiro, OpenCode and 60+ agents via the SKILL.md standard. Includes Arabic, German, Spanish, and Chinese (Simplified and Traditional).
A growing collection of Claude-compatible academic workflow bundles. Covers scientific figures, manuscript writing and polishing, reviewer assessment, citation retrieval, data availability, paper reading, literature search, response letters, paper-to-PPTX conversion, and evidence-grounded Chinese invention patent drafting. Rules are organized as reusable skill folders with explicit workflows and quality checks.
Harness-native ECC operator layer - 67 agents, 271 skills, 92 legacy command shims, reusable hooks, rules, selective install profiles, and production-ready workflows for Claude Code, Codex, OpenCode, Cursor, and related agent harnesses
Comprehensive skill pack with 66 specialized skills for full-stack developers: 12 language experts (Python, TypeScript, Go, Rust, C++, Swift, Kotlin, C#, PHP, Java, SQL, JavaScript), 10 backend frameworks, 6 frontend/mobile, plus infrastructure, DevOps, security, and testing. Features progressive disclosure architecture for 50% faster loading.
Tools to maintain and improve CLAUDE.md files - audit quality, capture session learnings, and keep project memory current.
Intelligent draw.io diagramming plugin with AI-powered diagram generation, multi-platform embedding (GitHub, Confluence, Azure DevOps, Notion, Teams, Harness), conditional formatting, live data binding, and MCP server integration for programmatic diagram creation and management.