By shihwesley
Turn any topic into a compact expertise artifact through a structured research pipeline (question tree, source discovery, zero-context fetch, .mv2 indexing, REPL distillation) without loading raw content into LLM context. Execute Python code in isolated Docker sandboxes for data analysis, prototyping, and DSPy sub-agents. Query Neo4j graph databases for knowledge retrieval.
Execute Python code in an isolated Docker sandbox with DSPy sub-agent support. Use for code execution, data analysis, or recursive language model tasks that need sandboxed evaluation.
Research agent that fetches, indexes, and searches documentation using the neo-research knowledge store. Use when you need to research a library, framework, or API before implementation.
Unified research pipeline. Parses any input (topic, paragraph, URLs), builds a structured question tree, discovers and fetches sources (zero context cost), indexes into .mv2, distills via systematic querying into a compact expertise artifact. Output: agent becomes domain expert without reading raw content.
Research Apple frameworks against a spec or plan file using exported Dash docset documentation. Produces per-feature reference sections with verbatim code examples and API signatures. Use when user says /apple-research, needs to look up Apple framework APIs before implementation, or wants to generate framework reference docs from a spec file.
Audit previously researched topics and optionally re-index them through the current pipeline. Use when the research flow has changed and you want to backfill, or to see what's been researched.
Show what's indexed in the neo-research knowledge store — sources, chunk counts, file sizes. Use when the user asks what docs are available, what's been indexed, or wants a status check before searching.
Research any topic — builds question tree, discovers sources, fetches to disk (zero context cost), indexes into .mv2, distills into expertise artifact. Agent becomes domain expert. Use when you need to learn about a technology, protocol, framework, or domain before working with it.
Admin access level
Server config contains admin-level keywords
Uses power tools
Uses Bash, Write, or Edit tools
Own this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimOwn this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimBased on adoption, maintenance, documentation, and repository signals. Not a security audit or endorsement.
Research pipeline for Claude Code. One command turns any topic into agent expertise — without blowing your context window.
> /research "SwiftUI NavigationStack in iOS 17+ with programmatic navigation"
Research complete: swiftui-navigationstack
- Sources: 14 fetched, 12 indexed
- Expertise: ~/.claude/research/swiftui-navigationstack/expertise.md (3.2K tokens)
- Skill generated: ~/.claude/skills/swiftui-navigationstack/SKILL.md
- Deep-dive: rlm_search(query="...", project="swiftui-navigationstack")
We built neo-research as a Docker sandbox for running Python and testing whether a REPL-based approach to recursive language model programs could work. It worked, but the project was missing a clear purpose — a bag of tools without a workflow.
At the same time, we had a separate /research skill that was supposed to help Claude learn new topics. It was doing fetch-and-hope: search Google, grab some pages, dump them in a knowledge store, and cross your fingers.
Neither tool was solving the real problem: how do you make an AI agent actually know a topic the way a developer knows a topic? Not "I have some indexed docs I can search." Real working knowledge — the kind where you understand the mental model, know the common patterns, and can warn about the gotchas before someone hits them.
So we combined them. The sandbox REPL handles the heavy data processing (fetching, indexing, querying) without burning context tokens. The research methodology structures what to look for. The distillation step turns raw indexed content into a compact expertise artifact. The agent reads 10-15K tokens of targeted excerpts instead of 500K of raw docs.
The result is a research pipeline that vibe coders can use: /research <topic> and your agent becomes the expert. The RLM REPL theory still gets tested — the sandbox runs all the programmatic knowledge store queries during distillation. But now it has a reason to exist.
/research "WebTransport protocol"
│
▼
┌─────────────────────────┐
│ 1. Parse input │ Topic, URLs, context — any format
│ 2. Build question tree │ 4-7 branches: what/how/API/gotchas/...
│ 3. Discover sources │ Targeted search per branch, quality-ranked
│ 4. Fetch to disk │ curl → markdown → .mv2 (zero context cost)
│ 5. Distill │ Query .mv2 per branch → expertise.md
│ 6. Generate artifacts │ Skill and/or subagent from expertise
└─────────────────────────┘
│
▼
Agent now knows the topic.
Knowledge store available for deep-dives.
The input is flexible. A topic name, a paragraph of context, specific URLs, or a mix of all three. The pipeline figures out what to search for based on a question tree — not blind keyword searches.
Content never enters the agent's context window during fetch. Pages go to disk, then into a .mv2 knowledge store. The agent only reads targeted excerpts during the distillation phase.
/plugin marketplace add shihwesley/shihwesley-plugins
/plugin install neo-research@shihwesley-plugins
First run creates a Python venv and installs dependencies automatically.
Docker is optional — the research pipeline and knowledge tools work without it. Sandbox execution tools need Docker.
git clone https://github.com/shihwesley/neo-research.git
cd neo-research
./scripts/setup.sh
# Start MCP server directly (stdio transport)
./scripts/start-server.sh
# Optional: build Docker sandbox image
docker compose up -d --build
After installing, restart Claude Code. The MCP server loads with 21 tools.
Research a topic (the main workflow):
> /research fastapi
Research complete: fastapi
- Sources: 14 fetched, 12 indexed
- Expertise: ~/.claude/research/fastapi/expertise.md (4.1K tokens)
- Knowledge store: ~/.claude/research/fastapi/knowledge.mv2
- Skill generated: ~/.claude/skills/fastapi/SKILL.md
Search indexed knowledge directly:
> Use rlm_search("dependency injection") to find the pattern
Found 5 results:
1. [0.94] tutorial/dependencies.md — "Declare the dependency..."
2. [0.88] advanced/dependencies.md — "Sub-dependencies are resolved..."
npx claudepluginhub shihwesley/shihwesley-plugins --plugin neo-researchMaps codebases of any size using parallel Sonnet subagents with Merkle-tree change detection. Generates docs/CODEBASE_MAP.md with architecture diagrams, module guides, data flows, and navigation guides. Creates docs/.mercator.json merkle manifest for O(1) staleness checks — compare a single root hash to know if anything changed. Post-commit hook auto-refreshes hashes in ~2 seconds with zero API tokens. Supports incremental updates: only re-explores changed modules. Works without git via pure hash-based diff.
Reverse engineer and extract capabilities from any source — repos, configs, binaries, articles — into your agent workflow. Security scanning, manifest tracking, cross-agent restore.
TLDR-aware code simplifier — uses AST summaries for context, full reads only for target files. 80%+ token savings on large codebases. Merkle-integrated for O(1) change detection when used with mercator-ai.
Ambient dev environment management — auto-switches between dev, test, staging, and prod using Docker containers, sidecars, and task-aware classification. Provides /orbit commands for explicit control.
Ambient .tech.md generation — auto-documents and tracks freshness of your codebase
MCP server that saves 98% of your context window with session continuity. Sandboxed code execution in 11 languages, FTS5 knowledge base with BM25 ranking, and automatic state restore across compactions.
Open-source, local-first Claude Code plugin for token reduction, context compression, and cost optimization using hybrid RAG retrieval (BM25 + vector search), reranking, AST-aware chunking, and compact context packets.
Complete AI coding workflow system. Self-correcting memory + persistent FTS5-indexed research wikis + auto-research loop + multi-LLM council on a single SQLite store. 33 skills, 8 agents, 22 commands, 37 hook scripts across 24 events. Cross-agent via SkillKit.
Intelligent draw.io diagramming plugin with AI-powered diagram generation, multi-platform embedding (GitHub, Confluence, Azure DevOps, Notion, Teams, Harness), conditional formatting, live data binding, and MCP server integration for programmatic diagram creation and management.
Complete creative writing suite with 10 specialized agents covering the full writing process: research gathering, character development, story architecture, world-building, dialogue coaching, editing/review, outlining, content strategy, believability auditing, and prose style/voice analysis. Includes genre-specific guides, templates, and quality checklists.
Give your AI a memory — mine projects and conversations into a searchable palace. 33 MCP tools, auto-save hooks, and guided setup.