From mnemo
Local, token-free, graph-based project memory. Use this skill whenever the user wants to remember, recall, or build durable context about a project, codebase, client, or document set — e.g. "remember this project", "build memory from these files", "what do we know about X", "recall our decisions on Y", "give me the project overview", "set up project memory", or when starting work on a project that already has memory. Memory is created and queried entirely by LOCAL models (Ollama + Tesseract + MarkItDown), so it costs almost no Claude tokens. Prefer this over reading whole documents into context. Not for one-off web research.
How this skill is triggered — by the user, by Claude, or both
Slash command
/mnemo:memoryThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
Mnemo gives Claude durable **project memory** built and queried **entirely
Mnemo gives Claude durable project memory built and queried entirely locally (Ollama for the LLM + embeddings, Tesseract for OCR, MarkItDown for document conversion). The guiding principle:
Claude tokens are precious; local compute is free. Never load whole documents into the conversation when a
memory_queryreturns the same answer in a few hundred tokens.
Everything is stored in a global store (~/.claude-memory/, override with
MNEMO_HOME) namespaced per project, so memory persists across sessions and is
reusable across projects.
memory_overview (loads a compact digest:
overview + key entities/relationships/facts; a few hundred tokens).memory_query(query, scope="project"|"all").
Returns only the most relevant entities + facts with provenance. Use
scope="all" to draw on every project's memory at once.memory_expand(entity) for its neighborhood.memory_build(source_dir). Converts every
file (PDF/DOCX/PPTX/XLSX/images via OCR + a local vision model), extracts a
knowledge graph, writes memory.md + an embedding index + an interactive HTML
mind map. Returns compact stats only. Runs for minutes on large corpora — all
local; do not read the files yourself.memory_update (incremental; only changed
files are re-processed).memory_open_mindmap.memory_status, memory_list_projects.memory_overview once instead of opening
files.memory_query first. Only read a specific source file if
memory is genuinely insufficient — and then read just that one file.If memory_status shows Ollama down or models missing, run the installer
(one-time, local, free):
./scripts/install.sh
It installs Ollama + Tesseract, pulls qwen2.5:7b (extraction),
nomic-embed-text (retrieval), and moondream (image captions), and creates the
Python environment. Models are configurable via MNEMO_EXTRACT_MODEL,
MNEMO_EMBED_MODEL, MNEMO_VISION_MODEL.
| Tool | Purpose |
|---|---|
memory_build(source_dir, project?, model?, vision?, ocr_lang?, max_files?, reset?) | Build/rebuild memory from a folder (local, token-free). |
memory_update(project?, source_dir?) | Incremental refresh of changed files. |
memory_overview(project?) | Compact project digest (best at session start). |
memory_query(query, project?, k?, scope?) | Semantic recall → tiny relevant subgraph. |
memory_expand(entity, project?, depth?) | One entity's neighborhood. |
memory_list_projects() | All projects + counts. |
memory_open_mindmap(project?) | Open the interactive HTML graph. |
memory_status() | Stack health (Ollama, models, Tesseract, store). |
Converted text, the graph, and the mind map live only in ~/.claude-memory/ on the
local machine. Nothing is sent to the cloud and nothing about your documents is
committed to the plugin's repository.
Provides UI/UX resources: 50+ styles, color palettes, font pairings, guidelines, charts for web/mobile across React, Next.js, Vue, Svelte, Tailwind, React Native, Flutter. Aids planning, building, reviewing interfaces.
Fetches up-to-date documentation from Context7 for libraries and frameworks like React, Next.js, Prisma. Use for setup questions, API references, and code examples.
npx claudepluginhub gru-953/mnemo-mcp --plugin mnemo