From pdf-ingest
Convert PDFs and office documents into machine-readable text or Markdown. Use for batch conversion, corpus building, long documents (convert once, read selectively), when output files are required, or when the host model cannot ingest PDFs natively. If the host reads PDFs natively (vision-capable, e.g. Claude's Read tool), prefer native reading for short interactive questions. Routes between markitdown, Docling/Granite, marker, and MinerU by document type, with token- and memory-budget discipline.
How this skill is triggered — by the user, by Claude, or both
Slash command
/pdf-ingest:pdf-ingestThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
Convert the document to Markdown with the right engine, then read the conversion selectively. Never feed a raw PDF or a whole large conversion into context.
Convert the document to Markdown with the right engine, then read the conversion selectively. Never feed a raw PDF or a whole large conversion into context.
uvx --from 'markitdown[pdf]' markitdown <file>.pdf | head -c 2000. If real prose comes back, the PDF is born-digital; if near-empty or garbage, it is a scan and needs an OCR-capable engine (step 4).engines/markitdown.md.engines/docling.md.engines/marker.md, engines/mineru.md.--effort high. Details: engines/mineru.md.Read the matching engine file before invoking; each contains exact commands and known failure modes.
--image-export-mode placeholder. If figures matter, use referenced and handle the image files separately.--use_llm) should call the same local OpenAI-compatible endpoint that serves the agent, not a second backend. Endpoint and model name: ask the user or check the obvious local ports (Osaurus 1337, Ollama 11434, mlx_lm/ds4 8000).Creates, edits, and optimizes skills for Claude Code, including drafting, evaluating with test prompts, iterating on performance, and improving skill descriptions for better triggering accuracy.
npx claudepluginhub danmackinlay/pdf-ingest-skill --plugin pdf-ingest