By danmackinlay
Convert PDFs and office documents into machine-readable text or Markdown, routing by document type between markitdown, Docling, marker, and MinerU.
An agent skill for agents that must read PDFs without cloud help: no vision API to lean on, often no network at all, and a local chat model already occupying most of the machine's memory.
It is deliberately specialised, on two axes:
What the skill actually does: route each PDF to the cheapest adequate local converter. Probe the text layer, then choose between markitdown (instant text-layer extraction), Docling + Granite-Docling (default quality engine), marker (maths → LaTeX, scans), and MinerU (accuracy-maximalist escalation) — with token- and memory-budget discipline throughout.
SKILL.md — the routing policy and budget disciplinesengines/*.md — per-engine commands and known failure modes, loaded on demandDesign rationale and the empirical legwork behind the recipes: danmackinlay.name/notebook/pdf_ingestion.
Not a general document-AI toolkit, and not the right tool when the host model reads PDFs natively (a vision-capable cloud model answering a quick question about a short PDF should just read it — the skill says so in step 0). It earns its keep on batch conversion, corpus building, long documents, and text-only local models.
Claude Code — clone (or submodule) into a skills directory:
git clone https://github.com/danmackinlay/pdf-ingest-skill ~/.claude/skills/pdf-ingest
Hermes Agent:
git clone https://github.com/danmackinlay/pdf-ingest-skill ~/.hermes/skills/pdf-ingest
pi:
pi install git:github.com/danmackinlay/pdf-ingest-skill
The converters themselves install separately (each engine file states its one-liner; all are uv tool install / uvx style).
This repo is published via git subtree from a private monorepo where the skill is actively edited and tested. Issues and PRs are welcome here and will be subtree-pulled back.
MIT licensed.
Own this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimOwn this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimBased on adoption, maintenance, documentation, and repository signals. Not a security audit or endorsement.
npx claudepluginhub danmackinlay/pdf-ingest-skill --plugin pdf-ingestUltra-compressed communication mode. Cuts ~75% of tokens while keeping full technical accuracy by speaking like a caveman.
Frontend design skill for UI/UX implementation
Comprehensive UI/UX design plugin for mobile (iOS, Android, React Native) and web applications with design systems, accessibility, and modern patterns
Memory compression system for Claude Code - persist context across sessions
Marketing skills for AI agents — conversion optimization, copywriting, SEO, paid ads, ad creative, and growth
Standalone image generation plugin using Nano Banana MCP server. Generates and edits images, icons, diagrams, patterns, and visual assets via Gemini image models. No Gemini CLI dependency required.