From mineru
Parses PDF, Office, and image files into structured Markdown using the MinerU API. Supports OCR, formula/table recognition, batch processing, and multi-format export (DOCX/HTML/LaTeX).
How this skill is triggered — by the user, by Claude, or both
Slash command
/mineru:mineruThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
Parse PDF, Office, and image documents into structured Markdown via the MinerU API.
references/api_reference.mdreferences/comparison.mdreferences/integrations.mdscripts/__init__.pyscripts/chunking.pyscripts/local_engine.pyscripts/mineru.pyscripts/mineru_mcp.pyscripts/sinks/__init__.pyscripts/sinks/_http.pyscripts/sinks/_md.pyscripts/sinks/airtable.pyscripts/sinks/base.pyscripts/sinks/coda.pyscripts/sinks/confluence.pyscripts/sinks/dingtalk.pyscripts/sinks/feishu.pyscripts/sinks/linear.pyscripts/sinks/local.pyscripts/sinks/notion.pyParse PDF, Office, and image documents into structured Markdown via the MinerU API.
# Zero-config: no token, no install (free Agent API)
python3 "${CLAUDE_PLUGIN_ROOT}/scripts/mineru.py" ./document.pdf --output ./output/
# Pipe Markdown back to an agent
python3 "${CLAUDE_PLUGIN_ROOT}/scripts/mineru.py" ./document.pdf --stdout
# Power mode: token unlocks large files / batch / extra formats
export MINERU_TOKEN="..." # https://mineru.net/apiManage/token
python3 "${CLAUDE_PLUGIN_ROOT}/scripts/mineru.py" ./pdfs/ --output ./output/ --workers 8 --resume
--ocr with language selection (--lang)--stdout (Markdown) and --json (machine status)--resumeA token is optional — the Agent API works without one. Set a token to unlock the Standard API (≤ 200 MB / ≤ 200 pages, batch, DOCX/HTML/LaTeX):
export MINERU_TOKEN="your-token-here" # https://mineru.net/apiManage/token
Official API docs: https://mineru.net/apiManage/docs
npx claudepluginhub nebutra/mineru-skillConverts local documents, remote document URLs, or web pages to Markdown using MinerU. Supports OCR, table/formula recognition, and works with PDF, DOCX, PPTX, images. Auto-selects free lightweight API or standard Token API based on configuration. Useful for PDF-to-Markdown, image-to-text, and document conversion tasks.
Parses local files (PDF, DOCX, XLSX, HTML, etc.) into clean markdown on disk. Offers AI summaries and Q&A over document content.
This skill should be used when the user says "process documents", "extract text from PDF", "OCR this document", "convert PDF to markdown", "extract emails from documents", "parse document", "document conversion", "batch OCR", "extract structured data from PDF", "read PDF", "extract tables from PDF", "convert Word document", "convert docx to markdown", or wants to extract, convert, or process documents and scanned images.