From pdf-to-markdown
Converts PDFs to structured Markdown preserving headings, tables, lists, reading order. Use for text extraction, batch processing, RAG ingestion, LLM context, or PDF analysis tasks.
Slash command
/pdf-to-markdown:pdf-to-markdown
Convert PDFs into structured, semantic Markdown that preserves the document's logical structure — headings, tables, lists, and reading order — rather than producing flat text. This is significantly higher quality than reading a PDF directly with the `read` tool, which only extracts raw text without structure.
Before running any commands, set SKILL_DIR to the absolute path of the directory containing this SKILL.md file. Use $SKILL_DIR/bin/pdf-to-markdown in all commands below.
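One way to set SKILL_DIR as described above, offered as a minimal sketch: it resolves the directory containing SKILL.md to an absolute path. The SKILL_MD variable and its default value are hypothetical placeholders, not part of the skill; substitute the real location of the file.

```shell
# Hypothetical: path to this skill's SKILL.md. Replace with the real location.
SKILL_MD="${SKILL_MD:-./SKILL.md}"

# Resolve the directory that contains SKILL.md to an absolute path.
SKILL_DIR="$(cd "$(dirname "$SKILL_MD")" && pwd)"
export SKILL_DIR

# The wrapper used by the commands in this document then lives here:
echo "$SKILL_DIR/bin/pdf-to-markdown"
```

Using `cd` followed by `pwd` avoids depending on `realpath`, which is not available on every system.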
The $SKILL_DIR/bin/pdf-to-markdown wrapper automatically installs the platform-specific binary into ~/.local/share/nutrient/cli/ from the CDN. It caches the binary and only checks for updates every 6 hours, so subsequent runs are fast.
$SKILL_DIR/bin/pdf-to-markdown INPUT.pdf OUTPUT.md
If OUTPUT.md is omitted, the converter writes the Markdown to stdout instead.
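The stdout behavior described above can be sketched as a small helper. The function name `pdf_to_stdout` and the file name `report.pdf` are hypothetical, and the sketch assumes SKILL_DIR is already set.

```shell
# Hypothetical helper: convert one PDF and emit the Markdown on stdout.
# Assumes SKILL_DIR already points at the skill directory.
pdf_to_stdout() {
  if [ ! -f "$1" ]; then
    echo "not a file: $1" >&2
    return 1
  fi
  # No OUTPUT argument is passed, so the converter writes to stdout.
  "${SKILL_DIR:?set SKILL_DIR first}/bin/pdf-to-markdown" "$1"
}

# Example use (report.pdf is a placeholder name):
#   pdf_to_stdout report.pdf | head -n 20
#   pdf_to_stdout report.pdf > report.md
```

Writing to stdout is convenient when the Markdown is only needed as input to another command and no file should be left behind.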
For multiple files, pass directories instead of individual files. The converter processes all PDFs in the input directory in parallel, which is much faster than converting one at a time.
$SKILL_DIR/bin/pdf-to-markdown INPUT_DIR/ OUTPUT_DIR/
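A batch run might be wrapped as follows. This is a sketch under stated assumptions: the function name `convert_dir` is hypothetical, SKILL_DIR is assumed to be set, and creating the output directory beforehand is a precaution rather than documented behavior.

```shell
# Hypothetical helper: convert every PDF in one directory into another.
# Assumes SKILL_DIR already points at the skill directory.
convert_dir() {
  in_dir="$1"
  out_dir="$2"
  if [ ! -d "$in_dir" ]; then
    echo "input directory not found: $in_dir" >&2
    return 1
  fi
  # Precaution only: the source does not say whether the converter
  # creates the output directory itself.
  mkdir -p "$out_dir" || return 1
  # A single invocation for the whole directory lets the converter
  # parallelize internally instead of being called once per file.
  "${SKILL_DIR:?set SKILL_DIR first}/bin/pdf-to-markdown" "$in_dir/" "$out_dir/"
}

# Example use (directory names are placeholders):
#   convert_dir ./pdfs ./markdown
```

Looping over files and calling the converter once per PDF would give up the parallel processing that directory mode provides.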
$SKILL_DIR/bin/pdf-to-markdown INPUT [OUTPUT]
Free for processing up to 1,000 documents per calendar month.
Commercial license required for:
Contact [email protected] for commercial licensing.
npx claudepluginhub pspdfkit-labs/nutrient-skills --plugin pdf-to-markdown
Converts local PDF, DOCX, XLSX, PPTX, images via OCR, and audio files to clean Markdown using Microsoft's markitdown CLI. Best for text extraction from local documents.
Convert PDF files to LLM-ready markdown, DocTags, or JSON using Docling. Handles analysis, summarization, OCR, and batch processing with token savings estimates.
Parses local or remote PDF files into markdown or structured JSON using fastCRW. Supports CLI, MCP, and REST interfaces with options for AI summaries and structured extraction.