Skill Detail

Extract OCR-ready Markdown from documents with Zerox

Use Zerox to convert PDFs, images, and office documents into Markdown or structured extraction outputs using vision models.

Data Extraction & Transformation · Multi-Framework · Security Reviewed
⭐ 12.2k GitHub stars ⬇ 5.3k/wk npm
INSTALL WITH ANY AGENT
npx skills add agentskillexchange/skills --skill extract-ocr-ready-markdown-from-documents-with-zerox
Works best when you want a reusable capability, not another fragile one-off prompt.
At a glance
Tools required
The Zerox package for Node.js or Python; GraphicsMagick and Ghostscript for PDF conversion under Node.js, or Poppler under Python; credentials for a model provider
Install & setup
Install the Node package with npm install zerox, or install the Python package from PyPI. Install the PDF conversion dependencies for your runtime and configure model credentials. Then call Zerox with a file path or URL and review the generated Markdown or structured output.
Author
Omni AI
Publisher
Open Source Project
Last updated
Jun 2, 2026
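
The install and setup steps above can be sketched in Python roughly as follows. This is a minimal sketch under several assumptions, not the project's reference usage: the import name pyzerox and the keyword arguments file_path, model, select_pages, and output_dir follow the Python package's published usage as best understood here, OPENAI_API_KEY stands in for whichever provider credential applies, and the binary names pdftoppm, gm, and gs are what typical Poppler, GraphicsMagick, and Ghostscript installs provide. The helpers missing_tools, build_request, and extract are hypothetical names introduced for illustration.

```python
import os
import shutil

# Binaries the PDF conversion step is assumed to need on PATH.
# Assumption: Poppler provides `pdftoppm`, GraphicsMagick provides `gm`,
# and Ghostscript provides `gs` on a typical install.
REQUIRED_TOOLS = {
    "python": ["pdftoppm"],
    "node": ["gm", "gs"],
}


def missing_tools(runtime, which=shutil.which):
    """Return the required binaries for `runtime` that are not found on PATH."""
    return [tool for tool in REQUIRED_TOOLS[runtime] if which(tool) is None]


def build_request(file_path, model="gpt-4o-mini", select_pages=None, output_dir=None):
    """Collect keyword arguments for the zerox call (hypothetical helper)."""
    if not file_path:
        raise ValueError("file_path must be a local path or a URL")
    return {
        "file_path": file_path,
        "model": model,
        "select_pages": select_pages,
        "output_dir": output_dir,
    }


async def extract(file_path, **options):
    """Run Zerox on one document after checking tools and credentials."""
    absent = missing_tools("python")
    if absent:
        raise RuntimeError(f"Install PDF conversion tools first: {absent}")
    # Assumption: an OpenAI model is used; other providers read other variables.
    if "OPENAI_API_KEY" not in os.environ:
        raise RuntimeError("Set model provider credentials, e.g. OPENAI_API_KEY")
    # Imported lazily so the helpers above work without the package installed.
    from pyzerox import zerox
    return await zerox(**build_request(file_path, **options))
```

With the package, Poppler, and a credential in place, a call such as asyncio.run(extract("invoice.pdf", select_pages=[1, 2], output_dir="./out")) would request only the first two pages and write the Markdown to ./out for review. The file name and output directory here are placeholders.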
Quick brief

Use Zerox when an agent workflow needs document OCR that preserves enough layout context for AI ingestion. The operator provides a PDF, DOCX, image, or remote file URL, configures model credentials, selects page and concurrency options, and receives Markdown or structured extraction output that can be reviewed before downstream automation. Operators can run it on local files, choose provider-specific models, extract only selected pages, save outputs for review, and pass normalized Markdown to retrieval or data-entry workflows.

This is not a generic OCR library listing. The skill boundary is the document-to-agent-input workflow: file conversion, vision-model extraction, optional schema extraction, and handoff to review or retrieval steps. Invoke it instead of manually using a hosted OCR product when the workflow must run from code, choose among model providers, preserve table-heavy formatting, or feed normalized content into an agent pipeline. Keep the scope limited to extraction and review, not full document management.
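
The handoff from extraction to review could look something like the sketch below. It assumes the extracted pages have already been reduced to (page_number, markdown) pairs; that shape, and the helper names normalize, has_table, and build_review_bundle, are hypothetical and not part of Zerox. The table check is a simple heuristic that looks for a Markdown separator row, chosen here only because table-heavy pages are the ones most worth a human look.

```python
def normalize(text):
    """Trim trailing whitespace and collapse runs of blank lines to one."""
    lines = [line.rstrip() for line in text.strip().splitlines()]
    out = []
    for line in lines:
        if line == "" and out and out[-1] == "":
            continue  # skip repeated blank lines
        out.append(line)
    return "\n".join(out)


def has_table(text):
    """Heuristic: detect a Markdown table separator row such as |---|---|."""
    for line in text.splitlines():
        stripped = line.strip()
        if (
            stripped.startswith("|")
            and "-" in stripped
            and set(stripped) <= set("|-: ")
        ):
            return True
    return False


def build_review_bundle(pages):
    """Merge (page_number, markdown) pairs and list pages needing review.

    A page is flagged when it is empty after normalization or when it
    appears to contain a table.
    """
    sections = []
    needs_review = []
    for page_number, markdown in sorted(pages, key=lambda pair: pair[0]):
        body = normalize(markdown)
        if not body or has_table(body):
            needs_review.append(page_number)
        # An HTML comment marks the page boundary without affecting rendering.
        sections.append(f"<!-- page {page_number} -->\n{body}")
    return "\n\n".join(sections), needs_review
```

The merged document can go to a retrieval index or data-entry step, while the flagged page numbers go to a reviewer first. Keeping the two outputs separate means the review queue stays small even for long documents.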