Skill Detail

Extract OCR-ready Markdown from documents with Zerox

Use Zerox to convert PDFs, images, and office documents into Markdown or structured extraction outputs using vision models.

Data Extraction & Transformation · Multi-Framework · Security Reviewed

⭐ 12.2k GitHub stars · ⬇ 5.3k/wk npm downloads

INSTALL WITH ANY AGENT

npx skills add agentskillexchange/skills --skill extract-ocr-ready-markdown-from-documents-with-zerox

Works best when you want a reusable capability, not another fragile one-off prompt.

View source Documentation

At a glance

Tools required

The Zerox package for Node.js or Python; GraphicsMagick and Ghostscript for PDF conversion on Node, or Poppler on Python; model provider credentials.
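A preflight check can catch a missing PDF conversion dependency before any extraction runs. The sketch below is illustrative only: the executable names (`gm` for GraphicsMagick, `gs` for Ghostscript, `pdftoppm` for Poppler) are assumptions about a typical Unix-like install, and `REQUIRED_TOOLS` and `missing_tools` are hypothetical names, not part of Zerox.

```python
import shutil

# Assumed executable names on a typical Unix-like install:
# GraphicsMagick ships `gm`, Ghostscript ships `gs`, Poppler ships `pdftoppm`.
REQUIRED_TOOLS = {
    "node": ["gm", "gs"],
    "python": ["pdftoppm"],
}


def missing_tools(runtime, which=shutil.which):
    """Return the required executables for `runtime` that are not on PATH."""
    if runtime not in REQUIRED_TOOLS:
        raise ValueError(f"unknown runtime: {runtime!r}")
    # `which` returns None when the executable cannot be found.
    return [name for name in REQUIRED_TOOLS[runtime] if which(name) is None]
```

Calling `missing_tools("python")` returns an empty list when Poppler's `pdftoppm` is found on PATH. The `which` parameter exists only so the lookup can be swapped out in tests.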

Install & setup

Install the Node package with npm install zerox, or install the Python package from PyPI. Install the PDF conversion dependencies for your runtime, configure model credentials, then call Zerox with a file path or URL and review the generated Markdown or structured output.
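The setup steps above might look roughly like this in Python. This is a minimal sketch under stated assumptions, not the project's reference usage: it assumes the PyPI package is `py-zerox` (imported as `pyzerox`), that its async `zerox` function accepts `file_path`, `model`, `select_pages`, and `output_dir`, that the result exposes `pages` with a `content` field, and that credentials are read from an environment variable such as `OPENAI_API_KEY`. The helper names and the file name `invoice.pdf` are hypothetical. Check the upstream documentation before relying on any of these.

```python
import asyncio


def build_request(file_path, model="gpt-4o-mini", select_pages=None, output_dir=None):
    """Collect keyword arguments for the extraction call (hypothetical helper)."""
    if not file_path:
        raise ValueError("file_path must be a local path or URL")
    request = {"file_path": file_path, "model": model}
    if select_pages is not None:
        request["select_pages"] = select_pages
    if output_dir is not None:
        request["output_dir"] = output_dir
    return request


async def extract_markdown(request):
    """Run the extraction and join the per-page Markdown into one string."""
    # Imported lazily so build_request works without the package installed.
    # Assumption: installed with `pip install py-zerox`.
    from pyzerox import zerox

    result = await zerox(**request)
    # Assumption: each page object carries its Markdown in `content`.
    return "\n\n".join(page.content for page in result.pages)


def main():
    # Assumption: model credentials are already set in the environment.
    request = build_request("invoice.pdf", select_pages=[1, 2], output_dir="./output")
    print(asyncio.run(extract_markdown(request)))
```

Calling `main()` needs network access, valid credentials, and the PDF conversion dependencies, so review the printed Markdown on a small document before wiring it into automation.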

Author

Omni AI

Publisher

Open Source Project

Last updated

Jun 2, 2026

Quick brief

Use Zerox when an agent workflow needs document OCR that preserves enough layout context for AI ingestion. The operator provides a PDF, DOCX, image, or remote file URL, configures model credentials, selects page and concurrency options, and receives Markdown or structured extraction output that can be reviewed before downstream automation.

Operators can run it on local files, choose provider-specific models, extract only selected pages, preserve table-heavy formatting, save outputs for review, and pass normalized Markdown to retrieval or data-entry workflows.

This is not a generic OCR library listing. The skill boundary is the document-to-agent-input workflow: file conversion, vision-model extraction, optional schema extraction, and handoff to review or retrieval steps. Invoke it instead of manually using a hosted OCR product when the workflow must run from code, needs a specific model provider or table-preserving output, or feeds normalized content into an agent pipeline. Keep the scope limited to extraction and review, not full document management.
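The handoff to review described above can be sketched as a small post-processing step. Everything here is hypothetical scaffolding rather than Zerox API: `Page` is a stand-in for whatever per-page record the extraction returns, and the `min_chars` threshold is an arbitrary illustration of a "looks too short to trust" rule.

```python
from dataclasses import dataclass


@dataclass
class Page:
    # Hypothetical stand-in for one extracted page; not a Zerox type.
    number: int
    content: str


def prepare_handoff(pages, min_chars=20):
    """Join page Markdown in page order and list pages that need human review."""
    ordered = sorted(pages, key=lambda p: p.number)
    # Flag pages whose extracted text is empty or suspiciously short.
    needs_review = [p.number for p in ordered if len(p.content.strip()) < min_chars]
    # Skip empty pages so the joined Markdown has no stray blank sections.
    markdown = "\n\n".join(p.content.strip() for p in ordered if p.content.strip())
    return markdown, needs_review
```

A downstream retrieval or data-entry step would take `markdown` only once `needs_review` is empty or a reviewer has signed off on the flagged pages.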

Best fit

When to reach for it

Best when the job fits Data Extraction & Transformation.
Works naturally with Multi-Framework setups.
Requires the Node.js or Python Zerox package, graphicsmagick/ghostscript for Node PDF conversion…
Installation is straightforward: install the Node package with npm install zerox or the Python package from PyPI, install required…

Trust & provenance

Why this listing is credible

Trust status: Security Reviewed.
12.2k GitHub stars on the linked upstream source.
5.3k/week npm downloads recorded.
Last updated Jun 2, 2026.
