Documind: AI-Powered Structured Data Extraction from Documents

Documind is an open-source Node.js tool that uses AI to extract structured JSON data from PDFs and other documents. Define a custom schema for the fields you need, and Documind returns clean, typed data. It supports OpenAI models and local LLM backends such as Llama 3.2 Vision.

Data Extraction & Transformation | Custom Agents | Security Reviewed

Tool match: documind ⭐ 1.5k GitHub stars ⬇ 14/wk npm

INSTALL WITH ANY AGENT

npx skills add agentskillexchange/skills --skill documind-ai-structured-data-extraction

Works best when you want a reusable capability, not another fragile one-off prompt.

View source

At a glance

Last updated: Mar 25, 2026

Quick brief

Documind is an open-source document processing tool that leverages AI vision models to extract structured data from unstructured documents. Maintained at github.com/DocumindHQ/documind, it bridges the gap between raw document content and the clean, typed JSON that applications actually need.

How it works

What this skill actually does

Documind's central feature is schema-driven extraction. Instead of getting back a blob of text from a document, you define exactly which fields you want (account numbers, transaction dates, line items, totals) along with their expected data types and descriptions. Documind then uses AI vision models to locate and extract those specific fields, returning structured JSON that matches your schema. This makes it practical for automating document workflows where you need specific data points, not just raw text.
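To make the idea concrete, here is a minimal TypeScript sketch of a field schema and a check that a returned JSON object matches it. The field shape (name, type, description, optional children) and the helper conformsTo are illustrative assumptions, not Documind's documented API; consult the upstream README for the real schema format.

```typescript
// Hypothetical schema shape for illustration; Documind's real format may differ.
type FieldType = "string" | "number" | "boolean" | "array" | "object";

interface Field {
  name: string;
  type: FieldType;
  description: string;
  children?: Field[]; // nested fields for "array" and "object" types
}

const bankStatementSchema: Field[] = [
  { name: "accountNumber", type: "string", description: "Account number on the statement" },
  { name: "closingBalance", type: "number", description: "Balance at the end of the period" },
  {
    name: "transactions",
    type: "array",
    description: "Individual transactions",
    children: [
      { name: "date", type: "string", description: "Transaction date" },
      { name: "amount", type: "number", description: "Transaction amount" },
    ],
  },
];

// Returns true when `value` is a plain object that has every field in `schema`
// with the expected type. Arrays without `children` accept any items.
function conformsTo(schema: Field[], value: unknown): boolean {
  if (typeof value !== "object" || value === null || Array.isArray(value)) return false;
  const record = value as Record<string, unknown>;
  return schema.every((field) => {
    const v = record[field.name];
    switch (field.type) {
      case "array":
        return (
          Array.isArray(v) &&
          (!field.children || v.every((item) => conformsTo(field.children ?? [], item)))
        );
      case "object":
        return conformsTo(field.children ?? [], v);
      default:
        return typeof v === field.type;
    }
  });
}

const sample = {
  accountNumber: "12345",
  closingBalance: 250.75,
  transactions: [{ date: "2026-01-03", amount: -42 }],
};
console.log(conformsTo(bankStatementSchema, sample)); // true
```

A check like this is useful on the consuming side regardless of which extraction tool produced the JSON, since model output can omit fields or return the wrong type.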

Under the hood, Documind converts documents to images using Ghostscript and GraphicsMagick, then sends them to an AI vision model for analysis. It supports OpenAI’s GPT-4 Vision as well as local models through Llama 3.2 Vision integration, giving teams the option to keep sensitive documents on-premises. The tool also supports auto-generated schemas: point it at a document and it infers which fields are available.
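The second step, handing a rendered page image to a vision model, can be sketched as follows. The sketch assumes a page has already been rendered to PNG and base64-encoded, and it builds a user message in the shape the OpenAI Chat Completions API accepts for image input. The function name buildVisionMessage and the prompt wording are hypothetical; none of this is taken from Documind's source.

```typescript
// Content parts in the OpenAI Chat Completions format for text plus image input.
type ContentPart =
  | { type: "text"; text: string }
  | { type: "image_url"; image_url: { url: string } };

interface UserMessage {
  role: "user";
  content: ContentPart[];
}

// Hypothetical helper: wraps one base64-encoded PNG page and a prompt
// into a single user message, embedding the image as a data URL.
function buildVisionMessage(pageBase64: string, prompt: string): UserMessage {
  return {
    role: "user",
    content: [
      { type: "text", text: prompt },
      { type: "image_url", image_url: { url: `data:image/png;base64,${pageBase64}` } },
    ],
  };
}

// "iVBORw0KGgo=" is placeholder base64 data standing in for a real page image.
const message = buildVisionMessage(
  "iVBORw0KGgo=",
  "Extract the fields listed in the schema and reply with JSON only."
);
console.log(message.content.length); // 2
```

A locally hosted model would need its own request format, so a real integration would likely keep this message-building step behind a per-backend adapter.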

Beyond JSON extraction, Documind can convert documents to Markdown format, which is useful for feeding document content into LLM pipelines. It includes template schemas for common document types like invoices, bank statements, and receipts, so you can start extracting data without writing custom schemas.

A skill wrapping Documind lets an AI agent process incoming documents and extract exactly the data it needs. The agent could parse invoices for accounting workflows, extract form data for processing, or convert documents into structured formats for database ingestion.

Install with: npm install documind. The project requires Node.js 18+ plus two system dependencies, Ghostscript and GraphicsMagick. Documind is MIT-licensed and under active development.
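A small preflight check for the prerequisites listed above (Node.js 18+, Ghostscript, GraphicsMagick) might look like the sketch below. It is written as pure functions so it can be tested without touching the system: the caller supplies the Node.js version string (for example process.version) and the list of binaries found on PATH. The helper names are hypothetical, and the binary names gs and gm are the usual command names for Ghostscript and GraphicsMagick on Linux and macOS; they may differ on other platforms.

```typescript
// Hypothetical preflight helper; not part of Documind.
const REQUIRED_NODE_MAJOR = 18;
// Usual CLI names for Ghostscript and GraphicsMagick (may differ on Windows).
const REQUIRED_BINARIES = ["gs", "gm"];

// Parses "v18.17.0" or "18.17.0" into its major version number, or NaN if malformed.
function nodeMajor(version: string): number {
  const match = /^v?(\d+)\./.exec(version);
  return match ? Number(match[1]) : NaN;
}

// Returns a human-readable list of unmet prerequisites; an empty list means ready.
function missingPrerequisites(nodeVersion: string, binariesOnPath: string[]): string[] {
  const problems: string[] = [];
  const major = nodeMajor(nodeVersion);
  // The negated comparison also catches NaN from a malformed version string.
  if (!(major >= REQUIRED_NODE_MAJOR)) {
    problems.push(`Node.js ${REQUIRED_NODE_MAJOR}+ required, found ${nodeVersion}`);
  }
  for (const bin of REQUIRED_BINARIES) {
    if (!binariesOnPath.includes(bin)) problems.push(`${bin} not found on PATH`);
  }
  return problems;
}

// Example: a recent Node.js with Ghostscript installed but GraphicsMagick missing.
console.log(missingPrerequisites("v20.11.1", ["gs"]));
```

Running a check like this before the first extraction gives a clearer error than a failure deep inside the image conversion step.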

Best fit

When to reach for it

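Best when the job is data extraction or transformation, such as pulling specific fields out of invoices, bank statements, or receipts.
Works naturally in Custom Agents setups that need to process incoming documents.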

Trust & provenance

Why this listing is credible

Built around the documind toolchain.
Trust status: Security Reviewed.
1.5k GitHub stars on the linked upstream source.
14/week npm downloads recorded.
Last updated Mar 25, 2026.
