Legal Ops & Compliance - Agent Skill Marketplace

Industry collection

⚖️ Legal Ops & Compliance

Contract risk review, redline preparation, forms, document review, archive search, and evidence-oriented legal and compliance support.

Back to hub Browse Skills Previous: Ecommerce & Retail Operations Next: Healthcare Documentation & Intake

Who this is for

Legal operations, compliance, contract, and records teams that need document intake, review, redline preparation, and archive workflows.
Teams preparing evidence packets where provenance, human legal approval, and review boundaries matter.

Jobs covered

Review contract risk and prepare lawyer-ready redline suggestions for human legal approval.
Convert scanned PDFs and office files into searchable text.
Extract clauses, tables, attachments, and metadata from mixed records.
Run cited research and matter knowledge retrieval with source boundaries.
Build diligence review tables and route higher-risk agent actions through approval gates.
Redact sensitive data before sharing or indexing.
Search large archives before manual review.

Workflow Stacks

Contract risk and redline support: Extract contract text → flag clauses and risk areas → prepare redline suggestions → route to human legal review → record approval evidence
Document review packet: OCR → extract text and tables → redact PII → search archive → export review notes
Signing and forms: Prepare PDF forms → route signature → store final packet → index metadata
Research and diligence support: Search cited sources → ingest matter documents → extract review-table fields → gate external actions → preserve decision evidence

Curated Skills (30)

Review contract risk and redlines with Claude Legal Skill

Supports contract risk review and lawyer-ready redline preparation while keeping final interpretation, negotiation, and legal advice with qualified human counsel.

Legal ops / contract review leadMedium install355 stars

Documenso Open Source Document Signing Platform

Adds an auditable signing path for contract and approval packets.

Legal ops / contract adminHigh install12.6k stars

DocuSeal Open Source Document Signing and PDF Form Platform

Combines PDF form preparation and signatures for document-heavy approval flows.

Legal ops / forms administratorMedium install11.7k stars

OCRmyPDF Searchable PDF OCR Pipeline

Turns scanned evidence and records into searchable PDFs before review.

Records manager / compliance analystMedium install33.2k stars

Apache Tika Document Extractor

Provides broad-format document extraction when matter files include Office docs, PDFs, and attachments.

eDiscovery engineer / records opsHigh install3.7k stars

Extract structured text, metadata, tables, and images from mixed documents through an MCP server with Kreuzberg

Adds an MCP-accessible extraction layer for PDFs, Office files, images, HTML, and other mixed matter inputs before review or indexing.

Matter knowledge engineer / eDiscovery opsHigh install7.6k stars

pdfplumber Python PDF Text and Table Extraction Library

Pulls tables, text, and layout clues from contract exhibits and regulatory PDFs.

Legal analyst / data wranglerMedium install10.1k stars

Parse local PDFs into agent-ready text, JSON, and screenshots with LiteParse

Creates text, spatial JSON, and screenshots so reviewers can inspect what an agent saw.

Document review lead / AI opsMedium install5.1k stars

Search PDFs, Office files, ebooks, and archives with one query before manual review

Finds relevant records across mixed archives before humans spend time opening files one by one.

Investigator / records analystLow install9.6k stars

Paperless-ngx Document OCR and Archive Management System

Provides a durable archive system for scanned paperwork, tags, correspondents, and retrieval.

Compliance ops / records managerHigh install38.1k stars

LangExtract LLM-Powered Structured Text Extraction

Extracts named entities, obligations, dates, and clauses into auditable structured outputs.

Legal analyst / compliance reviewerMedium install35k stars

Turn messy document collections into structured rows with DocETL

Turns large contract, diligence, or evidence sets into repeatable structured rows with failure review across the corpus.

Diligence lead / legal data analystHigh install3.7k stars

Redact PII from text before sharing or indexing with scrubadub

Redacts sensitive identifiers before content enters search, summarization, or external review.

Privacy analyst / compliance opsLow install421 stars

Search large PDFs and read only the relevant pages before answering

Limits review to relevant pages of long PDFs instead of pushing full documents through an agent.

Legal researcher / review analystMedium install17 stars

Run local deep research workflows with Local Deep Research

Runs private cited research across web, academic, and local document sources while preserving source links and a controlled knowledge base.

Legal researcher / knowledge managerHigh install7.9k stars

Process, redact, OCR, and sign documents with Nutrient Agent Skill

Bundles OCR, redaction, form filling, conversion, and signing for governed document operations.

Document automation leadHigh install5 stars

Convert dense PDFs into LLM-ready text and page-aligned markdown with olmOCR

Converts dense scanned or layout-heavy PDFs into page-aligned text for cited review.

eDiscovery analyst / knowledge engineerHigh install17.1k stars

Turn documents into validated knowledge graphs with Docling Graph

Extracts schema-checked entities and relationships when matters need structured fact maps.

Knowledge engineer / compliance analystHigh install134 stars

Use RAGFlow as a retrieval and context layer for agent workflows

Provides a supervised RAG layer for matter document knowledge bases with traceable source support before agent answers are reviewed.

Matter knowledge manager / legal AI opsHigh install79.8k stars

Extract structured markdown, JSON, and tagged-PDF-ready outputs from PDFs with OpenDataLoader PDF

Produces markdown, coordinate-aware JSON, and accessibility-oriented outputs from PDF packets.

Document processing engineerHigh install19.1k stars

Enrich Paperless-ngx documents with AI-generated titles tags and correspondents using paperless-gpt

Improves archive metadata after ingestion so humans can search and route records faster.

Records manager / knowledge opsHigh install2.3k stars

Capture a live webpage as a clean PDF or readable archive for offline review with Percollate

Preserves web evidence as readable offline artifacts for citation and handoff.

Investigator / compliance analystLow install4.6k stars

Extract structured data and attachments from raw email with MailParser

Normalizes raw email evidence and attachments before archive search or review.

Legal ops / mailbox reviewerMedium install1.7k stars

Strip quoted email history and signatures before summarizing inbound replies

Separates the newest human reply from long threads so summaries do not duplicate history.

Case manager / legal assistantLow install78 stars

Load .mbox mail archives into SQLite for offline search, audits, and dataset joins

Turns mailbox archives into queryable SQLite evidence stores for offline audit work.

Investigator / data analystMedium install39 stars

MarkItDown Document-to-Markdown Converter by Microsoft

Converts Office files, PDFs, email-like documents, and other matter inputs into Markdown for review packets and audit summaries.

Legal ops analyst / compliance reviewerMedium install93.2k stars

MinerU PDF-to-Markdown Document Parser

Handles complex PDFs with layout-aware Markdown and JSON output for contract packets, exhibits, and long-form compliance evidence.

eDiscovery engineer / records analystHigh install57.8k stars

Put approval gates and audit-ready policy checks between agents and external actions with DashClaw

Adds approval gates and replayable decision evidence when legal AI workflows need human review before external actions.

Legal AI governance lead / compliance opsHigh install241 stars

Extract OCR-ready Markdown from documents with Zerox

Turns scanned contracts, exhibits, and evidence PDFs into reviewable Markdown before legal search or redaction workflows.

Legal ops / document review leadMedium install12.2k stars

Prepare agent-ready PDF and document extraction with PyMuPDF

Gives legal ops a fast source-backed PDF extraction step before contract review, diligence packets, or clause analysis.

Legal ops analyst / contract reviewerMedium install10.1k stars

Skill	What it does here	Persona	Install	Stars
Review contract risk and redlines with Claude Legal Skill	Supports contract risk review and lawyer-ready redline preparation while keeping final interpretation, negotiation, and legal advice with qualified human counsel.	Legal ops / contract review lead	Medium	355
Documenso Open Source Document Signing Platform	Adds an auditable signing path for contract and approval packets.	Legal ops / contract admin	High	12.6k
DocuSeal Open Source Document Signing and PDF Form Platform	Combines PDF form preparation and signatures for document-heavy approval flows.	Legal ops / forms administrator	Medium	11.7k
OCRmyPDF Searchable PDF OCR Pipeline	Turns scanned evidence and records into searchable PDFs before review.	Records manager / compliance analyst	Medium	33.2k
Apache Tika Document Extractor	Provides broad-format document extraction when matter files include Office docs, PDFs, and attachments.	eDiscovery engineer / records ops	High	3.7k
Extract structured text, metadata, tables, and images from mixed documents through an MCP server with Kreuzberg	Adds an MCP-accessible extraction layer for PDFs, Office files, images, HTML, and other mixed matter inputs before review or indexing.	Matter knowledge engineer / eDiscovery ops	High	7.6k
pdfplumber Python PDF Text and Table Extraction Library	Pulls tables, text, and layout clues from contract exhibits and regulatory PDFs.	Legal analyst / data wrangler	Medium	10.1k
Parse local PDFs into agent-ready text, JSON, and screenshots with LiteParse	Creates text, spatial JSON, and screenshots so reviewers can inspect what an agent saw.	Document review lead / AI ops	Medium	5.1k
Search PDFs, Office files, ebooks, and archives with one query before manual review	Finds relevant records across mixed archives before humans spend time opening files one by one.	Investigator / records analyst	Low	9.6k
Paperless-ngx Document OCR and Archive Management System	Provides a durable archive system for scanned paperwork, tags, correspondents, and retrieval.	Compliance ops / records manager	High	38.1k
LangExtract LLM-Powered Structured Text Extraction	Extracts named entities, obligations, dates, and clauses into auditable structured outputs.	Legal analyst / compliance reviewer	Medium	35k
Turn messy document collections into structured rows with DocETL	Turns large contract, diligence, or evidence sets into repeatable structured rows with failure review across the corpus.	Diligence lead / legal data analyst	High	3.7k
Redact PII from text before sharing or indexing with scrubadub	Redacts sensitive identifiers before content enters search, summarization, or external review.	Privacy analyst / compliance ops	Low	421
Search large PDFs and read only the relevant pages before answering	Limits review to relevant pages of long PDFs instead of pushing full documents through an agent.	Legal researcher / review analyst	Medium	17
Run local deep research workflows with Local Deep Research	Runs private cited research across web, academic, and local document sources while preserving source links and a controlled knowledge base.	Legal researcher / knowledge manager	High	7.9k
Process, redact, OCR, and sign documents with Nutrient Agent Skill	Bundles OCR, redaction, form filling, conversion, and signing for governed document operations.	Document automation lead	High	5
Convert dense PDFs into LLM-ready text and page-aligned markdown with olmOCR	Converts dense scanned or layout-heavy PDFs into page-aligned text for cited review.	eDiscovery analyst / knowledge engineer	High	17.1k
Turn documents into validated knowledge graphs with Docling Graph	Extracts schema-checked entities and relationships when matters need structured fact maps.	Knowledge engineer / compliance analyst	High	134
Use RAGFlow as a retrieval and context layer for agent workflows	Provides a supervised RAG layer for matter document knowledge bases with traceable source support before agent answers are reviewed.	Matter knowledge manager / legal AI ops	High	79.8k
Extract structured markdown, JSON, and tagged-PDF-ready outputs from PDFs with OpenDataLoader PDF	Produces markdown, coordinate-aware JSON, and accessibility-oriented outputs from PDF packets.	Document processing engineer	High	19.1k
Enrich Paperless-ngx documents with AI-generated titles tags and correspondents using paperless-gpt	Improves archive metadata after ingestion so humans can search and route records faster.	Records manager / knowledge ops	High	2.3k
Capture a live webpage as a clean PDF or readable archive for offline review with Percollate	Preserves web evidence as readable offline artifacts for citation and handoff.	Investigator / compliance analyst	Low	4.6k
Extract structured data and attachments from raw email with MailParser	Normalizes raw email evidence and attachments before archive search or review.	Legal ops / mailbox reviewer	Medium	1.7k
Strip quoted email history and signatures before summarizing inbound replies	Separates the newest human reply from long threads so summaries do not duplicate history.	Case manager / legal assistant	Low	78
Load .mbox mail archives into SQLite for offline search, audits, and dataset joins	Turns mailbox archives into queryable SQLite evidence stores for offline audit work.	Investigator / data analyst	Medium	39
MarkItDown Document-to-Markdown Converter by Microsoft	Converts Office files, PDFs, email-like documents, and other matter inputs into Markdown for review packets and audit summaries.	Legal ops analyst / compliance reviewer	Medium	93.2k
MinerU PDF-to-Markdown Document Parser	Handles complex PDFs with layout-aware Markdown and JSON output for contract packets, exhibits, and long-form compliance evidence.	eDiscovery engineer / records analyst	High	57.8k
Put approval gates and audit-ready policy checks between agents and external actions with DashClaw	Adds approval gates and replayable decision evidence when legal AI workflows need human review before external actions.	Legal AI governance lead / compliance ops	High	241
Extract OCR-ready Markdown from documents with Zerox	Turns scanned contracts, exhibits, and evidence PDFs into reviewable Markdown before legal search or redaction workflows.	Legal ops / document review lead	Medium	12.2k
Prepare agent-ready PDF and document extraction with PyMuPDF	Gives legal ops a fast source-backed PDF extraction step before contract review, diligence packets, or clause analysis.	Legal ops analyst / contract reviewer	Medium	10.1k

Editorial Notes

The collection avoids legal-advice framing; these are contract review, redline preparation, intake, evidence, and operations tools for human legal approval.
Document-centric entries are favored over general security scanners unless they support compliance evidence work directly.
Research and RAG picks are framed as source-grounded support for legal operations and human review, not automated legal advice.
Do not let infra-policy scanners take over this collection. Keep v1 document-centric.

Adjacent Collections

Finance & Filings Healthcare Documentation & Intake Real Estate Workflows

Editorial Context

Legal technology has a long history of overpromising. Tools land with claims about AI-powered contract review, autonomous compliance monitoring, or predictive litigation outcomes—then quietly reduce scope once lawyers actually use them. The result: a deep skepticism in legal and compliance departments toward anything labeled “AI.”

Agent skills for legal ops take a different approach. The best ones do not promise to practice law. They handle documents: extracting text from PDFs, routing signature requests, searching archived filings, structuring extracted fields, and packaging evidence for human review. That is genuinely useful work—and it is work legal teams spend hours on every week.

This post maps the legal ops and compliance skill category on Agent Skill Exchange, explains what each type of skill does, and lays out the limits that make these skills trustworthy for regulated work.

The Core Thesis: Legal Agents Are Evidence Infrastructure

Before walking through specific skill types, it helps to have a clear mental model. Legal and compliance workflows are not primarily reasoning workflows—they are document workflows. The bottleneck is rarely “someone needs to think harder.” It is almost always “someone needs to find the right document, extract the right clause, confirm a signature is in place, or gather enough evidence to brief the decision-maker.”

That is exactly the gap agent skills fill well. An agent that surfaces the right document, extracts the right fields, and packages them with source links reduces the time a paralegal or compliance officer spends on mechanical search—without replacing their judgment about what those documents mean.

The clearest sign a legal skill is well-designed: its output is a structured artifact with provenance (source file, page number, extraction method), not a confident plain-English conclusion handed down without references.

Skill Category: OCR and PDF Text Extraction

A significant fraction of legal documents still arrive as scanned PDFs: older contracts, court filings, notarized agreements, legacy regulatory submissions. Before any structured extraction can happen, the text has to be readable.

OCR skills handle this step using tools like Tesseract (open-source, works offline, suitable for high-volume batch jobs) or cloud OCR APIs where accuracy on degraded scans matters more. The skill’s job is to take a scanned image or PDF and return clean, searchable text the agent can work with downstream.

What a good OCR skill includes in its gotchas section:

Column layout detection (court filings often have two-column formats that confuse naive OCR)
Confidence thresholds—output that falls below a minimum confidence score should be flagged for human review, not silently passed downstream
Page rotation handling (scanned faxes are frequently sideways or upside-down)
Language detection when a document corpus spans multiple jurisdictions

OCR is infrastructure. It has no opinion about what the text means. That makes it one of the safer skills in any regulated workflow—it fails visibly when it cannot read something, rather than hallucinating content.

Skill Category: Structured Field Extraction

Once text is readable, the next common task is extraction: pulling specific fields out of a document and populating a structured record. For contracts, this might be effective date, governing law clause, termination notice period, and party names. For invoices, it might be vendor, amount, line items, and due date. For regulatory filings, it might be entity identifiers, reporting period, and attestation signatures.

Extraction skills typically combine a parsing library (like pdfplumber or pypdf for native PDFs) with prompt-guided field identification for the model. The output is a JSON or structured table, not a narrative summary.

The important constraint: extraction results should always include the source text excerpt and page reference alongside the extracted value. A downstream reviewer—whether human or another agent—needs to confirm the extraction, not just trust it. A skill that returns {"governing_law": "Delaware"} without showing where in the document it found that value is harder to audit than one that returns {"governing_law": "Delaware", "source": "§14.3, p. 22, 'This Agreement shall be governed by the laws of the State of Delaware'"}.

Browse legal ops skills on ASE for extraction tools built around common contract types and compliance document formats.

Skill Category: Archive Search and Retrieval

Legal and compliance teams often maintain large archives: past contracts by counterparty, prior regulatory correspondence, internal policy versions, historical filings. Finding the right document quickly—without relying on someone’s memory of where they saved it—is a consistent pain point.

Archive search skills index document collections and expose a retrieval interface the agent can call: “Find all contracts with Company X that include a right-of-first-refusal clause” or “Show me every version of the data processing addendum we’ve executed since 2023.”

These skills work best when:

Documents have been pre-ingested with consistent metadata (counterparty, document type, execution date, status)
Search results include a relevance score and source reference, not just the matched text
The skill exposes a citation format the agent can include in downstream reports

Archive retrieval is one area where the investment in document hygiene pays off quickly. A skill can only find what has been indexed—and what has been indexed accurately.

Skill Category: Document Signing and Routing

E-signature routing is a well-understood workflow, but wiring it into an agent’s operating context removes a significant amount of manual back-and-forth. Signing skills typically wrap an API like DocuSign, HelloSign, or PandaDoc to:

Prepare a document for signature (tag fields, assign signatories)
Dispatch the envelope and track completion status
Retrieve a completed, signed document and store it in the appropriate location
Surface unsigned documents that are past their expected turnaround time

The agent’s role here is orchestration, not authorization. The humans named as signatories still decide whether to sign. The skill removes the overhead of manually creating envelopes, chasing status, and filing completed documents—it does not make the signing decision for anyone.

One gotcha worth noting: signing skills should validate that the correct version of a document is being sent for signature before dispatching. Sending a draft for execution is a classic ops error that a pre-flight check in the skill’s logic can prevent.

Skill Category: Compliance Document Workflows

Compliance work often involves repetitive document cycles: annual policy acknowledgments, vendor security questionnaires, due diligence requests, audit evidence packages, regulatory attestations. The pattern is consistent—collect, compile, verify, route, store—but doing it manually for dozens of documents and stakeholders is tedious and error-prone.

Compliance workflow skills encode these cycles as repeatable agent tasks. A well-built skill for annual policy sign-off, for example, might:

Retrieve the current approved policy document
Check which employees are missing current-year acknowledgment based on HRIS data
Dispatch acknowledgment requests with a deadline
Track completion and escalate to managers for overdue recipients
Produce an evidence log with timestamps for audit purposes

The skill handles the mechanical cycle. The compliance officer still sets policy, approves document versions, and decides what to do about exceptions. The skill makes the gap between “policy exists” and “policy acknowledged by all staff, evidenced” much shorter to close.

What Legal Skills Do Not Do (And Should Not)

This is worth making explicit, because the legal-tech space has conditioned people to expect inflated claims and then be disappointed.

Legal skills on ASE do not:

Provide legal advice or legal opinions
Autonomously approve, execute, or bind anyone to a contract
Make compliance determinations (e.g., “this contract complies with GDPR”)
Replace attorney review for material agreements
Guarantee the accuracy of extracted fields without human verification for high-stakes decisions

This is not a limitation caused by poor skill design—it is the correct boundary. Agent skills are most trustworthy when they are explicit about what they hand off to humans. A skill that surfaces a potential missing indemnification clause and flags it for review is useful. A skill that concludes the contract is legally sound is overstepping what any extraction system should claim to do.

The practical test: if a skill’s output would be used directly to make a legal or regulatory decision without any human review, the workflow design is wrong—not the skill.

Building a Legal Ops Workflow With Agent Skills

Here is a realistic workflow that combines several skill types into a useful legal document intake process, without overpromising:

Step 1 — Intake: New documents arrive via email attachment, shared drive, or contract management system webhook. A document intake skill identifies file type, runs OCR if needed, and saves the result to a processing queue with metadata (source, received date, document type if identifiable from filename or header).

Step 2 — Extraction: An extraction skill reads the processed text and pulls standard fields into a structured record. For a vendor agreement: party names, effective date, term length, governing law, and key clause flags (limitation of liability, indemnification, data processing). Output includes source excerpts for every field.

Step 3 — Archive matching: An archive search skill checks whether the agent has processed prior agreements with the same counterparty. If so, it surfaces prior terms for comparison, flagging any material differences (a change in governing law or a shorter notice period than usual).

Step 4 — Review packet assembly: The agent compiles a review packet: extracted fields table, comparison with prior terms, flagged clauses, and links to the source document. This packet goes to the reviewer—attorney, paralegal, or contracts manager—who can confirm, correct, or escalate.

Step 5 — Signing (post-review): Once the reviewer approves, a signing skill routes the document for execution and tracks status through completion.

Step 6 — Archive: The completed, signed document is stored in the appropriate archive with metadata, so future archive searches can find it.

This workflow does not require autonomous decision-making at any point. It requires fast, accurate document handling—which is exactly what the skills are built for. The attorney’s judgment is applied at Step 4, where it matters, rather than being consumed by Steps 1 through 3.

Choosing Skills for Regulated Environments

When evaluating legal ops skills for production use, a few factors matter beyond basic functionality:

Data handling: Does the skill process documents locally, or does it make external API calls? For documents covered by attorney-client privilege, NDAs, or regulatory data handling obligations, the data path matters. Skills that support local model inference or self-hosted deployments give legal teams more control.

Audit trail: Does the skill log what it extracted, from where, and when? A skills workflow without an audit trail is harder to defend if extraction results are later questioned.

Failure modes: What does the skill do when it cannot confidently extract a field? The correct behavior is to return a null or flagged result, not a guess. Check the skill’s gotchas section for how it handles ambiguous or low-confidence extractions.

Version control: For compliance document cycles, skills should be versioned and skills should reference specific policy document versions—not just the latest available file—so audit evidence is reproducible.

The Legal Ops & Compliance collection on ASE surfaces skills that have passed the ASE editorial review for domain fit: they include domain-specific objects (contracts, filings, clauses, signatories), a realistic workflow scope, and honest documentation of what they do not handle.

The Right Frame: Operations Infrastructure, Not Legal Intelligence

The legal teams that get the most out of agent skills are the ones who stop asking “can the AI understand this contract?” and start asking “can the AI handle the document ops that surround this contract so my lawyers can focus on the contract itself?”

That reframe is the entire value proposition. OCR, extraction, archive search, signing routing, and evidence packaging are not glamorous—but they consume a disproportionate share of legal and compliance bandwidth. Skills that handle them reliably, with clear evidence and honest limits, are the ones that earn lasting trust in regulated environments.

The legal-tech hype cycle is long. Skills that are transparent about their scope—and that hold that line even when it is less impressive-sounding—are the ones worth building on.

Explore the Legal Ops & Compliance skill collection on ASE, or read about how ASE approaches agent skill taxonomy and writing effective gotchas sections for skills in regulated workflows.