Skill Detail

Search PDFs, Office files, ebooks, and archives with one query before manual review

Uses ripgrep-all to run one full-text search across mixed document and archive formats so an agent can find evidence without separately extracting every file type first. Best when a workflow has PDFs, Office documents, ebooks, media sidecars, or compressed bundles that need fast on-demand search.

Research & ScrapingMulti-Framework

Research & Scraping Multi-Framework Security Reviewed

⭐ 9.6k GitHub stars

INSTALL WITH ANY AGENT

npx skills add agentskillexchange/skills --skill search-pdfs-office-files-ebooks-and-archives-with-one-query-before-manual-review

Copy

Works best when you want a reusable capability, not another fragile one-off prompt.

View source

At a glance

Tools required

ripgrep; optional adapters and dependencies such as pandoc, poppler, and ffmpeg.

Install & setup

Install via Homebrew (`brew install rga`) or Cargo (`cargo install –locked ripgrep_all`).

Author

phiresky

Publisher

User

Last updated

Apr 12, 2026

Quick brief

This skill wraps `ripgrep-all` (`rga`) as a bounded retrieval workflow for heterogeneous document collections. An agent should invoke it when the working set includes PDFs, Office files, ebooks, archives, or other non-plain-text assets and the immediate goal is to locate evidence, names, phrases, IDs, or other keywords before doing deeper review. That makes it useful for investigations, due diligence, compliance sweeps, migration audits, support triage, and any research task where the fastest next step is finding which files are relevant.

How it works

What this skill actually does

The scope boundary keeps this from collapsing into a generic search product listing. This skill is not a hosted index, document-management system, or general desktop search replacement. The agent is using the upstream CLI to run targeted searches across a local corpus on demand, relying on `ripgrep-all` adapters to extract searchable text from supported formats. If the task needs long-lived indexing, semantic retrieval, or a full knowledge base, the agent should use a different tool.

Integration points are practical and agent-friendly. Point the workflow at a directory of attachments, exports, or archive dumps, run `rga` with the search term, then pass matching paths and snippets into summarization, escalation, tagging, or evidence-pack generation. Upstream documentation shows installation through package managers such as Homebrew (`brew install rga`) or Cargo (`cargo install –locked ripgrep_all`). The README also documents adapter dependencies like `ripgrep`, `pandoc`, `poppler`, and `ffmpeg`, which expand the file types the agent can search in one pass.

Best fit

When to reach for it

Best when the job fits Research & Scraping.
Works naturally with Multi-Framework setups.
Requires ripgrep; optional adapters and dependencies such as pandoc, poppler, and….
Installation is straightforward: Install via Homebrew (`brew install rga`) or Cargo (`cargo install –locked ripgrep_all`).

Trust & provenance

Why this listing is credible

Trust status: Security Reviewed.
9.6k GitHub stars on the linked upstream source.
Last updated Apr 12, 2026.

View source ↗