Skill Detail

Cheerio DOM Scraping Toolkit

Parses static HTML using Cheerio's jQuery-like API for fast server-side DOM traversal and data extraction. Generates extraction patterns with CSS selectors optimized for resilience to layout changes.

Research & ScrapingCursor

Research & Scraping Cursor Published

Tool match: cheerio ⭐ 30.3k GitHub stars ⬇ 19.6M/wk npm MIT license

INSTALL WITH ANY AGENT

npx skills add agentskillexchange/skills --skill cheerio-dom-scraping-toolkit Copy

Works best when you want a reusable capability, not another fragile one-off prompt.

View source

At a glance

Last updated

Mar 24, 2026

Quick brief

The Cheerio DOM Scraping Toolkit skill uses Cheerio, the fast and lightweight jQuery-like library for server-side HTML parsing and manipulation. Unlike browser-based scraping, Cheerio operates on raw HTML strings without a DOM environment, making it significantly faster for static page extraction tasks.

How it works

What this skill actually does

The skill generates extraction code using Cheerio’s CSS selector engine, supporting complex selectors including attribute selectors, pseudo-classes (:nth-child, :contains, :has), and combinators. It constructs resilient selectors that tolerate minor layout changes by preferring semantic attributes (data-*, aria-*, role) over positional selectors.

Key capabilities include table-to-JSON conversion with header detection, structured data extraction from Schema.org/JSON-LD embedded markup, and form field enumeration for understanding submission requirements. The skill handles character encoding detection and conversion, relative URL resolution using Cheerio’s built-in utilities, and generates streaming extraction pipelines using htmlparser2 for memory-efficient processing of large HTML documents. Output includes typed TypeScript interfaces matching the extracted data structure.

Best fit

When to reach for it

Best when the job fits Research & Scraping.
Works naturally with Cursor setups.

Trust & provenance

Why this listing is credible

Built around the cheerio toolchain.
Trust status: Published.
30.3k GitHub stars on the linked upstream source.
19.6M/week npm downloads recorded.
License: MIT.
Last updated Mar 24, 2026.

View source ↗