Skill Detail

Normalize article metadata from URLs before generating link previews or content briefs

Uses metascraper to turn a URL plus its fetched HTML into normalized fields such as title, description, author, date, publisher, and lead image. This is useful when an agent needs reliable preview or briefing data from a page without building a custom parser for every site.

Content Writing & SEO Multi-Framework Published
โญ 2.7k GitHub stars โฌ‡ 416k/wk npm
INSTALL WITH ANY AGENT
npx skills add agentskillexchange/skills --skill normalize-article-metadata-from-urls-before-generating-link-previews-or-content-briefs
Works best when you want a reusable capability, not another fragile one-off prompt.
At a glance
Tools required
Node.js and an HTML retrieval step such as fetch, Playwright, or browserless
Install & setup
npm install metascraper
Author
microlinkhq
Publisher
Organization
Last updated
Apr 12, 2026
Quick brief

This ASE entry is built around metascraper, the open source project from microlinkhq/metascraper that extracts unified metadata from web pages using Open Graph, Twitter Cards, JSON-LD, Microdata, RDFa, and ordinary HTML fallbacks. In an agent workflow, the job is not "do SEO" in the abstract. The real job is much narrower: take a target URL, fetch the page markup, and return a consistent metadata object that downstream automations can trust for previews, research briefs, content queues, or lightweight editorial intake. That usually means normalized fields like title, description, author, date, publisher, image, logo, and URL, even when the page exposes those signals in slightly different formats.
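
The fetch-then-normalize flow described above might look roughly like the sketch below. It rests on several assumptions that are not stated in this entry: that the per-field rule packages (metascraper-title, metascraper-description, and so on) are installed alongside metascraper, that Node.js 18 or later provides the global fetch, and that a plain HTTP request is enough to get the markup. The names FIELDS, buildScraper, withStableShape, and extractMetadata are hypothetical helpers chosen for illustration.

```javascript
// Sketch only. Assumes metascraper and the rule packages named below are
// installed, and that Node.js 18+ supplies the global fetch().

// Fields this entry says downstream steps usually expect.
const FIELDS = ['title', 'description', 'author', 'date', 'publisher', 'image', 'logo', 'url'];

// Hypothetical helper: builds the scraper lazily so the rule packages are
// only loaded when a scrape is actually requested.
function buildScraper() {
  return require('metascraper')([
    require('metascraper-author')(),
    require('metascraper-date')(),
    require('metascraper-description')(),
    require('metascraper-image')(),
    require('metascraper-logo')(),
    require('metascraper-publisher')(),
    require('metascraper-title')(),
    require('metascraper-url')(),
  ]);
}

// Hypothetical helper: keeps only the expected keys and fills gaps with null,
// so consumers always see the same object shape.
function withStableShape(metadata) {
  const shaped = {};
  for (const field of FIELDS) {
    shaped[field] = metadata[field] ?? null;
  }
  return shaped;
}

// Fetches one page and returns its normalized metadata.
// `scraper` is injectable so a stub can stand in for metascraper in tests.
async function extractMetadata(targetUrl, scraper = buildScraper()) {
  const response = await fetch(targetUrl);
  if (!response.ok) {
    throw new Error(`Fetch failed for ${targetUrl}: HTTP ${response.status}`);
  }
  const html = await response.text();
  // response.url is the final URL after redirects; it is passed to the
  // scraper together with the markup.
  const metadata = await scraper({ url: response.url, html });
  return withStableShape(metadata);
}
```

Calling `extractMetadata('https://example.com/some-article')` would resolve to an object with the eight keys in FIELDS. Pages that only render their metadata client-side would likely need a browser-based retrieval step in place of the plain fetch.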

How it works

What this skill actually does

Use this when an agent is preparing link cards for chat, assembling newsletter candidates, enriching bookmarks, building a reading list, or generating a first-pass content brief from multiple sources. It is especially useful when the page HTML is already available from a browser or fetch step and the next system needs structured metadata instead of raw markup. An agent can pair metascraper with Playwright, browserless, or a simple HTTP fetch, then pass the normalized result into a CMS, a queue, a spreadsheet, a Slack or Discord formatter, or a summarization step.
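
As one illustration of the hand-off to a chat formatter, the sketch below turns a normalized metadata object into a plain-text link card. The card layout, the 200-character description limit, and the helper names truncate and formatLinkCard are assumptions made for this example; they are not part of metascraper or of any particular chat platform's API.

```javascript
// Sketch only: a hypothetical formatter for a metadata object with the
// fields listed in this entry (title, description, author, date, publisher, url).

// Shortens long text and marks the cut with an ellipsis.
function truncate(text, max) {
  if (!text) return '';
  return text.length <= max ? text : text.slice(0, max - 1).trimEnd() + '…';
}

// Builds a compact, multi-line text card. Missing fields are skipped
// rather than rendered as empty lines.
function formatLinkCard(metadata) {
  const { title, description, author, publisher, date, url } = metadata;

  const byline = [author, publisher].filter(Boolean).join(', ');

  // Keep only the calendar day (UTC); ignore values that do not parse as dates.
  const parsed = date ? new Date(date) : null;
  const day = parsed && !Number.isNaN(parsed.getTime())
    ? parsed.toISOString().slice(0, 10)
    : '';

  const lines = [
    title || '(untitled)',
    [byline, day].filter(Boolean).join(' | '),
    truncate(description, 200),
    url,
  ];
  return lines.filter(Boolean).join('\n');
}
```

The same object could instead be mapped to a spreadsheet row or a CMS draft; the point of the sketch is only that the formatter reads fixed field names and never touches raw HTML.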

The scope boundary matters. metascraper is not a crawler, a CMS, an analytics platform, or a full SEO suite. It does not publish content, rewrite copy, or audit an entire site. Its role is focused extraction and normalization for one page at a time, which is exactly why it fits ASE as a skill-shaped, operator-friendly job-to-be-done instead of a generic product card.