
Maxun No-Code Web Data Extraction Platform

Maxun is an open-source no-code web data platform for turning websites into structured, reliable data. It supports extraction via recorder mode and LLM-powered natural language mode, plus crawling, scraping, and search capabilities. It has 15,000+ GitHub stars, offers both SDK and CLI interfaces, and handles everything from simple page scrapes to complex automated workflows.

Data Extraction & Transformation · Custom Agents · Security Reviewed

Tool match: maxun ⭐ 15.3k GitHub stars

INSTALL WITH ANY AGENT

npx skills add agentskillexchange/skills --skill maxun-no-code-web-data-extraction

Works best when you want a reusable capability, not another fragile one-off prompt.

View source

At a glance

Last updated: Mar 26, 2026

Quick brief

Maxun is an open-source no-code platform for web scraping, crawling, search, and AI-powered data extraction. It lets users turn websites into structured APIs and datasets without writing code, using either a visual recorder that captures browser interactions or an LLM-powered extraction mode where you describe what you want in natural language.

How it works

What this skill actually does

Maxun operates through four types of robots, each designed for a different data collection job: Extract, Scrape, Crawl, and Search.

Extract robots emulate real user behavior to capture structured data from websites. In Recorder Mode, you browse a site while Maxun records your actions and turns them into a reusable extraction robot. In AI Mode, you describe the data you want in plain language and an LLM-powered engine handles the rest: identifying elements, navigating pagination, and structuring output.

Scrape robots convert full webpages into clean Markdown or HTML and capture screenshots.

Crawl robots traverse entire websites, following links and extracting content from every relevant page, with configurable scope controls.

Search robots provide the platform's web search capability.
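The Extract, Scrape, and Crawl robots described above can be modeled as a small discriminated union. This is a minimal sketch for reasoning about which robot produces which kind of output; the type names, field names, and the describeOutput helper are illustrative assumptions and are not taken from the Maxun SDK.

```typescript
// Hypothetical model of the robot types described in this listing.
// None of these names come from the Maxun SDK.
type ExtractMode = "recorder" | "ai";
type ScrapeFormat = "markdown" | "html" | "screenshot";

type RobotSpec =
  | { kind: "extract"; mode: ExtractMode; url: string; prompt?: string }
  | { kind: "scrape"; url: string; formats: ScrapeFormat[] }
  | { kind: "crawl"; url: string; maxPages: number };

// Summarize what a given robot spec is expected to produce.
function describeOutput(spec: RobotSpec): string {
  switch (spec.kind) {
    case "extract":
      return spec.mode === "ai"
        ? "structured data from a natural-language prompt"
        : "structured data from recorded actions";
    case "scrape":
      return `page content as ${spec.formats.join(", ")}`;
    case "crawl":
      return `content from up to ${spec.maxPages} linked pages`;
  }
}
```

Here maxPages stands in for the "configurable scope controls" mentioned for crawl robots; the real controls may look quite different.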

SDK and CLI

Beyond the visual interface, Maxun provides a complete developer SDK (available as an npm package) for programmatic extraction, scheduling, and robot management. The CLI lets you create robots, trigger runs, and retrieve extracted data directly from the terminal, making it suitable for integration into agent workflows and automation pipelines. Agents can create extraction robots via the SDK, schedule recurring runs, and retrieve structured JSON results through the API.
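The agent workflow described above (create a robot, schedule recurring runs, retrieve structured results) might be sketched as follows. The RobotClient interface and its method names are hypothetical placeholders, not the actual Maxun SDK API, and the in-memory implementation exists only so the sketch runs without a server.

```typescript
// Hypothetical client interface: these method names are illustrative
// assumptions and are NOT the Maxun SDK's actual API.
interface RobotClient {
  createRobot(name: string, url: string): Promise<string>; // resolves to a robot id
  scheduleRun(robotId: string, cron: string): Promise<void>;
  latestResult(robotId: string): Promise<Array<Record<string, unknown>>>;
}

// In-memory stand-in so the sketch runs without any network access.
class InMemoryClient implements RobotClient {
  private readonly robots = new Map<string, { name: string; url: string }>();
  readonly schedules = new Map<string, string>();

  async createRobot(name: string, url: string): Promise<string> {
    const id = `robot-${this.robots.size + 1}`;
    this.robots.set(id, { name, url });
    return id;
  }

  async scheduleRun(robotId: string, cron: string): Promise<void> {
    if (!this.robots.has(robotId)) throw new Error(`unknown robot: ${robotId}`);
    this.schedules.set(robotId, cron);
  }

  async latestResult(robotId: string): Promise<Array<Record<string, unknown>>> {
    const robot = this.robots.get(robotId);
    if (!robot) throw new Error(`unknown robot: ${robotId}`);
    // A real run would return extracted rows; this stub only echoes the source URL.
    return [{ source: robot.url }];
  }
}

// The three steps an agent would perform, in order.
async function setUpRecurringExtraction(client: RobotClient, url: string) {
  const robotId = await client.createRobot("example-robot", url);
  await client.scheduleRun(robotId, "0 * * * *"); // hourly, in cron syntax
  const rows = await client.latestResult(robotId);
  return { robotId, rows };
}
```

Coding the workflow against an interface keeps the agent logic testable; a real implementation would wrap whatever calls the SDK or REST API actually exposes.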

Output and Integration

Maxun outputs structured JSON data, clean Markdown, HTML content, and screenshots depending on the robot type. It handles common web complexities including pagination, infinite scrolling, dynamic content loading, cookie consent dialogs, and anti-bot detection. The platform supports scheduling for recurring extraction jobs, webhook notifications for completed runs, and proxy configuration for geographic targeting. Data can be exported to Google Sheets, databases, or consumed via the REST API.

Deployment

Maxun runs locally via Docker Compose or without Docker, and also offers a hosted cloud version. The self-hosted option gives full control over data flow and privacy. Configuration is managed through environment variables for database connections, browser settings, LLM API keys, and proxy configuration.
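For a self-hosted setup, the four configuration areas named above could be read and validated in one place. The sketch below is only an illustration: the EXAMPLE_* variable names and the DeploymentConfig shape are invented for this example and are not Maxun's real environment variables, which should be taken from the project's own documentation.

```typescript
// Hypothetical configuration shape covering the areas listed above.
interface DeploymentConfig {
  databaseUrl: string;
  browserHeadless: boolean;
  llmApiKey: string | undefined;
  proxyUrl: string | undefined;
}

// Read settings from an environment-like object.
// The EXAMPLE_* names are placeholders, not Maxun's actual variable names.
function loadConfig(env: Record<string, string | undefined>): DeploymentConfig {
  const databaseUrl = env["EXAMPLE_DATABASE_URL"];
  if (!databaseUrl) {
    throw new Error("EXAMPLE_DATABASE_URL is required");
  }
  return {
    databaseUrl,
    // Headless unless explicitly disabled.
    browserHeadless: env["EXAMPLE_BROWSER_HEADLESS"] !== "false",
    // Optional: only needed for LLM-powered extraction.
    llmApiKey: env["EXAMPLE_LLM_API_KEY"],
    // Optional: only needed when routing traffic through a proxy.
    proxyUrl: env["EXAMPLE_PROXY_URL"],
  };
}
```

In a Node.js process this would typically be called as loadConfig(process.env), so that a missing required setting fails at startup instead of partway through a run.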

Best fit

When to reach for it

Best when the job fits Data Extraction & Transformation.
Works naturally with Custom Agents setups.

Trust & provenance

Why this listing is credible

Built around the maxun toolchain.
Trust status: Security Reviewed.
15.3k GitHub stars on the linked upstream source.
Last updated Mar 26, 2026.
