Skill Detail

Scrapy Spider Generator

Generates production-ready Scrapy spiders with middleware configuration and item pipeline setup. Uses the Scrapy Framework API, Selector (XPath/CSS), and Twisted reactor for concurrent crawling.

Research & Scraping ChatGPT Agents Security Reviewed
Tool match: scrapy ⭐ 61.3k GitHub stars BSD-3-Clause license
INSTALL WITH ANY AGENT
npx skills add agentskillexchange/skills --skill scrapy-spider-generator
Works best when you want a reusable capability, not another fragile one-off prompt.
At a glance
Author: scrapy
Last updated: Mar 24, 2026
Quick brief

The Scrapy Spider Generator creates production-grade web scraping spiders using the Scrapy framework. It generates Spider classes with properly configured start_urls, parse methods, and Item definitions with Field declarations and ItemLoader processors.

How it works

What this skill actually does

The agent builds comprehensive Scrapy projects with settings.py configuration for CONCURRENT_REQUESTS, DOWNLOAD_DELAY, and AUTOTHROTTLE settings. It generates custom Downloader Middleware for request fingerprinting, proxy rotation via scrapy-rotating-proxies, and user-agent randomization using scrapy-fake-useragent.
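
A settings.py fragment along these lines might look like the sketch below. All numeric values and the bot name are hypothetical starting points, not recommendations from the skill. The middleware paths and priorities for scrapy-rotating-proxies and scrapy-fake-useragent are assumptions based on those packages' READMEs and should be verified against the installed versions.

```python
# settings.py (fragment): illustrative values only

BOT_NAME = "example_bot"  # hypothetical project name

# Global and per-domain concurrency limits.
CONCURRENT_REQUESTS = 16
CONCURRENT_REQUESTS_PER_DOMAIN = 4

# Base delay in seconds between requests to the same site.
DOWNLOAD_DELAY = 0.5

# AutoThrottle adjusts the delay dynamically based on observed latency.
AUTOTHROTTLE_ENABLED = True
AUTOTHROTTLE_START_DELAY = 1.0
AUTOTHROTTLE_MAX_DELAY = 30.0
AUTOTHROTTLE_TARGET_CONCURRENCY = 2.0

# Proxy pool for scrapy-rotating-proxies (placeholder addresses).
ROTATING_PROXY_LIST = [
    "proxy1.example.com:8000",
    "proxy2.example.com:8031",
]

DOWNLOADER_MIDDLEWARES = {
    # Disable the built-in user-agent middleware so the random one takes over.
    "scrapy.downloadermiddlewares.useragent.UserAgentMiddleware": None,
    # Assumed paths; check each package's documentation before use.
    "scrapy_fake_useragent.middleware.RandomUserAgentMiddleware": 400,
    "rotating_proxies.middlewares.RotatingProxyMiddleware": 610,
    "rotating_proxies.middlewares.BanDetectionMiddleware": 620,
}
```

With AutoThrottle enabled, DOWNLOAD_DELAY acts as the lower bound on the delay, so the two settings work together rather than competing.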

Key features include CrawlSpider generation with Rule and LinkExtractor definitions for automated link following, SitemapSpider configuration for XML sitemap-based crawling, and Feed Export setup for JSON Lines and CSV, with database output through Django models via scrapy-djangoitem. The agent also configures Item Pipelines for data validation and for export to Elasticsearch, MongoDB, or PostgreSQL, and uses scrapy-deltafetch to skip pages already scraped in earlier runs. Splash integration via the scrapy-splash middleware is supported for JavaScript-rendered content.