Skill Detail

ScrapyCloud Job Manager

Manages Scrapy spider deployments and job scheduling on ScrapyCloud via the Scrapinghub API. Handles spider argument injection, job prioritization, and item export to S3 or BigQuery.

Research & ScrapingChatGPT Agents
Research & Scraping ChatGPT Agents Security Reviewed
Tool match: scrapy
INSTALL WITH ANY AGENT
npx skills add agentskillexchange/skills --skill scrapycloud-job-manager Copy
Works best when you want a reusable capability, not another fragile one-off prompt.
At a glance
Author
Zyte
Last updated
Mar 24, 2026
Quick brief

The ScrapyCloud Job Manager skill automates the deployment and scheduling of Scrapy spiders on the ScrapyCloud (Zyte) platform. It uses the Scrapinghub API client (python-scrapinghub) for programmatic spider management, job scheduling, and data retrieval.

How it works

What this skill actually does

Core functionality includes spider deployment via shub CLI integration, job scheduling with cron-like periodic execution, and argument injection for parameterized spider runs. The skill manages job priorities, concurrency slots, and resource allocation across project units.

Data pipeline features include automatic item export to external storage (Amazon S3, Google BigQuery, Azure Blob Storage) via the Collections API and feed exports. The skill monitors job states, handles log retrieval for debugging, and implements automatic retry logic for failed jobs with configurable backoff strategies.

Advanced capabilities include AutoThrottle configuration for polite crawling, Crawlera (Zyte Smart Proxy) integration for anti-ban measures, and spider versioning with rollback support through the Dash deployment system.