Skill Detail

ScrapyCloud Job Manager

Manages Scrapy spider deployments and job scheduling on ScrapyCloud via the Scrapinghub API. Handles spider argument injection, job prioritization, and item export to S3 or BigQuery.

Research & ScrapingChatGPT Agents

Research & Scraping ChatGPT Agents Security Reviewed

Tool match: scrapy

INSTALL WITH ANY AGENT

npx skills add agentskillexchange/skills --skill scrapycloud-job-manager Copy

Works best when you want a reusable capability, not another fragile one-off prompt.

View source Documentation

At a glance

Author

Zyte

Last updated

Mar 24, 2026

Quick brief

The ScrapyCloud Job Manager skill automates the deployment and scheduling of Scrapy spiders on the ScrapyCloud (Zyte) platform. It uses the Scrapinghub API client (python-scrapinghub) for programmatic spider management, job scheduling, and data retrieval.

How it works

What this skill actually does

Core functionality includes spider deployment via shub CLI integration, job scheduling with cron-like periodic execution, and argument injection for parameterized spider runs. The skill manages job priorities, concurrency slots, and resource allocation across project units.

Data pipeline features include automatic item export to external storage (Amazon S3, Google BigQuery, Azure Blob Storage) via the Collections API and feed exports. The skill monitors job states, handles log retrieval for debugging, and implements automatic retry logic for failed jobs with configurable backoff strategies.

Advanced capabilities include AutoThrottle configuration for polite crawling, Crawlera (Zyte Smart Proxy) integration for anti-ban measures, and spider versioning with rollback support through the Dash deployment system.

Best fit

When to reach for it

Best when the job fits Research & Scraping.
Works naturally with ChatGPT Agents setups.

Trust & provenance

Why this listing is credible

Built around the scrapy toolchain.
Trust status: Security Reviewed.
Last updated Mar 24, 2026.

View source ↗ Documentation ↗