By zytedata
Build and deploy Scrapy spiders with web-poet page objects through an end-to-end workflow: explore websites, define extraction schemas from HTML and JSON-LD, generate page objects and wiring code, validate with previews, and deploy to Scrapy Cloud.
Add an empty web-poet page object to a Scrapy project
Extract all available fields with values from a detail page
Analyze an HTML page to produce field extraction instructions for code generation
Generate web-poet page object code from per-page extraction analyses
Generate web-poet page object code from an extraction spec
Own this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimOwn this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimBased on adoption, maintenance, documentation, and repository signals. Not a security audit or endorsement.
From a plain-English prompt to a working Scrapy spider.
claude plugin marketplace add zytedata/claude-skills
claude plugin install zyte-web-data@zyte-ai
If Claude Code is already running, reload plugins in the active session:
/reload-plugins
See also: Discovering and installing plugins
This is Zyte's official Claude Code plugin that generates production-ready Scrapy spiders with web-poet page objects from a plain-English prompt. Give it a URL and describe what you want to extract. It handles site exploration, schema discovery, code generation, and smoke testing: no boilerplate, no manual selector hunting.
The plugin explores the target site, discovers available fields, and presents a schema for your approval before generating a single line of code. After you confirm the schema, it creates a Scrapy project with all dependencies configured, generates web-poet page objects and test fixtures, wires up the spider, and runs a smoke test to verify that extraction is working before handing the project back to you.
Optionally, use /scrape-scrapy-cloud to deploy directly to Scrapy Cloud for scheduled runs, job history, and monitoring. A free tier is available.
The /scrape skill works on any website with repeating structured content: detail pages linked from a listing or category page. Examples from the skill:
The /scrape skill orchestrates five stages automatically:
1. Decide which fields to extract → /scrape-define
2. Analyze the website → /scrape-spec
3. Create the Scrapy project → /scrape-ensure-project
4. Generate the extraction code → /scrape-codegen
5. Generate the spider → /scrape-create-spider
Each stage feeds directly into the next. When the pipeline completes, you have a runnable spider and a passing test suite:
uv run scrapy crawl <spider_name>
uv run pytest fixtures/
| Skill | Description |
|---|---|
scrape | End-to-end web scraping workflow — from URL to working spider with web-poet page objects |
/scrape)| Skill | Description |
|---|---|
scrape-define | Quick schema definition: explore one detail page, discover fields, fast approval loop |
scrape-spec | Explore diverse pages and validate the extraction spec: downloads pages, compares variants, optional browser review |
scrape-explore-site | Explore a website to find and save diverse pages (start, list, detail) with classified links |
scrape-analyze-page | Extract all available fields with values from a detail page |
scrape-ensure-project | Ensure a Scrapy project exists with scrapy-poet and Zyte API support |
scrape-codegen | Generate web-poet page object code from an extraction spec |
scrape-codegen-analyze | Analyze an HTML page to produce field extraction instructions for code generation |
scrape-codegen-generate | Generate web-poet page object code from per-page extraction analyses |
scrape-create-spider | Generate a Scrapy spider that wires page objects together |
| Skill | Description |
|---|---|
scrape-add-page-object | Add an empty web-poet page object to a Scrapy project |
scrape-review-schema | Generate an HTML review page for schema and extracted data verification |
| Skill | Description |
|---|---|
scrape-scrapy-cloud | Deploy projects, schedule spiders, list/stop jobs, and view items or logs on Scrapy Cloud |
scrape-zyte-login | Set up your Zyte account and credentials |
uv — used to create and manage the Scrapy projectProject dependencies (scrapy, scrapy-poet, scrapy-zyte-api, web-poet, extruct, price-parser, pytest) are installed automatically by the skills.
Any scraping prompt triggers the skill automatically. For example:
/scrape https://books.toscrape.com/ products
Zyte Web Data for Claude Code
npx claudepluginhub zytedata/claude-skills --plugin zyte-web-dataComprehensive skill pack with 66 specialized skills for full-stack developers: 12 language experts (Python, TypeScript, Go, Rust, C++, Swift, Kotlin, C#, PHP, Java, SQL, JavaScript), 10 backend frameworks, 6 frontend/mobile, plus infrastructure, DevOps, security, and testing. Features progressive disclosure architecture for 50% faster loading.
A growing collection of Claude-compatible academic workflow bundles. Covers scientific figures, manuscript writing and polishing, reviewer assessment, citation retrieval, data availability, paper reading, literature search, response letters, paper-to-PPTX conversion, and evidence-grounded Chinese invention patent drafting. Rules are organized as reusable skill folders with explicit workflows and quality checks.
Intelligent draw.io diagramming plugin with AI-powered diagram generation, multi-platform embedding (GitHub, Confluence, Azure DevOps, Notion, Teams, Harness), conditional formatting, live data binding, and MCP server integration for programmatic diagram creation and management.
Persistent file-based planning for AI coding agents. Crash-proof markdown plans (task_plan.md, findings.md, progress.md) that survive context loss and /clear, with an opt-in completion gate and multi-agent shared state. Manus-style. Works with Claude Code, Codex CLI, Cursor, Kiro, OpenCode and 60+ agents via the SKILL.md standard. Includes Arabic, German, Spanish, and Chinese (Simplified and Traditional).
Complete creative writing suite with 10 specialized agents covering the full writing process: research gathering, character development, story architecture, world-building, dialogue coaching, editing/review, outlining, content strategy, believability auditing, and prose style/voice analysis. Includes genre-specific guides, templates, and quality checklists.
Payload Development plugin - covers collections, fields, hooks, access control, plugins, and database adapters.