Scrapy middleware to handle javascript pages using selenium
β958Jul 8, 2024Updated last year
Alternatives and similar repositories for scrapy-selenium
Users that are interested in scrapy-selenium are comparing it to the libraries listed below
Sorting:
- Scrapy+Splash for JavaScript integrationβ3,241Feb 11, 2025Updated last year
- π Playwright integration for Scrapyβ1,368Jan 21, 2026Updated last month
- Random User-Agent middleware based on fake-useragentβ690Sep 18, 2023Updated 2 years ago
- Scrapy + Puppeteerβ110Jun 11, 2021Updated 4 years ago
- A service daemon to run Scrapy spidersβ3,085Jan 16, 2026Updated last month
- use multiple proxies with Scrapyβ772Feb 10, 2026Updated 3 weeks ago
- Lightweight, scriptable browser as a service with an HTTP APIβ4,199Aug 2, 2024Updated last year
- Web app for Scrapyd cluster management, Scrapy log analysis & visualization, Auto packaging, Timer tasks, Monitor & Alert, and Mobile UI.β¦β3,401Feb 19, 2025Updated last year
- HTTP API for Scrapy spidersβ879Feb 16, 2026Updated 2 weeks ago
- Random proxy middleware for Scrapyβ1,672Oct 1, 2019Updated 6 years ago
- Command line client for Scrapyd serverβ778Dec 15, 2025Updated 2 months ago
- This Scrapy project uses Redis and Kafka to create a distributed on demand scraping cluster.β1,231Nov 7, 2023Updated 2 years ago
- A daemon for scheduling Scrapy spidersβ66May 28, 2021Updated 4 years ago
- A curated list of awesome packages, articles, and other cool resources from the Scrapy community.β556Dec 28, 2022Updated 3 years ago
- Scrapy Extension for monitoring spiders execution.β553Updated this week
- Redis-based components for Scrapy.β5,643Jul 6, 2024Updated last year
- Distributed Crawler Management Framework Based on Scrapy, Scrapyd, Django and Vue.jsβ3,500Oct 29, 2024Updated last year
- Scrapy, a fast high-level web crawling & scraping framework for Python.β60,007Feb 23, 2026Updated last week
- β29Apr 28, 2021Updated 4 years ago
- MongoDB pipeline for Scrapy. This module supports both MongoDB in standalone setups and replica sets. scrapy-mongodb will insert the itemβ¦β358Apr 6, 2021Updated 4 years ago
- Scrapy schema validation pipeline and Item builder using JSON Schemaβ45Mar 26, 2021Updated 4 years ago
- Visual scraping for Scrapyβ9,493Jun 26, 2024Updated last year
- Scrapy extension to control spiders using JSON-RPCβ300Aug 26, 2019Updated 6 years ago
- admin ui for scrapy/open source scrapinghubβ2,777May 4, 2023Updated 2 years ago
- Page Object pattern for Scrapyβ127Jan 28, 2026Updated last month
- Scrapy spider middleware to ignore requests to pages containing items seen in previous crawlsβ277Feb 26, 2025Updated last year
- Scrapy middleware to add extra fields to items, like timestamp, response fields, spider attributes etc.β57Mar 16, 2022Updated 3 years ago
- Splash + HAProxy + Docker Composeβ195Feb 10, 2026Updated 3 weeks ago
- β167Mar 4, 2020Updated 5 years ago
- Zyte Smart Proxy Manager (formerly Crawlera) middleware for Scrapyβ368Mar 24, 2025Updated 11 months ago
- Web scraping Page Objects core libraryβ104Jan 27, 2026Updated last month
- Scrapoxy has been discontinued.β2,432Feb 7, 2026Updated 3 weeks ago
- A scalable frontier for web crawlersβ1,328Jun 6, 2025Updated 8 months ago
- Extends Selenium's Python bindings to give you the ability to inspect requests made by the browser.β1,979Jan 3, 2024Updated 2 years ago
- A downloader middleware to change user-agent of scrapyβ21Jun 1, 2019Updated 6 years ago
- Scrapy middleware which allows to crawl only new contentβ79Feb 10, 2026Updated 3 weeks ago
- A scrapy pipeline which send items to Elastic Search serverβ320Jun 10, 2022Updated 3 years ago
- Python clients for Zyte AutoExtract APIβ41Jan 17, 2022Updated 4 years ago
- A decorator to write coroutine-like spider callbacks.β109Dec 26, 2022Updated 3 years ago