brandicted/scrapy-webdriver

☆143

Alternatives and similar repositories for scrapy-webdriver

Users who are interested in scrapy-webdriver are comparing it to the libraries listed below.

flisky / scrapy-phantomjs-downloader
PhantomJS Downloader for Scrapy, Yeah!
☆93 · Aug 11, 2014 · Updated 11 years ago
rmax / scrapy-boilerplate
Small set of utilities to simplify writing Scrapy spiders.
☆50 · Jul 24, 2015 · Updated 11 years ago
voliveirajr / seleniumcrawler
An example using Selenium webdrivers for python and Scrapy framework to create a web scraper to crawl an ASP site
☆128 · Feb 28, 2019 · Updated 7 years ago
scrapy-plugins / scrapy-splash
Scrapy+Splash for JavaScript integration
☆3,229 · Feb 11, 2025 · Updated last year
snowball-one / django-oscar-support
Customer services and ticketing plugin for Oscar
☆20 · Feb 10, 2018 · Updated 8 years ago
ramses-tech / ramses-example
Example of a Pyramid app that uses ramses
☆18 · Jun 6, 2016 · Updated 10 years ago
immzz / zhihu-scrapy
A scrapy zhihu crawler
☆77 · Nov 6, 2018 · Updated 7 years ago
scrapy / scrapely
A pure-python HTML screen-scraping library
☆1,884 · Apr 4, 2022 · Updated 4 years ago
scrapinghub / kafka-scanner
High Level Kafka Scanner
☆19 · Sep 29, 2017 · Updated 8 years ago
ramses-tech / ra
Ra is a test suite generator and helper library for testing APIs described in RAML
☆15 · Oct 5, 2017 · Updated 8 years ago
twaddington / python-rdio-export
A tool for exporting an Rdio collection to a portable file format like markdown.
☆11 · Sep 4, 2016 · Updated 9 years ago
scrapinghub / scrapylib
Collection of Scrapy utilities (extensions, middlewares, pipelines, etc)
☆33 · Feb 22, 2018 · Updated 8 years ago
cnu / scrapy-random-useragent
Scrapy Middleware to set a random User-Agent for every Request.
☆201 · Aug 16, 2019 · Updated 6 years ago
wuchong / scrapy-dynamic-configurable
A dynamic configurable news crawler based Scrapy
☆164 · Jul 24, 2017 · Updated 9 years ago
scrapy-plugins / scrapy-querycleaner
Scrapy spider middleware to clean up query parameters in request URLs
☆24 · Jun 30, 2016 · Updated 10 years ago
scrapinghub / docker-images
☆33 · Oct 20, 2025 · Updated 9 months ago
istresearch / scrapy-cluster
This Scrapy project uses Redis and Kafka to create a distributed on demand scraping cluster.
☆1,226 · Nov 7, 2023 · Updated 2 years ago
habnabit / txsocksx
SOCKS{4,4a,5} endpoints for twisted
☆61 · Feb 29, 2020 · Updated 6 years ago
scrapinghub / page_finder
Find which links on a web page are pagination links
☆29 · Jan 12, 2017 · Updated 9 years ago
scrapinghub / frontera
A scalable frontier for web crawlers
☆1,332 · Jun 6, 2025 · Updated last year
aivarsk / scrapy-proxies
Random proxy middleware for Scrapy
☆1,669 · Oct 1, 2019 · Updated 6 years ago
TeamHG-Memex / arachnado
Web Crawling UI and HTTP API, based on Scrapy and Tornado
☆162 · Apr 8, 2026 · Updated 3 months ago
scrapinghub / scrapy-mosquitera
Restrict crawl and scraping scope using matchers.
☆26 · Jun 8, 2016 · Updated 10 years ago
atbaker / docker-workshop
A two-hour Docker workshop for DockerDC
☆15 · Apr 10, 2015 · Updated 11 years ago
holgerd77 / django-dynamic-scraper
Creating Scrapy scrapers via the Django admin interface
☆1,158 · Feb 19, 2022 · Updated 4 years ago
magopian / django-inspect-model
Model inspection for Django
☆29 · Mar 8, 2018 · Updated 8 years ago
scrapy / xtractmime
https://mimesniff.spec.whatwg.org/ implementation for Python
☆13 · Jul 9, 2026 · Updated 2 weeks ago
ramses-tech / nefertari
Nefertari is a REST API framework sitting on top of Pyramid and ElasticSearch
☆53 · Mar 29, 2020 · Updated 6 years ago
scrapy-plugins / scrapy-djangoitem
Scrapy extension to write scraped items using Django models
☆502 · Oct 15, 2023 · Updated 2 years ago
scrapinghub / scrapyrt
HTTP API for Scrapy spiders
☆882 · Jun 29, 2026 · Updated 3 weeks ago
tetframework / Tonnikala
Python templating engine - the one ton solution
☆14 · Jan 29, 2026 · Updated 5 months ago
morcilab / uml2raml
UML to RAML generator for MDE toolchains
☆12 · Jun 19, 2018 · Updated 8 years ago
scrapinghub / webstruct
NER toolkit for HTML data
☆259 · May 3, 2024 · Updated 2 years ago
TeamHG-Memex / scrapy-dockerhub
[UNMAINTAINED] Deploy, run and monitor your Scrapy spiders.
☆12 · Apr 8, 2026 · Updated 3 months ago
wavii / listy-django-cache
A deterministic list cache for Django
☆15 · Aug 22, 2011 · Updated 14 years ago
scrapinghub / exporters
Exporters is an extensible export pipeline library that supports filter, transform and several sources and destinations
☆39 · May 21, 2024 · Updated 2 years ago
julien-duponchelle / scrapy-elasticsearch
A scrapy pipeline which send items to Elastic Search server
☆97 · Jan 2, 2018 · Updated 8 years ago
droundy / visual-hash
Python package for creating visual hashes of data.
☆12 · Apr 20, 2015 · Updated 11 years ago
bluedazzle / multithreading-spider
a simple demo use threading and queue get proxies from proxy sites
☆17 · Mar 29, 2016 · Updated 10 years ago