scrapinghub/python-scrapinghub

scrapinghub / python-scrapinghub

A client interface for Scrapinghub's API

☆206

Alternatives and similar repositories for python-scrapinghub

Users that are interested in python-scrapinghub are comparing it to the libraries listed below.

scrapinghub / shub
Scrapinghub Command Line Client
☆129 · Updated this week
scrapinghub / scrapinghub-entrypoint-scrapy
Scrapy entrypoint for Scrapinghub job runner
☆24 · Feb 26, 2026 · Updated 4 months ago
zytedata / zyte-autoextract
Python clients for Zyte AutoExtract API
☆41 · Jan 17, 2022 · Updated 4 years ago
scrapinghub / python-hubstorage
Deprecated HubStorage client library - please use python-scrapinghub>=1.9.0 instead
☆16 · Dec 5, 2016 · Updated 9 years ago
scrapinghub / arche
Analyze scraped data
☆47 · Dec 9, 2019 · Updated 6 years ago
scrapinghub / scrapinghub-stack-scrapy
Software stack with latest Scrapy and updated deps
☆64 · Jul 8, 2026 · Updated last week
realslimshanky / Spider-Sense
A browser extension to monitor your spiders deployed on Scrapy Cloud.
☆16 · Mar 8, 2025 · Updated last year
scrapy-plugins / scrapy-zyte-api
Zyte API integration for Scrapy
☆43 · Jun 26, 2026 · Updated 3 weeks ago
scrapinghub / sample-projects
Sample projects showcasing Scrapinghub tech
☆137 · Feb 14, 2024 · Updated 2 years ago
ejulio / spider-feeder
A library to make it easier to load input URLs to start scrapy processes
☆14 · Feb 21, 2021 · Updated 5 years ago
scrapinghub / frontera
A scalable frontier for web crawlers
☆1,332 · Jun 6, 2025 · Updated last year
scrapinghub / scrapyrt
HTTP API for Scrapy spiders
☆882 · Jun 29, 2026 · Updated 3 weeks ago
scrapinghub / kafka-scanner
High Level Kafka Scanner
☆19 · Sep 29, 2017 · Updated 8 years ago
scrapinghub / docker-images
☆33 · Oct 20, 2025 · Updated 9 months ago
scrapy / w3lib
Python library of web-related functions
☆419 · Updated this week
scrapy-plugins / scrapy-dotpersistence
A scrapy extension to sync `.scrapy` folder to an S3 bucket
☆18 · Mar 28, 2022 · Updated 4 years ago
scrapy / itemloaders
Library to populate items using XPath and CSS with a convenient API
☆49 · Updated this week
scrapinghub / exporters
Exporters is an extensible export pipeline library that supports filter, transform and several sources and destinations
☆39 · May 21, 2024 · Updated 2 years ago
scrapinghub / crawlera-tools
Crawlera tools
☆26 · Feb 9, 2016 · Updated 10 years ago
scrapinghub / webstruct
NER toolkit for HTML data
☆259 · May 3, 2024 · Updated 2 years ago
klynch / python-logstash-handler
Ships logs to logstash
☆12 · May 30, 2015 · Updated 11 years ago
scrapinghub / splash
Lightweight, scriptable browser as a service with an HTTP API
☆4,190 · Aug 2, 2024 · Updated last year
scrapinghub / scrapy-poet
Page Object pattern for Scrapy
☆127 · Jun 8, 2026 · Updated last month
scrapinghub / scrapy-training
Scrapy Training companion code
☆173 · Jan 30, 2019 · Updated 7 years ago
scrapinghub / dateparser
python parser for human readable dates
☆2,843 · Updated this week
TeamHG-Memex / undercrawler
A generic crawler
☆81 · Apr 8, 2026 · Updated 3 months ago
scrapinghub / andi
Library for annotation-based dependency injection
☆24 · Updated this week
scrapy / parsel
Parsel lets you extract data from XML/HTML documents using XPath or CSS selectors
☆1,342 · Updated this week
TeamHG-Memex / scrapy-dockerhub
[UNMAINTAINED] Deploy, run and monitor your Scrapy spiders.
☆12 · Apr 8, 2026 · Updated 3 months ago
scrapy-plugins / scrapy-splash
Scrapy+Splash for JavaScript integration
☆3,229 · Feb 11, 2025 · Updated last year
dmclain / scrapy-heroku
☆68 · Sep 7, 2018 · Updated 7 years ago
TeamHG-Memex / MaybeDont
A component that tries to avoid downloading duplicate content
☆28 · Apr 8, 2026 · Updated 3 months ago
scrapinghub / aduana
Frontera backend to guide a crawl using PageRank, HITS or other ranking algorithms based on the link structure of the web graph, even whe…
☆54 · May 21, 2024 · Updated 2 years ago
rmax / scrapy-inline-requests
A decorator to write coroutine-like spider callbacks.
☆109 · Dec 26, 2022 · Updated 3 years ago
scrapy / scrapyd
A service daemon to run Scrapy spiders
☆3,097 · Updated this week
scrapinghub / scmongo
MongoDB extensions for Scrapy
☆44 · Oct 2, 2014 · Updated 11 years ago
povilasb / scrapy-html-storage
Scrapy downloader middleware that stores response HTMLs to disk.
☆18 · Apr 14, 2026 · Updated 3 months ago
zytedata / zyte-smartproxy-headless-proxy
A complimentary proxy to help to use SPM with headless browsers
☆109 · May 20, 2026 · Updated 2 months ago
scrapy-plugins / scrapy-jsonrpc
Scrapy extension to control spiders using JSON-RPC
☆299 · Aug 26, 2019 · Updated 6 years ago