A client interface for Scrapinghub's API
☆204Oct 3, 2025Updated 5 months ago
Alternatives and similar repositories for python-scrapinghub
Users that are interested in python-scrapinghub are comparing it to the libraries listed below
Sorting:
- Scrapinghub Command Line Client☆131Nov 6, 2025Updated 4 months ago
- Scrapy entrypoint for Scrapinghub job runner☆25Feb 26, 2026Updated 3 weeks ago
- Python clients for Zyte AutoExtract API☆41Jan 17, 2022Updated 4 years ago
- Analyze scraped data☆46Dec 9, 2019Updated 6 years ago
- Software stack with latest Scrapy and updated deps☆64Mar 1, 2026Updated 2 weeks ago
- A browser extension to monitor your spiders deployed on Scrapy Cloud.☆16Mar 8, 2025Updated last year
- Sample projects showcasing Scrapinghub tech☆137Feb 14, 2024Updated 2 years ago
- Extract embedded metadata from HTML markup☆956Oct 1, 2025Updated 5 months ago
- A scalable frontier for web crawlers☆1,330Jun 6, 2025Updated 9 months ago
- HTTP API for Scrapy spiders☆881Feb 16, 2026Updated last month
- High Level Kafka Scanner☆19Sep 29, 2017Updated 8 years ago
- ☆33Oct 20, 2025Updated 5 months ago
- Python library of web-related functions☆418Feb 19, 2026Updated last month
- Extensions for using Scrapy on Amazon AWS☆32Dec 5, 2012Updated 13 years ago
- Library to populate items using XPath and CSS with a convenient API☆48Jan 29, 2026Updated last month
- Crawlera tools☆26Feb 9, 2016Updated 10 years ago
- Skinfer is a tool for inferring and merging JSON schemas☆141Apr 24, 2024Updated last year
- Exporters is an extensible export pipeline library that supports filter, transform and several sources and destinations☆40May 21, 2024Updated last year
- Zyte Automatic Extraction integration for Scrapy☆56Feb 4, 2022Updated 4 years ago
- Ships logs to logstash☆12May 30, 2015Updated 10 years ago
- Lightweight, scriptable browser as a service with an HTTP API☆4,202Aug 2, 2024Updated last year
- Page Object pattern for Scrapy☆127Updated this week
- Scrapy Training companion code☆173Jan 30, 2019Updated 7 years ago
- python parser for human readable dates☆2,794Mar 2, 2026Updated 2 weeks ago
- Library for annotation-based dependency injection☆24Mar 3, 2026Updated 2 weeks ago
- Parsel lets you extract data from XML/HTML documents using XPath or CSS selectors☆1,320Jan 29, 2026Updated last month
- [UNMAINTAINED] Deploy, run and monitor your Scrapy spiders.☆12Feb 23, 2026Updated 3 weeks ago
- Scrapy+Splash for JavaScript integration☆3,235Feb 11, 2025Updated last year
- ☆68Sep 7, 2018Updated 7 years ago
- A component that tries to avoid downloading duplicate content☆28Feb 10, 2026Updated last month
- Frontera backend to guide a crawl using PageRank, HITS or other ranking algorithms based on the link structure of the web graph, even whe…☆55May 21, 2024Updated last year
- MongoDB extensions for Scrapy☆44Oct 2, 2014Updated 11 years ago
- A decorator to write coroutine-like spider callbacks.☆109Dec 26, 2022Updated 3 years ago
- A service daemon to run Scrapy spiders☆3,087Mar 2, 2026Updated 2 weeks ago
- A complimentary proxy to help to use SPM with headless browsers☆107May 29, 2023Updated 2 years ago
- Scrapy extension to control spiders using JSON-RPC☆299Aug 26, 2019Updated 6 years ago
- Scrapy Middleware to set a random User-Agent for every Request.☆202Aug 16, 2019Updated 6 years ago
- Random User-Agent middleware based on fake-useragent☆689Sep 18, 2023Updated 2 years ago
- xlvector's solution of github contest☆33Aug 30, 2009Updated 16 years ago