Crochet-based blocking API for Scrapy.
☆47Feb 24, 2017Updated 9 years ago
Alternatives and similar repositories for scrapydo
Users that are interested in scrapydo are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The missing datasets manager. Like hombrew but for datasets. CLI-tool for search and discover datasets!☆41May 29, 2017Updated 9 years ago
- Show summary of a large number of URLs in a Jupyter Notebook☆19Apr 8, 2026Updated 2 months ago
- Find which links on a web page are pagination links☆29Jan 12, 2017Updated 9 years ago
- Crochet: use Twisted anywhere!☆239Sep 3, 2024Updated last year
- Sentry component for Scrapy☆84Aug 21, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆13Dec 4, 2019Updated 6 years ago
- DataBrewer Recipes Repository.☆21Jul 5, 2016Updated 9 years ago
- Automatic Item List Extraction☆85Jun 15, 2016Updated 10 years ago
- Detect and classify pagination links☆15Sep 9, 2020Updated 5 years ago
- Web Crawling UI and HTTP API, based on Scrapy and Tornado☆162Apr 8, 2026Updated 2 months ago
- A component that tries to avoid downloading duplicate content☆28Apr 8, 2026Updated 2 months ago
- A tool for manage website extraction configs☆37Oct 4, 2013Updated 12 years ago
- [UNMAINTAINED] Deploy, run and monitor your Scrapy spiders.☆12Apr 8, 2026Updated 2 months ago
- Python implementation of the Parsley language for extracting structured data from web pages☆92Oct 26, 2017Updated 8 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- TwoFold (2✂︎f). Text files breathe fire.☆24Jan 28, 2026Updated 5 months ago
- A CLI for dealing with the features of ScrapingHub☆16Apr 20, 2021Updated 5 years ago
- Small set of utilities to simplify writing Scrapy spiders.☆50Jul 24, 2015Updated 10 years ago
- Price and currency parsing utility☆27Mar 6, 2023Updated 3 years ago
- A Scrapy pipeline to categorize items using MonkeyLearn☆38Apr 28, 2017Updated 9 years ago
- HTTP API for Scrapy spiders☆882Updated this week
- ☆16Apr 10, 2026Updated 2 months ago
- Scrapy GUI☆12Feb 26, 2021Updated 5 years ago
- A linter for Scrapy projects.☆22Feb 25, 2026Updated 4 months ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Intelligent Web Data Extractor☆74Dec 5, 2022Updated 3 years ago
- A classifier for detecting soft 404 pages☆61Apr 8, 2026Updated 2 months ago
- Python bindings for html5ever, using CFFI☆39Nov 9, 2017Updated 8 years ago
- Page Object pattern for Scrapy☆127Jun 8, 2026Updated 3 weeks ago
- Modularly extensible semantic metadata validator☆85Dec 10, 2015Updated 10 years ago
- A generic crawler☆80Apr 8, 2026Updated 2 months ago
- 使用anyproxy获取wx_gzh文章☆11Apr 18, 2018Updated 8 years ago
- High Level Kafka Scanner☆19Sep 29, 2017Updated 8 years ago
- A distributed in-memory fabric based on shared-memory blocks and datashape. Any language can operate on the data.☆13Feb 12, 2016Updated 10 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A very simple mobile-friendly game that teaches CSS selectors.☆29Dec 20, 2022Updated 3 years ago
- Skinfer is a tool for inferring and merging JSON schemas☆141Apr 24, 2024Updated 2 years ago
- Scrapy中,将网络资源(文件、图像等)存储在七牛上的Pipeline扩展☆24Dec 26, 2015Updated 10 years ago
- Scrapy Eagle is a tool that allow us to run any Scrapy based project in a distributed fashion and monitor how it is going on and how many…☆24Sep 4, 2020Updated 5 years ago
- The Clever Algorithms project is an effort to describe a large number of algorithmic techniques from the field of Artificial Intelligence…☆29Oct 28, 2018Updated 7 years ago
- Failover AWS Spot Instances☆11Dec 8, 2017Updated 8 years ago
- Tool to flatten stream of JSON-like objects, configured via schema☆33Oct 19, 2019Updated 6 years ago