DormyMo / scrappy
scrapy best practice
☆37Updated 4 years ago
Alternatives and similar repositories for scrappy:
Users that are interested in scrappy are comparing it to the libraries listed below
- Simple Web UI for Scrapy spider management via Scrapyd☆51Updated 6 years ago
- A decorator to write coroutine-like spider callbacks.☆110Updated 2 years ago
- MongoDB extensions for Scrapy☆44Updated 10 years ago
- Useful test spiders for Scrapy☆185Updated 5 years ago
- Kafka-based components for Scrapy☆79Updated 6 years ago
- Collection of Scrapy utilities (extensions, middlewares, pipelines, etc)☆32Updated 6 years ago
- Scrapy extension which writes crawled items to Kafka☆30Updated 6 years ago
- Use pyppeteer from a Scrapy spider☆60Updated 5 years ago
- Some scrapy and web.py exmaples☆79Updated 7 years ago
- Scrapy middleware to add extra fields to items, like timestamp, response fields, spider attributes etc.☆56Updated 2 years ago
- ☆32Updated last year
- Scrapinghub Command Line Client☆127Updated 9 months ago
- A complimentary proxy to help to use SPM with headless browsers☆108Updated last year
- Scrapy extension to control spiders using JSON-RPC☆297Updated 5 years ago
- Scrapy + Puppeteer☆111Updated 3 years ago
- Exporters is an extensible export pipeline library that supports filter, transform and several sources and destinations☆40Updated 9 months ago
- Small set of utilities to simplify writing Scrapy spiders.☆49Updated 9 years ago
- An example using Selenium webdrivers for python and Scrapy framework to create a web scraper to crawl an ASP site☆126Updated 5 years ago
- Scrapy pipeline which allows you to store scrapy items in appery.io database.☆14Updated 7 years ago
- a proxy address crawler which crawl xici.net.co based on scrapy☆8Updated 10 years ago
- 🕶 Awesome list of Scrapy tools and libraries☆59Updated 4 years ago
- A daemon for scheduling Scrapy spiders☆65Updated 3 years ago
- A client interface for Scrapinghub's API☆205Updated 2 weeks ago
- docker scrapyd scrapy boot2docker crawler - a spider Python application that can be "Dockerized".☆42Updated 9 years ago
- Scrapy middleware which allows to crawl only new content☆80Updated 2 years ago
- Mobilenium allows you to use Selenium and have access to status codes and HTTP headers, without the need for manual labor.☆20Updated 5 years ago
- an awesome public proxy server crawler based on scrapy framework☆96Updated 7 years ago
- Scrapy extension to write items using sqlalchemy models☆37Updated 7 years ago
- ☆143Updated 9 years ago
- Find which links on a web page are pagination links☆29Updated 8 years ago