scalingexcellence / scrapy-apperyio
Scrapy pipeline that stores Scrapy items in an appery.io database.
☆14Updated 8 years ago
Alternatives and similar repositories for scrapy-apperyio
Users that are interested in scrapy-apperyio are comparing it to the libraries listed below
- Simple Web UI for Scrapy spider management via Scrapyd☆51Updated 7 years ago
- Scrapy middleware which allows crawling only new content☆80Updated 2 years ago
- Some scrapy and web.py examples☆79Updated 8 years ago
- A client interface for Scrapinghub's API☆208Updated 4 months ago
- RenRen Python Library☆28Updated 9 years ago
- Pre-built Scrapy spiders for AutoExtract☆19Updated last year
- Scrapy entrypoint for Scrapinghub job runner☆26Updated last week
- Fast Indexed python HTML parser which builds a DOM node tree, providing common getElementsBy* functions for scraping, testing, modificati…☆102Updated 2 years ago
- A scrapy extension to store requests and responses information in storage service☆26Updated 3 years ago
- Zyte Automatic Extraction integration for Scrapy☆56Updated 3 years ago
- Scrapinghub Command Line Client☆133Updated 2 months ago
- ☆29Updated 4 years ago
- Scrapy middleware for the autologin☆37Updated 7 years ago
- Chunks of Python I've found useful.☆63Updated 4 years ago
- Python API for parsehub.com web scraping service☆46Updated 7 years ago
- Python scripts for scraping bus ticket data from the websites of BoltBus, Greyhound, Megabus, GoBus, Amtrak, Peterpan, and EasternTravel.☆38Updated 4 years ago
- Lightweight library that converts an HTML webpage to JSON data using a template defined in JSON.☆23Updated last month
- A Scrapy crawler for http://books.toscrape.com☆27Updated 8 years ago
- Find which links on a web page are pagination links☆29Updated 8 years ago
- A component that tries to avoid downloading duplicate content☆27Updated 7 years ago
- ScraperWiki Python library for scraping and saving data☆159Updated 2 years ago
- Search engine base (crawler, indexer and parser) using Python, Celery, RabbitMQ, CouchDB and Whoosh.☆11Updated last month
- Scrapy middleware to add extra fields to items, like timestamp, response fields, spider attributes etc.☆56Updated 3 years ago
- A Scrapy pipeline which sends items to an Elasticsearch server☆98Updated 7 years ago
- Simple RSS feed reader for HackerNews.☆28Updated 2 years ago
- A CLI for benchmarking Scrapy.☆32Updated 2 weeks ago
- A simple, Qt-Webengine powered web browser with built-in functionality for basic Scrapy web scraping support.☆109Updated last year
- Django based application that allows creating, deploying and running Scrapy spiders in a distributed manner☆114Updated 7 years ago
- A crawler for http://books.toscrape.com☆42Updated last year
- Easy extraction of keywords and engines from search engine results pages (SERPs).☆90Updated 3 years ago