RockyZ / Scrapy-sqlite-item-exporter
Export items crawled by Scrapy 1.4 to a sqlite3 database
☆32 · Updated 11 years ago
Alternatives and similar repositories for Scrapy-sqlite-item-exporter
Users who are interested in Scrapy-sqlite-item-exporter are comparing it to the libraries listed below
- A Scrapy extension to store request and response information in a storage service ☆26 · Updated 3 years ago
- Python implementation of the Parsley language for extracting structured data from web pages ☆92 · Updated 7 years ago
- Find which links on a web page are pagination links ☆29 · Updated 8 years ago
- Simple Python cache and memoizing module ☆84 · Updated last year
- imgspy finds the metadata (type, size) of an image given its url by fetching as little as needed ☆55 · Updated 4 years ago
- Scrapy middleware which allows crawling only new content ☆80 · Updated 2 years ago
- ☆12 · Updated 3 years ago
- PyQuery-based scraping micro-framework. ☆116 · Updated 3 years ago
- Exporters is an extensible export pipeline library that supports filter, transform and several sources and destinations ☆40 · Updated 11 months ago
- Simple HTTP cache for Python Requests ☆97 · Updated 8 years ago
- Python 3 AsyncIO powered scraping framework with batteries included ☆20 · Updated 8 years ago
- Dump (freeze) SQL query results from a database into a selection of file formats ☆92 · Updated 6 years ago
- A Python framework to generate HTML and JavaScript from reusable and combinable widgets. ☆23 · Updated 2 years ago
- Scrapinghub Command Line Client ☆132 · Updated 3 weeks ago
- Scrapy spider middleware to split an item into multiple items using a multi-valued key ☆20 · Updated 8 years ago
- Scrapy extension to write items using sqlalchemy models ☆37 · Updated 8 years ago
- MongoDB extensions for Scrapy ☆44 · Updated 10 years ago
- Bringing sanity to the world of messed-up data ☆66 · Updated 10 years ago
- An Eve extension for MongoEngine ODM support ☆40 · Updated 3 years ago
- WarcMiddleware lets users seamlessly download a mirror copy of a website when running a web crawl with the Python web crawler Scrapy. ☆46 · Updated 7 years ago
- Sentry component for Scrapy ☆86 · Updated last year
- Python bindings to the Tesseract API ☆66 · Updated 8 years ago
- Scrapy schema validation pipeline and Item builder using JSON Schema ☆44 · Updated 4 years ago
- Scrapy spider middleware to clean up query parameters in request URLs ☆24 · Updated 8 years ago
- ☆143 · Updated 9 years ago
- Feedbuffer buffers RSS and Atom syndication feeds, that is to say it caches new feed entries until the news aggregator requests them and … ☆19 · Updated 8 years ago
- A Python library for generating fake user data. ☆141 · Updated 8 years ago
- Nefertari is a REST API framework sitting on top of Pyramid and ElasticSearch ☆53 · Updated 5 years ago
- Flask extension for resizing, cropping and caching images. ☆49 · Updated 3 years ago
- ☆120 · Updated 9 years ago