matiasb / demiurge
PyQuery-based scraping micro-framework.
☆112 Updated 2 years ago
Related projects:
- Scrapy middleware to add extra fields to items, like timestamp, response fields, spider attributes etc. ☆56 Updated 2 years ago
- MongoDB Python logging handler. Centralized logging made simple using MongoDB. ☆135 Updated 5 years ago
- Bringing sanity to the world of messed-up data ☆65 Updated 9 years ago
- Python implementation of the Parsley language for extracting structured data from web pages ☆92 Updated 6 years ago
- Sentry component for Scrapy ☆86 Updated last year
- Display money format and its filthy currencies, for all money lovers out there. ☆72 Updated 3 years ago
- Flask extension that takes care of API representation and authentication. ☆55 Updated 8 years ago
- CSS Selectors for Python ☆291 Updated 4 months ago
- An Extensible Image Crawler ☆158 Updated 7 years ago
- Friendly Python Dates ☆190 Updated 4 years ago
- ☆53 Updated this week
- Asynchronous Python HTTP Requests for Humans using Twisted ☆31 Updated 5 years ago
- Python light ORM for Redis ☆79 Updated 5 years ago
- A CachingQuery implementation for Flask using Flask-SQLAlchemy and Flask-Cache ☆39 Updated 5 years ago
- A simple, immutable URL class with a clean API for interrogation and manipulation. ☆294 Updated last year
- A tiny WSGI web framework ☆45 Updated 8 years ago
- Simple Web UI for Scrapy spider management via Scrapyd ☆49 Updated 6 years ago
- MongoDB extensions for Scrapy ☆44 Updated 9 years ago
- Yet another CLI library, click-like but sub-command friendly and designed for CLI auto-generation. ☆55 Updated 6 years ago
- WTForms integration for peewee ☆111 Updated 10 months ago
- Flask-Sitemap is a Flask extension helping with sitemap generation. ☆52 Updated 5 months ago
- Tools that will make writing tests, bots and scrapers using Selenium much easier ☆141 Updated this week
- A CLI for benchmarking Scrapy. ☆30 Updated 3 years ago
- Embed the Duktape JS interpreter in Python ☆81 Updated last year
- A Flask API for running your Scrapy spiders ☆128 Updated 6 years ago
- Scrapinghub Command Line Client ☆125 Updated 4 months ago
- Django mail system ported to Tornado and made asynchronous ☆104 Updated 6 years ago
- URL Transformation, Sanitization ☆103 Updated 8 months ago
- Scrapy spider middleware to split an item into multiple items using a multi-valued key ☆20 Updated 7 years ago
- A module for querying the DOM tree and writing XPath expressions using native Python syntax. ☆130 Updated 6 years ago