ncouture / python-search-engineLinks
Search engine base (crawler, indexer and parser) using Python, Celery, RabbitMQ, CouchDB and Whoosh.
☆11Updated 2 months ago
Alternatives and similar repositories for python-search-engine
Users that are interested in python-search-engine are comparing it to the libraries listed below
Sorting:
- PyQuery-based scraping micro-framework.☆117Updated 3 years ago
- Bringing sanity to world of messed-up data☆66Updated 10 years ago
- A Flask full-text search engine☆83Updated 6 years ago
- Jabba's headless webkit browser for scraping AJAX-powered webpages.☆90Updated 10 years ago
- Assorted generic flask views, blueprints, Jinja2 filters, macros, forms and more.☆24Updated 5 years ago
- Want to handle 100,000 messages in 90 seconds? Celery and Kombu are that awesome - Multiple publisher-subscriber demos for processing jso…☆41Updated 6 years ago
- Exporters is an extensible export pipeline library that supports filter, transform and several sources and destinations☆40Updated last year
- SimpleSQLite is a Python library to simplify SQLite database operations: table creation, data insertion and get data as other data format…☆135Updated last week
- Find which links on a web page are pagination links☆29Updated 8 years ago
- dank key/value store high-level APIs☆18Updated 7 years ago
- 🕷Configuration based html scraper☆23Updated 5 months ago
- browser based file editor, built on flask-xxl -> https://github.com/jstacoder/flask-xxl☆42Updated 2 years ago
- Analytics snippets generator extension for the Flask framework.☆83Updated 8 years ago
- Balanced API library in python.☆69Updated 3 years ago
- Python powered spreadsheets☆174Updated 7 years ago
- Python implementation of the Parsley language for extracting structured data from web pages☆92Updated 7 years ago
- Minimal, prototype RESTful server for basic CRUD transactions☆38Updated 8 years ago
- A language for filtering, matching, and validating Python dictionaries☆47Updated 2 years ago
- Python library to convert YAML/JSON into SQLAlchemy SELECT queries☆44Updated 7 years ago
- Task manager built around the gevent green threads library.☆18Updated 6 years ago
- A middleware to use random user agent in Scrapy crawler.☆33Updated 12 years ago
- Check WHOIS information for a list of domains☆38Updated 5 years ago
- An API (for Humans) for converting timestamps.☆24Updated 8 years ago
- Tiny python web crawler☆169Updated 9 years ago
- SQLAlchemy->Datatables☆54Updated last year
- Tools that will make writing tests, bots and scrapers using Selenium much easier☆140Updated 8 months ago
- xmldataset: xml parsing made easy 🗃️☆79Updated 5 years ago
- One interface to read and write the data in various excel formats, import the data into and export the data from databases☆60Updated 5 months ago
- Scrapy spider middleware to split an item into multiple items using a multi-valued key☆20Updated 8 years ago
- Python SMTP client and Email for Humans™☆82Updated 6 years ago