ncouture / python-search-engine
Search engine base (crawler, indexer and parser) using Python, Celery, RabbitMQ, CouchDB and Whoosh.
☆11 Updated 5 months ago
Alternatives and similar repositories for python-search-engine
Users who are interested in python-search-engine compare it to the libraries listed below.
- A Flask full-text search engine ☆83 Updated 6 years ago
- PyQuery-based scraping micro-framework. ☆118 Updated 3 years ago
- SimpleSQLite is a Python library to simplify SQLite database operations: table creation, data insertion and get data as other data format… ☆135 Updated 3 weeks ago
- A middleware to use random user agent in Scrapy crawler. ☆33 Updated 12 years ago
- Bringing sanity to world of messed-up data ☆66 Updated 11 years ago
- Turn your IPython console into a cross-database SQL client ☆31 Updated 9 years ago
- One interface to read and write the data in various excel formats, import the data into and export the data from databases ☆60 Updated 3 weeks ago
- A language for filtering, matching, and validating Python dictionaries ☆47 Updated 2 years ago
- 🕷 Configuration based html scraper ☆23 Updated 2 weeks ago
- Analytics snippets generator extension for the Flask framework. ☆82 Updated 9 years ago
- Supervisor On/Off: an alternative web interface for supervisor ☆50 Updated 8 years ago
- Jabba's headless webkit browser for scraping AJAX-powered webpages. ☆90 Updated 11 years ago
- Python implementation of the Parsley language for extracting structured data from web pages ☆92 Updated 8 years ago
- Async wrapper for requests / aiohttp, and some crawler toolkits. Let synchronization code enjoy the performance of asynchronous programmi… ☆24 Updated 9 months ago
- Minimalist Selenium WebDriver wrapper to work within rate limits of any amount of websites simultaneously. Parallel processing friendly. ☆51 Updated 8 years ago
- Want to handle 100,000 messages in 90 seconds? Celery and Kombu are that awesome - Multiple publisher-subscriber demos for processing jso… ☆41 Updated 7 years ago
- Browser-based file editor, built on flask-xxl -> https://github.com/jstacoder/flask-xxl ☆42 Updated 2 years ago
- Find which links on a web page are pagination links ☆29 Updated 8 years ago
- Tools that will make writing tests, bots and scrapers using Selenium much easier ☆140 Updated 11 months ago
- Create scheduled tasks at runtime easily (Django, Flask, Bottle etc.) ☆45 Updated 9 years ago
- Assorted generic flask views, blueprints, Jinja2 filters, macros, forms and more. ☆24 Updated 6 years ago
- Flask extension that takes care of API representation and authentication. ☆55 Updated 10 years ago
- Python powered spreadsheets ☆173 Updated 7 years ago
- A/B testing for your Flask application. ☆121 Updated 5 years ago
- Scrapy middleware to add extra fields to items, like timestamp, response fields, spider attributes etc. ☆56 Updated 3 years ago
- Python SMTP client and Email for Humans™ ☆82 Updated 6 years ago
- Simple RSS reader by PyFladesk ☆61 Updated 7 years ago
- Scrapy spider middleware to split an item into multiple items using a multi-valued key ☆20 Updated 8 years ago
- A scalable and efficient crawler with a Docker cluster; crawls a million pages in 2 hours on a single machine ☆97 Updated last year
- A high-performance distributed web crawling and scraping framework written in Go and Python. ☆30 Updated 9 years ago