jayzeng / dirbotLinks
Scrapy project to scrape public web directories (educational)
☆22Updated 8 years ago
Alternatives and similar repositories for dirbot
Users that are interested in dirbot are comparing it to the libraries listed below
Sorting:
- A scrapy pipeline which send items to Elastic Search server☆98Updated 7 years ago
 - ☆167Updated 7 years ago
 - Sometimes sites make crawling hard. Selenium-crawler uses selenium automation to fix that.☆125Updated 12 years ago
 - ☆33Updated 2 weeks ago
 - docker scrapyd scrapy boot2docker crawler - a spider Python application that can be "Dockerized".☆42Updated 10 years ago
 - a scaleable and efficient crawelr with docker cluster , crawl million pages in 2 hours with a single machine☆97Updated last year
 - ScraperWiki Python library for scraping and saving data☆158Updated 2 years ago
 - Fill HTML login forms automatically☆276Updated last year
 - MongoDB extensions for Scrapy☆44Updated 11 years ago
 - Crawlera tools☆26Updated 9 years ago
 - A scrapy pipeline which send items to Elastic Search server☆323Updated 3 years ago
 - Minimalist Win/OSX/Linux System Dashboard using Flask and Freeboard☆202Updated 9 years ago
 - Small set of utilities to simplify writing Scrapy spiders.☆49Updated 10 years ago
 - Let's perform Twitter sentiment analysis using Python, Docker, Elasticsearch, and Kibana!☆138Updated 5 years ago
 - **Available for Contract Work** A boilerplate for building your own subscription site with stripe integration or stripe subscription serv…☆222Updated 13 years ago
 - Simple Web UI for Scrapy spider management via Scrapyd☆51Updated 7 years ago
 - Send bulk html emails from the commandline or in your python script by specifying a database of recipients in csv form, a html template w…☆101Updated 5 years ago
 - Small demo for a "search-as-you-type" app in AngularJS + Python/Flask + Elasticsearch☆69Updated 8 years ago
 - Exporters is an extensible export pipeline library that supports filter, transform and several sources and destinations☆40Updated last year
 - Automated NLP sentiment predictions- batteries included, or use your own data☆18Updated 7 years ago
 - Bringing sanity to world of messed-up data☆66Updated 11 years ago
 - Site Hound (previously THH) is a Domain Discovery Tool☆23Updated 4 years ago
 - The fastest way to start using Twilio with Python.☆99Updated 6 years ago
 - A very simple API consumer, used to reset the 'Eve Demo' Web REST API☆53Updated 7 years ago
 - ☆223Updated 10 years ago
 - Traptor -- A distributed Twitter feed☆26Updated 3 years ago
 - Dynamic data analysis over the web. The logic to your data dashboards.☆156Updated 10 years ago
 - PyQuery-based scraping micro-framework.☆118Updated 3 years ago
 - Converts JSON files to CSV (pulling data from nested structures). Useful for Mongo data☆264Updated 4 years ago
 - [UNMAINTAINED] Deploy, run and monitor your Scrapy spiders.☆11Updated 10 years ago