tjake / stormscraper
A Storm-based web crawler with a Cassandra backend
☆28 Updated 11 years ago
Alternatives and similar repositories for stormscraper:
Users who are interested in stormscraper are comparing it to the repositories listed below.
- A collection of efficient utilities for a data scientist. ☆41 Updated 9 years ago
- Data Science Research Architecture, Data Center OS ☆21 Updated 8 years ago
- Big GeoSpatial Data Points Visualization Tool ☆19 Updated 8 years ago
- chef cookbook to install Apache Spark ☆10 Updated 9 years ago
- Cubes over ElasticSearch. Aggregation library for Business Intelligence ☆20 Updated 10 years ago
- A chef cookbook for deploying spark ☆30 Updated 11 years ago
- A Seriously Fun guide to Big Data Analytics in Practice ☆169 Updated 9 years ago
- Deprecated - Check out MemSQL Pipelines instead! ☆8 Updated 7 years ago
- Dumps state of Storm Kafka consumers ☆96 Updated 7 years ago
- juttle execution engine ☆38 Updated 8 years ago
- A big data cluster management tool that creates and manages clusters of different technologies. ☆21 Updated 9 years ago
- Light-weight monitoring for DCOS ☆9 Updated 9 years ago
- Example code for building your own MemSQL Streamliner Pipelines ☆23 Updated 7 years ago
- Hadoop log aggregator and dashboard ☆191 Updated 11 years ago
- Mesos+Marathon/Ochopod proxy + toolkit + CLI ! ☆56 Updated 8 years ago
- Storm Spout + Kafka State Inspector ☆58 Updated 5 years ago
- Elastic Sentiment Analysis (using Apache Mesos, Marathon and Apache Spark) ☆35 Updated 9 years ago
- A javascript shell for elasticsearch ☆105 Updated 9 years ago
- ☆9 Updated last year
- ☆27 Updated 8 years ago
- Stand-alone ANSI SQL for Cascading on Apache Hadoop ☆48 Updated 6 years ago