tjake / stormscraper
A Storm based web crawler with Cassandra backend
☆28Updated 11 years ago
Alternatives and similar repositories for stormscraper:
Users that are interested in stormscraper are comparing it to the libraries listed below
- Data Science Research Architecture, Data Center OS☆21Updated 8 years ago
- A Seriously Fun guide to Big Data Analytics in Practice☆169Updated 9 years ago
- A chef cookbook for deploying spark☆30Updated 12 years ago
- Deprecated - Check out MemSQL Pipelines instead!☆8Updated 7 years ago
- They only live to get radical.☆13Updated 6 years ago
- Big GeoSpatial Data Points Visualization Tool☆19Updated 9 years ago
- This is an introduction of Apache Spark DataFrames.☆41Updated 10 years ago
- [Deprecated] Simple docker image to run an Elasticsearch server☆22Updated 8 years ago
- Sample custom Nifi processor to process tcpdump☆18Updated 9 years ago
- VoltDB Click Stream Processing Example.☆16Updated 7 years ago
- Storm Spout + Kafka State Inspector☆58Updated 5 years ago
- ☆27Updated 9 years ago
- Cubes over ElasticSearch. Aggregation library for Business Intelligence☆20Updated 10 years ago
- DCOS CLI in a Docker Container☆10Updated 8 years ago
- Elephant Twin is a framework for creating indexes in Hadoop☆94Updated 4 years ago
- Docker image for Consul ElasticSearch☆12Updated 9 years ago
- Light-weight monitoring for DCOS☆9Updated 9 years ago
- Reduce your data. A unix filter for algebird-powered aggregation.☆138Updated 8 years ago
- A nozzle to spray a kafka topic at an HTTP endpoint. This project is deprecated and not maintained.☆49Updated 5 years ago
- riemann tool for cassandra☆32Updated 8 years ago
- ☆45Updated 7 years ago
- Hadoop log aggregator and dashboard☆191Updated 11 years ago
- Templates for projects based on top of H2O.☆38Updated last month
- Apache Mesos Platform as a Service Deploy☆21Updated 8 years ago
- Example code for building your own MemSQL Streamliner Pipelines☆23Updated 8 years ago
- Experimental packages not ready to be in mesosphere/universe☆19Updated 9 years ago
- CSV river for ElasticSearch☆91Updated 8 years ago
- juttle execution engine☆36Updated 9 years ago
- A package full of linear algebra operators for Apache Spark MLlib's linalg package☆10Updated 9 years ago
- docker image with graphite-api and graphite-influxdb☆39Updated 7 years ago