tjake / stormscraperLinks
A Storm based web crawler with Cassandra backend
☆28Updated 11 years ago
Alternatives and similar repositories for stormscraper
Users that are interested in stormscraper are comparing it to the libraries listed below
Sorting:
- Data Science Research Architecture, Data Center OS☆21Updated 9 years ago
- A chef cookbook for deploying spark☆30Updated 12 years ago
- A nozzle to spray a kafka topic at an HTTP endpoint. This project is deprecated and not maintained.☆49Updated 5 years ago
- Big GeoSpatial Data Points Visualization Tool☆19Updated 9 years ago
- VoltDB Click Stream Processing Example.☆16Updated 7 years ago
- Hadoop log aggregator and dashboard☆191Updated 11 years ago
- Storm Spout + Kafka State Inspector☆58Updated 5 years ago
- riemann tool for cassandra☆32Updated 9 years ago
- Sample custom Nifi processor to process tcpdump☆18Updated 9 years ago
- Deprecated - Check out MemSQL Pipelines instead!☆8Updated 7 years ago
- Docker container to locally run Spark and Kafka☆15Updated 8 years ago
- ☆27Updated 9 years ago
- Utilities for building distributed systems on top of mesos☆24Updated 6 years ago
- On demand presto cluster with mesos, marathon and docker.☆30Updated 7 years ago
- Docker containers for Druid nodes☆26Updated 8 years ago
- Muppet☆126Updated 4 years ago
- A Mesos Framework for Tachyon, a memory-centric distributed file system.☆32Updated 10 years ago
- They only live to get radical.☆13Updated 6 years ago
- Elastic Sentiment Analysis (using Apache Mesos, Marathon and Apache Spark)☆35Updated 10 years ago
- CSV river for ElasticSearch☆91Updated 8 years ago
- ☆9Updated 2 years ago
- Apache Mesos Platform as a Service Deploy☆21Updated 9 years ago
- Kafka on Mesos☆33Updated 9 years ago
- Deprecated, use https://github.com/mozilla-services/iprepd☆15Updated 7 years ago
- Mesos+Marathon/Ochopod proxy + toolkit + CLI !☆56Updated 8 years ago
- Periscope brings SLA policy based autoscaling to Hadoop☆35Updated 9 years ago
- Stand-alone ANSI SQL for Cascading on Apache Hadoop☆48Updated 7 years ago
- A collection of efficient utilities for a data scientist.☆41Updated 10 years ago
- juttle execution engine☆36Updated 9 years ago
- Tail a log file and send log lines automatically to a kafka topic☆57Updated 12 years ago