tjake / stormscraper
A Storm based web crawler with Cassandra backend
☆28Updated 11 years ago
Alternatives and similar repositories for stormscraper:
Users that are interested in stormscraper are comparing it to the libraries listed below
- VoltDB Click Stream Processing Example.☆16Updated 7 years ago
- riemann tool for cassandra☆32Updated 8 years ago
- [Deprecated] Simple docker image to run an Elasticsearch server☆22Updated 7 years ago
- Muppet☆126Updated 3 years ago
- A nozzle to spray a kafka topic at an HTTP endpoint. This project is deprecated and not maintained.☆49Updated 5 years ago
- Storm Spout + Kafka State Inspector☆58Updated 5 years ago
- Data Science Research Architecture, Data Center OS☆21Updated 8 years ago
- Big GeoSpatial Data Points Visualization Tool☆19Updated 8 years ago
- juttle execution engine☆37Updated 8 years ago
- Docker containers for Druid nodes☆27Updated 8 years ago
- A big data cluster management tool that creates and manages clusters of different technologies.☆21Updated 9 years ago
- Deprecated - Check out MemSQL Pipelines instead!☆8Updated 7 years ago
- Usage examples for Divolte collector☆17Updated 7 years ago
- docker image with graphite-api and graphite-influxdb☆39Updated 7 years ago
- A chef cookbook for deploying spark☆30Updated 11 years ago
- Kerberos, LDAP, Active Directory, PKI/SSL/TLS and host/ip based ACL coarse-grained and document level security for elasticsearch (Authent…☆170Updated 5 years ago
- DCOS CLI in a Docker Container☆10Updated 7 years ago
- Stand-alone ANSI SQL for Cascading on Apache Hadoop☆48Updated 7 years ago
- Apache Mesos Platform as a Service Deploy☆21Updated 8 years ago
- Utilities and examples to asssist in working with PySpark and Cassandra.☆36Updated 10 years ago
- A collection of efficient utilities for a data scientist.☆41Updated 9 years ago
- A set of components designed to retrieve data from third-party APIs and storage systems, and to pass that data in to a DataSift account.☆9Updated 7 years ago
- Periscope brings SLA policy based autoscaling to Hadoop☆35Updated 9 years ago
- CSV river for ElasticSearch☆91Updated 7 years ago
- Jetstream is a streaming processing framework☆113Updated 9 years ago
- Dockerfile with graphite, grafana, elasticsearch, statsd, fake data generator☆45Updated 9 years ago
- A Seriously Fun guide to Big Data Analytics in Practice☆169Updated 9 years ago
- Mesos containerizer hooks for Docker☆249Updated 6 years ago
- ☆27Updated 8 years ago
- Kafka on Mesos☆33Updated 9 years ago