tjake / stormscraperLinks
A Storm based web crawler with Cassandra backend
☆28Updated 12 years ago
Alternatives and similar repositories for stormscraper
Users that are interested in stormscraper are comparing it to the libraries listed below
Sorting:
- Storm Spout + Kafka State Inspector☆58Updated 6 years ago
- Kerberos, LDAP, Active Directory, PKI/SSL/TLS and host/ip based ACL coarse-grained and document level security for elasticsearch (Authent…☆171Updated 5 years ago
- CSV river for ElasticSearch☆91Updated 8 years ago
- Hadoop log aggregator and dashboard☆190Updated 12 years ago
- Dumps state of Storm Kafka consumers☆96Updated 8 years ago
- VoltDB Click Stream Processing Example.☆16Updated 8 years ago
- Periscope brings SLA policy based autoscaling to Hadoop☆35Updated 10 years ago
- RabbitMQ River Plugin for elasticsearch (STOPPED)☆174Updated 2 years ago
- A nozzle to spray a kafka topic at an HTTP endpoint. This project is deprecated and not maintained.☆49Updated 6 years ago
- Jetstream is a streaming processing framework☆115Updated 10 years ago
- Ambari View for the Ambari Store☆15Updated 10 years ago
- real time log event processing using storm, kafka, logstash & cassandra☆47Updated 12 years ago
- Whatson, an Elasticsearch Consulting Detective☆145Updated 8 years ago
- riemann tool for cassandra☆32Updated 9 years ago
- Ferry lets you define, run, and deploy big data applications on AWS, OpenStack, and your local machine using Docker☆254Updated 10 years ago
- Mesos+Marathon/Ochopod proxy + toolkit + CLI !☆56Updated 9 years ago
- Cookbook to install Hadoop 2.0+ using Chef☆82Updated 2 years ago
- recordbus: mysql binlog to apache kafka☆80Updated 10 years ago
- Apache ZooKeeper quorum configuration generator☆75Updated 3 years ago
- Framework for creating and deploying Apache Storm Topologies☆46Updated 10 years ago
- Turn-key deployments of DC/OS on AWS (template and onprem), Azure, and GCE☆14Updated 2 years ago
- ☆27Updated 9 years ago
- Docker image for ZooKeeper (Maestro orchestration)☆68Updated 4 years ago
- [PROJECT IS NO LONGER MAINTAINED] Wirbelsturm is a Vagrant and Puppet based tool to perform 1-click local and remote deployments, with a …☆330Updated 3 years ago
- A Powerstrip plugin that runs weave inside a container and ensures that containers are connected to the weave network before running thei…☆36Updated 5 months ago
- Big GeoSpatial Data Points Visualization Tool☆19Updated 9 years ago
- Example code for building your own MemSQL Streamliner Pipelines☆23Updated 8 years ago
- Data Science Research Architecture, Data Center OS☆21Updated 9 years ago
- DEPRECATED—Open source Apache Cassandra running on DC/OS is now replaced by mesosphere/dcos-commons/frameworks/cassandra. This repositor…☆116Updated 6 years ago
- Vagrant files creating multi-node virtual Hadoop clusters with or without security.☆67Updated 5 years ago