Netflix / aegisthusLinks
A Bulk Data Pipeline out of Cassandra
☆323Updated 6 years ago
Alternatives and similar repositories for aegisthus
Users that are interested in aegisthus are comparing it to the libraries listed below
Sorting:
- ☆204Updated 2 years ago
- Netflix's distributed Data Pipeline☆797Updated 2 years ago
- Mirror of Apache Apex core☆349Updated 4 years ago
- Cassandra Java Client☆1,036Updated 7 months ago
- Co-Process for backup/recovery, Token Management, and Centralized Configuration management for Cassandra.☆1,035Updated 2 weeks ago
- Real²time Exploratory Analytics on Large Datasets☆122Updated 5 years ago
- Apache Kafka on Apache Mesos☆413Updated 7 years ago
- Kite SDK☆394Updated 2 years ago
- LinkedIn's previous generation Kafka to HDFS pipeline.☆882Updated 4 years ago
- https://github.com/apache/incubator-myriad is our new home. See☆253Updated 9 years ago
- [PROJECT IS NO LONGER MAINTAINED] Wirbelsturm is a Vagrant and Puppet based tool to perform 1-click local and remote deployments, with a …☆328Updated 3 years ago
- Pig Visualization framework☆466Updated 2 years ago
- Netflix Data Store Benchmark☆362Updated last year
- Twitter's collection of LZO and Protocol Buffer-related Hadoop, Pig, Hive, and HBase code.☆1,136Updated 2 years ago
- Hadoop mapreduce job to bulk load data into Cassandra☆75Updated 3 years ago
- Apache Aurora - A Mesos framework for long-running services, cron jobs, and ad-hoc jobs☆635Updated 5 years ago
- A repository of information, examples and good practices around the Lambda Architecture☆369Updated 7 years ago
- Extensible Scheduler for Mesos Frameworks☆699Updated 2 years ago
- Mirror of Apache Samza☆828Updated 2 months ago
- Fast and efficient batch computation engine for complex analysis and reporting of massive datasets on Hadoop☆244Updated 9 years ago
- A platform for visualization and real-time monitoring of data workflows☆1,172Updated 5 years ago
- Tranquility helps you send real-time event streams to Druid and handles partitioning, replication, service discovery, and schema rollover…☆516Updated 5 years ago
- kafka-connect-s3 : Ingest data from Kafka to Object Stores(s3)☆94Updated 6 years ago
- Next-generation web analytics processing with Scala, Spark, and Parquet.☆331Updated 10 years ago
- Java client for Dynomite☆186Updated 7 months ago
- Storm on Mesos!☆136Updated 3 years ago
- ReAir is a collection of easy-to-use tools for replicating tables and partitions between Hive data warehouses.☆279Updated 6 years ago
- Google Dataflow Runner for Apache Flink™ (deprecated; please use the up-to-date Beam Runner)☆88Updated 9 years ago
- Hadoop log aggregator and dashboard☆191Updated 11 years ago
- Streaming MapReduce with Scalding and Storm☆2,132Updated 3 years ago