Netflix / genie
Distributed Big Data Orchestration Service
☆1,708 Updated this week
Related projects:
- Mirror of Apache Samza ☆811 Updated 3 weeks ago
- ☆1,606 Updated this week
- A distributed data integration framework that simplifies common aspects of big data integration such as data ingestion, replication, orga… ☆2,214 Updated last week
- Secor is a service implementing Kafka log persistence ☆1,844 Updated last month
- Apache Drill is a distributed MPP query layer for self describing data ☆1,928 Updated 3 weeks ago
- In-memory dimensional time series database. ☆3,431 Updated 2 weeks ago
- Distributed Prometheus time series database ☆1,428 Updated this week
- An extensible distributed system for reliable nearline data streaming at scale ☆911 Updated 4 months ago
- Netflix's distributed Data Pipeline ☆794 Updated last year
- SQL-based streaming analytics platform at scale ☆1,222 Updated 4 years ago
- A Bulk Data Pipeline out of Cassandra ☆323 Updated 5 years ago
- Project SnappyData - memory optimized analytics database, based on Apache Spark™ and Apache Geode™. Stream, Transact, Analyze, Predict in… ☆1,039 Updated last year
- Confluent Schema Registry for Kafka ☆2,194 Updated this week
- Iceberg is a table format for large, slow-moving tabular data ☆476 Updated last year
- Mirror of Apache Oozie ☆708 Updated 2 months ago
- Client library for collecting metrics. ☆741 Updated this week
- Apache Parquet Java ☆2,558 Updated 3 weeks ago
- Apache Geode ☆2,280 Updated last month
- Apache Pinot - A realtime distributed OLAP datastore ☆5,401 Updated this week
- Apache Heron (Incubating) is a realtime, distributed, fault-tolerant stream processing engine from Twitter ☆3,647 Updated last year
- Streaming MapReduce with Scalding and Storm ☆2,139 Updated 2 years ago
- Change Data Capture (CDC) service ☆433 Updated 2 months ago
- Dr. Elephant is a job and flow-level performance monitoring and tuning tool for Apache Hadoop and Apache Spark ☆1,353 Updated last year
- Dremio - the missing link in modern data ☆1,356 Updated 2 weeks ago
- Apache Avro is a data serialization system. ☆2,891 Updated this week
- The Heroic Time Series Database ☆848 Updated 3 years ago
- Real-time Query for Hadoop; mirror of Apache Impala ☆31 Updated last year
- The database purpose-built for stream processing applications. ☆87 Updated this week
- Oryx 2: Lambda architecture on Apache Spark, Apache Kafka for real-time large scale machine learning ☆1,786 Updated 3 years ago
- Livy is an open source REST interface for interacting with Apache Spark from anywhere ☆1,008 Updated last year