jeoffreylim / maelstrom
Maelstrom is an open-source Kafka integration with Spark that is designed to be developer-friendly, high-performance (millisecond stream processing), scalable (consumes messages at Spark worker nodes), and extremely reliable.
☆22Updated 8 years ago
Alternatives and similar repositories for maelstrom:
Users who are interested in maelstrom are comparing it to the libraries listed below.
- A library for strong, schema based conversion between 'natural' JSON documents and Avro☆18Updated last year
- Spooker is a dynamic framework for processing high volume data streams via processing pipelines☆29Updated 9 years ago
- ☆26Updated 5 years ago
- Flink performance tests☆28Updated 5 years ago
- Collection of generic Apache Flink operators☆17Updated 7 years ago
- A High Performance Cluster Consumer for Kafka that creates Avro (boom) files in Hadoop in time based directory paths☆42Updated 8 years ago
- SamzaSQL: Streaming SQL implementation on top of Apache Samza and Apache Kafka☆29Updated 8 years ago
- A distributed generic query layer for Apache Kafka Interactive Queries☆26Updated 7 years ago
- Common utilities for Apache Kafka☆36Updated last year
- A set of tools to ease working with Zookeeper and Kafka.☆23Updated 9 years ago
- Post: Kafka - Rewind Consumer Offset☆23Updated 2 years ago
- Kafka Connect Integration with Kafka Streams + KSQL☆11Updated 6 years ago
- Dropwizard Metrics reporter for Apache Spark☆28Updated 10 years ago
- Integrate Grafana with Ambari Metrics System☆27Updated 3 months ago
- Toolkit that can bundle any Spring Boot application into an Apache Ambari Service, enabling Ambari to provision, manage and monitor the s…☆13Updated 9 years ago
- Flink Examples☆39Updated 8 years ago
- Camus Compressor merges files created by Camus and saves them in a compressed format.☆12Updated last year
- An example of using Flink for Fault-Tolerant Stream Processing☆12Updated 6 years ago
- Read druid segments from hadoop☆10Updated 8 years ago
- HDFS-compatible distributed filesystem backed by Cassandra☆25Updated 9 years ago
- Mirror of Apache DirectMemory☆53Updated last year
- Discover Flink clusters on Hadoop YARN for Prometheus☆23Updated 4 years ago
- High performance HBase / Spark SQL engine☆28Updated 2 years ago
- a smart, automated non-intrusive driver for hbase region-level major-compact☆8Updated 8 years ago
- Apache Flink as a Cloudera Manager Service☆12Updated 8 years ago
- OpenTSDB Metrics Reporter☆54Updated last week
- The ScaleOut Time Windowing Library for Java provides a set of windowing functions for time-ordered lists of events.☆21Updated 6 years ago
- A library for financial and time series calculations on Apache Spark☆28Updated 9 years ago
- A sink to save Spark Structured Streaming DataFrame into Hive table☆23Updated 6 years ago
- Spark Connector for Hazelcast☆22Updated 3 years ago