jeoffreylim / maelstrom
Maelstrom is an open source Kafka integration with Spark that is designed to be developer friendly, high performance (millisecond stream processing), scalable (consumes messges at Spark worker nodes), and is extremely reliable.
☆22Updated 8 years ago
Alternatives and similar repositories for maelstrom:
Users that are interested in maelstrom are comparing it to the libraries listed below
- A library for strong, schema based conversion between 'natural' JSON documents and Avro☆18Updated last year
- Flink performance tests☆28Updated 5 years ago
- A High Performance Cluster Consumer for Kafka that creates Avro (boom) files in Hadoop in time based directory paths☆42Updated 8 years ago
- Spooker is a dynamic framework for processing high volume data streams via processing pipelines☆29Updated 9 years ago
- Camus Compressor merges files created by Camus and saves them in a compressed format.☆12Updated 2 years ago
- Collection of generic Apache Flink operators☆17Updated 7 years ago
- ☆26Updated 5 years ago
- INACTIVE: A daemon to transfer syslog messages to Apache Kafka.☆24Updated 8 years ago
- SamzaSQL: Streaming SQL implementation on top of Apache Samza and Apache Kafka☆29Updated 8 years ago
- Apache Flink as a Cloudera Manager Service☆12Updated 8 years ago
- An example of using Flink for Fault-Tolerant Stream Processing☆12Updated 6 years ago
- Post: Kafka - Rewind Consumer Offset☆23Updated 2 years ago
- Cascading on Apache Flink®☆54Updated last year
- Library and a Framework for building fast, scalable, fault-tolerant Data APIs based on Akka, Avro, ZooKeeper and Kafka☆25Updated 4 years ago
- Common utilities for Apache Kafka☆36Updated last year
- Toolkit that can bundle any Spring Boot application into an Apache Ambari Service, enabling Ambari to provision, manage and monitor the s…☆13Updated 9 years ago
- Example how to integrate Esper with Akka in the form of an Akka event bus☆29Updated 10 years ago
- HDFS compatible Distributed Filesystem backed Cassandra☆25Updated 9 years ago
- A distributed generic query layer for Apache Kafka Interactive Queries☆26Updated 7 years ago
- The ScaleOut Time Windowing Library for Java provides a set of windowing functions for time-ordered lists of events.☆21Updated 6 years ago
- Mirror of Apache DirectMemory☆53Updated last year
- Starter examples to writes distributed fault-tolerant YARN applications☆9Updated 9 years ago
- Read druid segments from hadoop☆10Updated 8 years ago
- JVM agent based metrics with Prometheus and Dropwizard support (Java, Scala, Clojure, Kotlin, etc)☆25Updated 7 years ago
- Starter project for building MemSQL Streamliner Pipelines☆32Updated 7 years ago
- Quick Akka Micro Dag Prototype☆13Updated 8 years ago
- A collection of akka based nice frameworks, libraries and software.☆28Updated 7 years ago
- Flink Examples☆39Updated 8 years ago
- Schema Registry integration for Apache Spark☆40Updated 2 years ago
- Collection of HDP Tuning Tricks & Tips (unofficial guide)☆17Updated 7 years ago