jeoffreylim / maelstrom
Maelstrom is an open source Kafka integration with Spark that is designed to be developer friendly, high performance (millisecond stream processing), scalable (consumes messages at Spark worker nodes), and extremely reliable.
☆22 Updated 7 years ago
Related projects
Alternatives and complementary repositories for maelstrom
- A library for strong, schema based conversion between 'natural' JSON documents and Avro ☆18 Updated 8 months ago
- Collection of generic Apache Flink operators ☆17 Updated 7 years ago
- Spooker is a dynamic framework for processing high volume data streams via processing pipelines ☆29 Updated 8 years ago
- ☆26 Updated 4 years ago
- Cascading on Apache Flink® ☆54 Updated 9 months ago
- A High Performance Cluster Consumer for Kafka that creates Avro (boom) files in Hadoop in time based directory paths ☆42 Updated 8 years ago
- Common utilities for Apache Kafka ☆36 Updated last year
- Schema Registry integration for Apache Spark ☆39 Updated 2 years ago
- A distributed generic query layer for Apache Kafka Interactive Queries ☆26 Updated 7 years ago
- Flink performance tests ☆29 Updated 5 years ago
- Camus Compressor merges files created by Camus and saves them in a compressed format. ☆12 Updated last year
- SamzaSQL: Streaming SQL implementation on top of Apache Samza and Apache Kafka ☆29 Updated 8 years ago
- A library for financial and time series calculations on Apache Spark ☆28 Updated 8 years ago
- High performance HBase / Spark SQL engine ☆28 Updated 2 years ago
- Apache Flink as a Cloudera Manager Service ☆12 Updated 8 years ago
- A small project to show how to add lineage to Atlas when using Spark as ETL tool ☆12 Updated 7 years ago
- Flink Examples ☆39 Updated 8 years ago
- The ScaleOut Time Windowing Library for Java provides a set of windowing functions for time-ordered lists of events. ☆21 Updated 6 years ago
- Real-time analytics in Apache Flume ☆52 Updated 8 years ago
- An application to monitor and drive the Spark JobServer ☆11 Updated 9 years ago
- This is a datasource implementation for quick query in Kafka with Spark ☆9 Updated last year
- Starter project for building MemSQL Streamliner Pipelines ☆32 Updated 7 years ago
- Plot live-stats as graph from ApacheSpark application using Lightning-viz ☆18 Updated 7 years ago
- Example using Grafana with Druid ☆11 Updated 9 years ago
- Streaming Analytics platform, built with Apache Flink and Kafka ☆34 Updated last year
- ☆21 Updated last year