jeoffreylim / maelstrom
Maelstrom is an open source Kafka integration with Spark that is designed to be developer friendly, high performance (millisecond stream processing), scalable (consumes messages at Spark worker nodes), and extremely reliable.
☆22Updated 8 years ago
Alternatives and similar repositories for maelstrom:
Users who are interested in maelstrom are comparing it to the libraries listed below:
- Collection of generic Apache Flink operators☆17Updated 7 years ago
- A library for strong, schema based conversion between 'natural' JSON documents and Avro☆18Updated 11 months ago
- Spooker is a dynamic framework for processing high volume data streams via processing pipelines☆29Updated 9 years ago
- A High Performance Cluster Consumer for Kafka that creates Avro (boom) files in Hadoop in time based directory paths☆42Updated 8 years ago
- Flink Examples☆39Updated 8 years ago
- Flink performance tests☆28Updated 5 years ago
- A distributed generic query layer for Apache Kafka Interactive Queries☆26Updated 7 years ago
- Camus Compressor merges files created by Camus and saves them in a compressed format.☆12Updated last year
- SamzaSQL: Streaming SQL implementation on top of Apache Samza and Apache Kafka☆29Updated 8 years ago
- Common utilities for Apache Kafka☆36Updated last year
- High performance HBase / Spark SQL engine☆28Updated 2 years ago
- spark structured streaming via HTTP communication☆18Updated 2 years ago
- Schema Registry integration for Apache Spark☆40Updated 2 years ago
- a smart, automated non-intrusive driver for hbase region-level major-compact☆8Updated 8 years ago
- A library for financial and time series calculations on Apache Spark☆28Updated 9 years ago
- An example of using Flink for Fault-Tolerant Stream Processing☆12Updated 6 years ago
- Mirror of Apache DirectMemory☆53Updated last year
- Reads an HBase table and writes the output as Text, Seq, Avro, or Parquet☆28Updated 10 years ago
- A playground to get familiar with Apache Calcite☆8Updated 4 years ago
- Library and a Framework for building fast, scalable, fault-tolerant Data APIs based on Akka, Avro, ZooKeeper and Kafka☆25Updated 4 years ago
- HDFS compatible Distributed Filesystem backed by Cassandra☆25Updated 9 years ago
- Yet Another Spark SQL JDBC/ODBC server based on the PostgreSQL V3 protocol☆34Updated 2 years ago
- Very fast & efficient grep for Kafka stream☆41Updated 9 years ago
- Post: Kafka - Rewind Consumer Offset☆23Updated 2 years ago
- Easy metrics collection for Storm topologies using Coda Hale Metrics☆100Updated 11 years ago
- A sink to save Spark Structured Streaming DataFrame into Hive table☆23Updated 6 years ago
- Example how to integrate Esper with Akka in the form of an Akka event bus☆29Updated 10 years ago
- Graphite reporter for Kafka Offset Monitor.☆44Updated 9 years ago