jeoffreylim / maelstromLinks
Maelstrom is an open source Kafka integration with Spark that is designed to be developer friendly, high performance (millisecond stream processing), scalable (consumes messges at Spark worker nodes), and is extremely reliable.
☆22Updated 8 years ago
Alternatives and similar repositories for maelstrom
Users that are interested in maelstrom are comparing it to the libraries listed below
Sorting:
- Spooker is a dynamic framework for processing high volume data streams via processing pipelines☆30Updated 9 years ago
- Flink performance tests☆28Updated 5 years ago
- Flink Examples☆39Updated 9 years ago
- Common utilities for Apache Kafka☆36Updated 2 years ago
- ☆26Updated 5 years ago
- UberScriptQuery, a SQL-like DSL to make writing Spark jobs super easy☆63Updated last year
- Cascading on Apache Flink®☆54Updated last year
- Collection of generic Apache Flink operators☆17Updated 8 years ago
- A High Performance Cluster Consumer for Kafka that creates Avro (boom) files in Hadoop in time based directory paths☆42Updated 9 years ago
- Sql interface to druid.☆77Updated 9 years ago
- A library for strong, schema based conversion between 'natural' JSON documents and Avro☆18Updated last year
- HDFS compatible Distributed Filesystem backed Cassandra☆25Updated 9 years ago
- Quark is a data virtualization engine over analytic databases.☆100Updated 8 years ago
- Yet Another Spark SQL JDBC/ODBC server based on the PostgreSQL V3 protocol☆34Updated 2 years ago
- Mirror of Apache DirectMemory☆52Updated last year
- A set of tools to ease working with Zookeeper and Kafka.☆23Updated 9 years ago
- An example of using Flink for Fault-Tolerant Stream Processing☆12Updated 6 years ago
- spark structured streaming via HTTP communication☆18Updated 3 years ago
- Camus Compressor merges files created by Camus and saves them in a compressed format.☆13Updated 2 years ago
- Java event logs collector for hadoop and frameworks☆41Updated 4 months ago
- Schema Registry integration for Apache Spark☆40Updated 2 years ago
- The ScaleOut Time Windowing Library for Java provides a set of windowing functions for time-ordered lists of events.☆21Updated 7 years ago
- SamzaSQL: Streaming SQL implementation on top of Apache Samza and Apache Kafka☆29Updated 9 years ago
- Demonstration of a Hive Input Format for Iceberg☆26Updated 4 years ago
- A library for financial and time series calculations on Apache Spark☆28Updated 9 years ago
- High performance HBase / Spark SQL engine☆28Updated 3 years ago
- Recipes and examples for Apache Spark☆13Updated 10 years ago
- Web Based Kafka Consumer and Producer☆69Updated 5 years ago
- Druid indexing plugin for using Spark in batch jobs☆101Updated 3 years ago
- An application that records stats about consumer group offset commits and reports them as prometheus metrics☆14Updated 6 years ago