jeoffreylim / maelstromLinks
Maelstrom is an open source Kafka integration with Spark that is designed to be developer friendly, high performance (millisecond stream processing), scalable (consumes messges at Spark worker nodes), and is extremely reliable.
☆22Updated 8 years ago
Alternatives and similar repositories for maelstrom
Users that are interested in maelstrom are comparing it to the libraries listed below
Sorting:
- A High Performance Cluster Consumer for Kafka that creates Avro (boom) files in Hadoop in time based directory paths☆42Updated 9 years ago
- Yet Another Spark SQL JDBC/ODBC server based on the PostgreSQL V3 protocol☆34Updated 3 years ago
- Spooker is a dynamic framework for processing high volume data streams via processing pipelines☆30Updated 9 years ago
- Flink Examples☆38Updated 9 years ago
- Sql interface to druid.☆77Updated 9 years ago
- Cascading on Apache Flink®☆54Updated last year
- A library for strong, schema based conversion between 'natural' JSON documents and Avro☆18Updated last year
- Very fast & efficient grep for Kafka stream☆42Updated 10 years ago
- Mirror of Apache DirectMemory☆53Updated last year
- A library for financial and time series calculations on Apache Spark☆28Updated 9 years ago
- UberScriptQuery, a SQL-like DSL to make writing Spark jobs super easy☆63Updated last year
- SamzaSQL: Streaming SQL implementation on top of Apache Samza and Apache Kafka☆29Updated 9 years ago
- Collection of generic Apache Flink operators