jeoffreylim / maelstromLinks
Maelstrom is an open source Kafka integration with Spark that is designed to be developer friendly, high performance (millisecond stream processing), scalable (consumes messges at Spark worker nodes), and is extremely reliable.
☆22Updated 8 years ago
Alternatives and similar repositories for maelstrom
Users that are interested in maelstrom are comparing it to the libraries listed below
Sorting:
- Collection of generic Apache Flink operators☆17Updated 8 years ago
- Flink Examples☆38Updated 9 years ago
- Flink performance tests☆28Updated 6 years ago
- A High Performance Cluster Consumer for Kafka that creates Avro (boom) files in Hadoop in time based directory paths☆42Updated 9 years ago
- UberScriptQuery, a SQL-like DSL to make writing Spark jobs super easy☆63Updated 2 years ago
- Cascading on Apache Flink®☆54Updated last year
- Spooker is a dynamic framework for processing high volume data streams via processing pipelines☆30Updated 9 years ago
- Camus Compressor merges files created by Camus and saves them in a compressed format.☆13Updated 2 years ago
- Yet Another Spark SQL JDBC/ODBC server based on the PostgreSQL V3 protocol☆34Updated 3 years ago
- SamzaSQL: Streaming SQL implementation on top of Apache Samza and Apache Kafka☆29Updated 9 years ago
- Sql interface to druid.☆77Updated 10 years ago
- A library for financial and time series calculations on Apache Spark☆28Updated 9 years ago
- Schema Registry integration for Apache Spark☆40Updated 3 years ago
- Example using Grafana with Druid☆11Updated 10 years ago
- ☆26Updated 6 years ago
- Common utilities for Apache Kafka☆36Updated 2 years ago
- HDFS compatible Distributed Filesystem backed Cassandra☆25Updated 10 years ago
- Spark Connector for Hazelcast☆22Updated 4 years ago
- INACTIVE: A daemon to transfer syslog messages to Apache Kafka.☆24Updated 8 years ago
- A library for strong, schema based conversion between 'natural' JSON documents and Avro☆18Updated last year
- Spark package to "plug" holes in data using SQL based rules ⚡️ 🔌☆29Updated 5 years ago
- A set of tools to ease working with Zookeeper and Kafka.☆23Updated 9 years ago
- A light Kafka to HDFS/S3 ETL library based on Apache Spark☆40Updated 8 years ago
- A spark sbt blueprint to build your own spark apps off of (for cloud native runtime, see the kube/spark examples)☆57Updated 6 years ago
- Java event logs collector for hadoop and frameworks☆41Updated 9 months ago
- spark structured streaming via HTTP communication☆18Updated 3 years ago
- Spark CEP is an extension of Spark Streaming to support SQL-based query processing☆57Updated 8 years ago
- High performance HBase / Spark SQL engine☆28Updated 3 years ago
- Extensions, custom & experimental panels☆53Updated 10 years ago
- Mirror of Apache DirectMemory☆53Updated 2 years ago