jeoffreylim / maelstrom
Maelstrom is an open source Kafka integration with Spark that is designed to be developer friendly, high performance (millisecond stream processing), scalable (consumes messages at Spark worker nodes), and extremely reliable.
☆22 Feb 6, 2017 Updated 9 years ago
Alternatives and similar repositories for maelstrom
Users who are interested in maelstrom are comparing it to the libraries listed below.
- Open source task scheduler with dependency management ☆15 Jul 1, 2018 Updated 7 years ago
- A file system backed by AntidoteDB. ☆13 Jun 10, 2021 Updated 4 years ago
- riemann tool for cassandra ☆32 May 19, 2016 Updated 9 years ago
- Distributed SQL base Realtime Streaming Computation Framework On Apache Storm, Spark ☆12 Mar 14, 2016 Updated 9 years ago
- Spark package to "plug" holes in data using SQL based rules ⚡️ 🔌 ☆29 May 15, 2020 Updated 5 years ago
- A quotation-based Scala DSL for scalable data analysis. ☆64 Jul 7, 2022 Updated 3 years ago
- Spark CEP is an extension of Spark Streaming to support SQL-based query processing ☆57 Apr 12, 2017 Updated 8 years ago
- This is a simple CEP Engine leveraging the Kafka Streams platform ☆16 Apr 25, 2017 Updated 8 years ago
- Plot live-stats as graph from ApacheSpark application using Lightning-viz ☆18 Jul 3, 2017 Updated 8 years ago
- Parallel Streaming Transformation Loader ☆10 Apr 23, 2019 Updated 6 years ago
- Euphoria is an open source Java API for creating unified big-data processing flows. It provides an engine independent programming model w… ☆82 Nov 15, 2022 Updated 3 years ago
- Open-source distribute workflow schedule tools, also support streaming task. ☆39 Nov 11, 2017 Updated 8 years ago
- Monitoring cassandra cluster by ELK (Elasticsearch, logstash and Kibana) ☆20 Mar 16, 2017 Updated 8 years ago
- Demonstration of a Hive Input Format for Iceberg ☆26 Mar 12, 2021 Updated 4 years ago
- ☆50 Feb 11, 2020 Updated 6 years ago
- THIS REPOSITORY IS DEPRECATED ☆19 Jul 6, 2023 Updated 2 years ago
- an open source dataworks platform ☆21 Jun 4, 2021 Updated 4 years ago
- Provides an abstract data access layer on top of metric stores. Supports both SQL and structured JSON queries. ☆22 Nov 6, 2015 Updated 10 years ago
- A core AST and utilities to manipulate geographical data ☆22 Sep 30, 2022 Updated 3 years ago
- Protobuf serialization support for Apache Flink ☆21 Jun 1, 2021 Updated 4 years ago
- Parquet file generator ☆22 Apr 17, 2018 Updated 7 years ago
- A set of tools to ease working with Zookeeper and Kafka. ☆23 Jan 22, 2016 Updated 10 years ago
- A distributed generic query layer for Apache Kafka Interactive Queries ☆26 Nov 8, 2017 Updated 8 years ago
- Temporal_Graph_library ☆25 Feb 2, 2019 Updated 7 years ago
- Presto connector for Apache Kudu ☆48 Mar 22, 2019 Updated 6 years ago
- UberScriptQuery, a SQL-like DSL to make writing Spark jobs super easy ☆63 Dec 17, 2023 Updated 2 years ago
- Framework for running macro benchmarks in a clustered environment ☆25 Aug 29, 2022 Updated 3 years ago
- DataFibers Data Service ☆31 Feb 11, 2022 Updated 4 years ago
- Serverless proxy for Spark cluster ☆324 Oct 29, 2020 Updated 5 years ago
- Flink performance tests ☆28 Nov 13, 2019 Updated 6 years ago
- Web Based Kafka Consumer and Producer ☆69 Jan 29, 2020 Updated 6 years ago
- Write your Spark data to Kafka seamlessly ☆174 Jul 10, 2024 Updated last year
- A dynamic data completeness and accuracy library at enterprise scale for Apache Spark ☆29 Nov 4, 2024 Updated last year
- Provides clear, practical guidance on applying Akka ☆31 Jan 17, 2022 Updated 4 years ago
- Waimak is an open-source framework that makes it easier to create complex data flows in Apache Spark. ☆76 Apr 24, 2024 Updated last year
- ☆29 Dec 31, 2018 Updated 7 years ago
- li-apache-kafka-clients is a wrapper library for the Apache Kafka vanilla clients. It provides additional features such as large message … ☆136 Jul 7, 2023 Updated 2 years ago
- TimeSeries Java client for Facebook Beringei. It also includes query service with tags support for metrics. ☆10 May 13, 2017 Updated 8 years ago
- Streaming Analytics platform, built with Apache Flink and Kafka ☆36 Oct 6, 2023 Updated 2 years ago