Maelstrom is an open source Kafka integration with Spark that is designed to be developer friendly, high performance (millisecond stream processing), scalable (consumes messages at Spark worker nodes), and extremely reliable.
☆22Feb 6, 2017Updated 9 years ago
Alternatives and similar repositories for maelstrom
Users interested in maelstrom are comparing it to the libraries listed below.
- Library and a Framework for building fast, scalable, fault-tolerant Data APIs based on Akka, Avro, ZooKeeper and Kafka☆25Oct 16, 2020Updated 5 years ago
- A file system backed by AntidoteDB.☆13Jun 10, 2021Updated 4 years ago
- Open source task scheduler with dependency management☆15Jul 1, 2018Updated 7 years ago
- Distributed SQL-based real-time streaming computation framework on Apache Storm and Spark☆12Mar 14, 2016Updated 9 years ago
- A quotation-based Scala DSL for scalable data analysis.☆64Jul 7, 2022Updated 3 years ago
- Spark CEP is an extension of Spark Streaming to support SQL-based query processing☆57Apr 12, 2017Updated 8 years ago
- DataStax Enterprise (DSE) Deployment Guide for Google Cloud Platform (GCP)☆10Apr 10, 2020Updated 5 years ago
- This is a simple CEP Engine leveraging the Kafka Streams platform☆16Apr 25, 2017Updated 8 years ago
- Plot live stats as a graph from an Apache Spark application using Lightning-viz☆18Jul 3, 2017Updated 8 years ago
- Euphoria is an open source Java API for creating unified big-data processing flows. It provides an engine independent programming model w…☆81Nov 15, 2022Updated 3 years ago
- Monitoring a Cassandra cluster with ELK (Elasticsearch, Logstash, and Kibana)☆20Mar 16, 2017Updated 8 years ago
- ☆50Feb 11, 2020Updated 6 years ago
- Protobuf serialization support for Apache Flink☆21Jun 1, 2021Updated 4 years ago
- THIS REPOSITORY IS DEPRECATED☆19Jul 6, 2023Updated 2 years ago
- Provides an abstract data access layer on top of metric stores. Supports both SQL and structured JSON queries.☆22Nov 6, 2015Updated 10 years ago
- an open source dataworks platform☆21Jun 4, 2021Updated 4 years ago
- A core AST and utilities to manipulate geographical data☆22Sep 30, 2022Updated 3 years ago
- A set of tools to ease working with Zookeeper and Kafka.☆23Jan 22, 2016Updated 10 years ago
- Temporal_Graph_library☆25Feb 2, 2019Updated 7 years ago
- A distributed generic query layer for Apache Kafka Interactive Queries☆26Nov 8, 2017Updated 8 years ago
- A minimal seed template for an Akka gRPC with Scala build☆19Jan 22, 2026Updated last month
- Presto connector for Apache Kudu☆48Mar 22, 2019Updated 6 years ago
- Example code for building your own MemSQL Streamliner Pipelines☆23Apr 18, 2017Updated 8 years ago
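- akkaflow is a distributed, highly available DAG workflow scheduling tool built on the Akka architecture; it can assign child nodes to cluster machines for parallel execution, making efficient use of cluster resources.☆107Sep 14, 2019Updated 6 years ago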
- DataFibers Data Service☆31Feb 11, 2022Updated 4 years ago
- Develop streaming applications for IBM Streams in Python, Java & Scala.☆28Jul 24, 2022Updated 3 years ago
- Serverless proxy for Spark cluster☆325Oct 29, 2020Updated 5 years ago
- Apache-Spark based Data Flow(ETL) Framework which supports multiple read, write destinations of different types and also support multiple…☆26Jun 7, 2021Updated 4 years ago
- Flink performance tests☆28Nov 13, 2019Updated 6 years ago
- Web Based Kafka Consumer and Producer☆69Jan 29, 2020Updated 6 years ago
- Write your Spark data to Kafka seamlessly☆174Jul 10, 2024Updated last year
- A dynamic data completeness and accuracy library at enterprise scale for Apache Spark☆29Nov 4, 2024Updated last year
- Provides clear, practical guidance on using Akka☆31Jan 17, 2022Updated 4 years ago
- Waimak is an open-source framework that makes it easier to create complex data flows in Apache Spark.☆76Apr 24, 2024Updated last year
- ☆29Dec 31, 2018Updated 7 years ago
- Hive-JDBC-Proxy is a high-performance proxy service for HiveServer2 and Spark ThriftServer, with load balancing and the ability to forward Hive JDBC Client requests to HiveServer2 and Spark ThriftServer based on rules.☆33Apr 12, 2022Updated 3 years ago
- li-apache-kafka-clients is a wrapper library for the Apache Kafka vanilla clients. It provides additional features such as large message …☆136Jul 7, 2023Updated 2 years ago
- Sparkline BI Accelerator provides fast ad-hoc query capability over Logical Cubes. This has been folded into our SNAP Platform(http://bit…☆280Aug 3, 2018Updated 7 years ago
- TimeSeries Java client for Facebook Beringei. It also includes query service with tags support for metrics.☆10May 13, 2017Updated 8 years ago