Maelstrom is an open source Kafka integration with Spark that is designed to be developer friendly, high performance (millisecond stream processing), scalable (consumes messages at Spark worker nodes), and extremely reliable.
☆22Feb 6, 2017Updated 9 years ago
Alternatives and similar repositories for maelstrom
Users that are interested in maelstrom are comparing it to the libraries listed below.
- A parser for haproxy logs in Python☆14Feb 13, 2011Updated 15 years ago
- Distributed SQL-based real-time streaming computation framework on Apache Storm, Spark☆12Mar 14, 2016Updated 10 years ago
- Library and a Framework for building fast, scalable, fault-tolerant Data APIs based on Akka, Avro, ZooKeeper and Kafka☆25Oct 16, 2020Updated 5 years ago
- Riemann tool for Cassandra☆31May 19, 2016Updated 10 years ago
- Open source task scheduler with dependency management☆15Jul 1, 2018Updated 7 years ago
- A quotation-based Scala DSL for scalable data analysis.☆65Jul 7, 2022Updated 3 years ago
- This is a simple CEP Engine leveraging the Kafka Streams platform☆16Apr 25, 2017Updated 9 years ago
- Spark package to "plug" holes in data using SQL based rules ⚡️ 🔌☆28May 15, 2020Updated 6 years ago
- A file system backed by AntidoteDB.☆13Jun 10, 2021Updated 4 years ago
- Spark CEP is an extension of Spark Streaming to support SQL-based query processing☆58Apr 12, 2017Updated 9 years ago
- DataMine is Turn's data warehouse to address the challenges of information management and analytics.☆10Mar 23, 2018Updated 8 years ago
- Open-source distributed workflow scheduling tool that also supports streaming tasks.☆40Nov 11, 2017Updated 8 years ago
- DataStax Enterprise (DSE) Deployment Guide for Google Cloud Platform (GCP)☆10Apr 10, 2020Updated 6 years ago
- Temporal_Graph_library☆25Feb 2, 2019Updated 7 years ago
- Scala API for Apache Spark SQL high-order functions☆14Aug 4, 2023Updated 2 years ago
- Ruby tools for Elasticsearch☆29Mar 7, 2025Updated last year
- A distributed generic query layer for Apache Kafka Interactive Queries☆26Nov 8, 2017Updated 8 years ago
- A core AST and utilities to manipulate geographical data☆21Sep 30, 2022Updated 3 years ago
- Simple URI routing and generation in JavaScript☆58Mar 26, 2014Updated 12 years ago
- Parquet file generator☆22Apr 17, 2018Updated 8 years ago
- THIS REPOSITORY IS DEPRECATED☆19Jul 6, 2023Updated 2 years ago
- A set of tools to ease working with Zookeeper and Kafka.☆23Jan 22, 2016Updated 10 years ago
- An Elasticsearch plugin that allows updating specific fields of a document, avoiding a full reindex and reducing traffic costs☆39Mar 1, 2014Updated 12 years ago
- ☆51May 21, 2026Updated 2 weeks ago
- A minimal seed template for an Akka gRPC with Scala build☆19May 18, 2026Updated 3 weeks ago
- Parallel Streaming Transformation Loader☆10Apr 23, 2019Updated 7 years ago
- Presto connector for Apache Kudu☆48Mar 22, 2019Updated 7 years ago
- Framework for running macro benchmarks in a clustered environment☆25Aug 29, 2022Updated 3 years ago
- an open source dataworks platform☆20Jun 4, 2021Updated 5 years ago
- Monitoring a Cassandra cluster with ELK (Elasticsearch, Logstash, and Kibana)☆20Mar 16, 2017Updated 9 years ago
- Protobuf serialization support for Apache Flink☆21Jun 1, 2021Updated 5 years ago
- Docker Alpine image with Elasticsearch Curator cron job☆13Nov 19, 2017Updated 8 years ago
- Flink performance tests☆29Nov 13, 2019Updated 6 years ago
- Prometheus exporter which fetches JSON from a URL and exports one of the values as gauge metrics☆24Mar 16, 2019Updated 7 years ago
- UberScriptQuery, a SQL-like DSL to make writing Spark jobs super easy☆65Dec 17, 2023Updated 2 years ago
- An open source enterprise data warehousing and analysis platform.☆22Nov 8, 2021Updated 4 years ago
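- akkaflow is a distributed, highly available DAG workflow scheduling tool built on the Akka architecture; it can assign child nodes to machines in the cluster for parallel execution, making efficient use of cluster resources.☆106Sep 14, 2019Updated 6 years ago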
- Be a silentor, just focus on marking your words down!☆12Jul 18, 2015Updated 10 years ago
- Web Based Kafka Consumer and Producer☆70Jan 29, 2020Updated 6 years ago