Maelstrom is an open source Kafka integration with Spark that is designed to be developer friendly, high performance (millisecond stream processing), scalable (consumes messages at Spark worker nodes), and extremely reliable.
☆22 · Feb 6, 2017 · Updated 9 years ago
Alternatives and similar repositories for maelstrom
Users that are interested in maelstrom are comparing it to the libraries listed below.
- A parser for haproxy logs in Python ☆14 · Feb 13, 2011 · Updated 15 years ago
- Distributed SQL-based real-time streaming computation framework on Apache Storm, Spark ☆12 · Mar 14, 2016 · Updated 10 years ago
- Library and a Framework for building fast, scalable, fault-tolerant Data APIs based on Akka, Avro, ZooKeeper and Kafka ☆25 · Oct 16, 2020 · Updated 5 years ago
- riemann tool for cassandra ☆31 · May 19, 2016 · Updated 10 years ago
- Open source task scheduler with dependency management ☆15 · Jul 1, 2018 · Updated 7 years ago
- This is a simple CEP Engine leveraging the Kafka Streams platform ☆16 · Apr 25, 2017 · Updated 9 years ago
- Spark package to "plug" holes in data using SQL based rules ⚡️ 🔌 ☆28 · May 15, 2020 · Updated 6 years ago
- A file system backed by AntidoteDB. ☆13 · Jun 10, 2021 · Updated 4 years ago
- Spark CEP is an extension of Spark Streaming to support SQL-based query processing ☆58 · Apr 12, 2017 · Updated 9 years ago
- Open-source distributed workflow scheduling tool that also supports streaming tasks. ☆40 · Nov 11, 2017 · Updated 8 years ago
- DataStax Enterprise (DSE) Deployment Guide for Google Cloud Platform (GCP) ☆10 · Apr 10, 2020 · Updated 6 years ago
- Temporal_Graph_library ☆25 · Feb 2, 2019 · Updated 7 years ago
- Scala API for Apache Spark SQL high-order functions ☆14 · Aug 4, 2023 · Updated 2 years ago
- Ruby tools for Elastic Search ☆29 · Mar 7, 2025 · Updated last year
- A distributed generic query layer for Apache Kafka Interactive Queries ☆26 · Nov 8, 2017 · Updated 8 years ago
- Demonstration of a Hive Input Format for Iceberg ☆26 · Mar 12, 2021 · Updated 5 years ago
- Simple URI routing and generation in Javascript ☆58 · Mar 26, 2014 · Updated 12 years ago
- Plot live-stats as graph from ApacheSpark application using Lightning-viz ☆18 · Jul 3, 2017 · Updated 8 years ago
- Parquet file generator ☆22 · Apr 17, 2018 · Updated 8 years ago
- THIS REPOSITORY IS DEPRECATED ☆19 · Jul 6, 2023 · Updated 2 years ago
- A set of tools to ease working with Zookeeper and Kafka. ☆23 · Jan 22, 2016 · Updated 10 years ago
- An Elasticsearch plugin that updates specific fields of a document, avoiding a full reindex and reducing traffic costs ☆39 · Mar 1, 2014 · Updated 12 years ago
- ☆50 · Feb 11, 2020 · Updated 6 years ago
- A minimal seed template for an Akka gRPC with Scala build ☆19 · Jan 22, 2026 · Updated 3 months ago
- Parallel Streaming Transformation Loader ☆10 · Apr 23, 2019 · Updated 7 years ago
- Presto connector for Apache Kudu ☆48 · Mar 22, 2019 · Updated 7 years ago
- ☆12 · Dec 15, 2023 · Updated 2 years ago
- Framework for running macro benchmarks in a clustered environment ☆25 · Aug 29, 2022 · Updated 3 years ago
- an open source dataworks platform ☆21 · Jun 4, 2021 · Updated 4 years ago
- Monitoring cassandra cluster by ELK (Elasticsearch, logstash and Kibana) ☆20 · Mar 16, 2017 · Updated 9 years ago
- Examples for upgrading HBase applications to 1.0 APIs ☆13 · Jul 21, 2015 · Updated 10 years ago
- Protobuf serialization support for Apache Flink ☆21 · Jun 1, 2021 · Updated 4 years ago
- Concurrent streaming upload to Amazon S3 ☆15 · Feb 4, 2015 · Updated 11 years ago
- UberScriptQuery, a SQL-like DSL to make writing Spark jobs super easy