Maelstrom is an open source Kafka integration with Spark that is designed to be developer friendly, high performance (millisecond stream processing), scalable (consumes messges at Spark worker nodes), and is extremely reliable.
☆22Feb 6, 2017Updated 9 years ago
Alternatives and similar repositories for maelstrom
Users that are interested in maelstrom are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A parser for haproxy logs in Python☆14Feb 13, 2011Updated 15 years ago
- Distributed SQL base Realtime Streaming Computation Framework On Apache Storm, Spark☆12Mar 14, 2016Updated 10 years ago
- Library and a Framework for building fast, scalable, fault-tolerant Data APIs based on Akka, Avro, ZooKeeper and Kafka☆24Oct 16, 2020Updated 5 years ago
- riemann tool for cassandra☆31May 19, 2016Updated 9 years ago
- Open source task scheduler with dependency management☆15Jul 1, 2018Updated 7 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- A quotation-based Scala DSL for scalable data analysis.☆63Jul 7, 2022Updated 3 years ago
- This is a simple CEP Engine leveraging the Kafka Streams platform☆16Apr 25, 2017Updated 8 years ago
- Spark package to "plug" holes in data using SQL based rules ⚡️ 🔌☆29May 15, 2020Updated 5 years ago
- A file system backed by AntidoteDB.☆13Jun 10, 2021Updated 4 years ago
- Spark CEP is an extension of Spark Streaming to support SQL-based query processing☆57Apr 12, 2017Updated 8 years ago
- Open-source distribute workflow schedule tools, also support streaming task.☆39Nov 11, 2017Updated 8 years ago
- DataStax Enterprise (DSE) Deployment Guide for Google Cloud Platform (GCP)☆10Apr 10, 2020Updated 5 years ago
- Temporal_Graph_library☆25Feb 2, 2019Updated 7 years ago
- Scala API for Apache Spark SQL high-order functions☆14Aug 4, 2023Updated 2 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- A distributed generic query layer for Apache Kafka Interactive Queries☆26Nov 8, 2017Updated 8 years ago
- Demonstration of a Hive Input Format for Iceberg☆26Mar 12, 2021Updated 5 years ago
- A core AST and utilities to manipulate geographical data☆22Sep 30, 2022Updated 3 years ago
- Plot live-stats as graph from ApacheSpark application using Lightning-viz☆18Jul 3, 2017Updated 8 years ago
- Euphoria is an open source Java API for creating unified big-data processing flows. It provides an engine independent programming model w…☆81Nov 15, 2022Updated 3 years ago
- Parquet file generator☆22Apr 17, 2018Updated 7 years ago
- THIS REPOSITORY IS DEPRECATED☆19Jul 6, 2023Updated 2 years ago
- A set of tools to ease working with Zookeeper and Kafka.☆23Jan 22, 2016Updated 10 years ago
- ☆50Feb 11, 2020Updated 6 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- A minimal seed template for an Akka gRPC with Scala build☆19Jan 22, 2026Updated 2 months ago
- Parallel Streaming Transformation Loader☆10Apr 23, 2019Updated 6 years ago
- Presto connector for Apache Kudu☆48Mar 22, 2019Updated 7 years ago
- Framework for running macro benchmarks in a clustered environment☆25Aug 29, 2022Updated 3 years ago
- an open source dataworks platform☆21Jun 4, 2021Updated 4 years ago
- Monitoring cassandra cluster by ELK (Elasticsearch , logstash and Kibana)☆20Mar 16, 2017Updated 9 years ago
- Protobuf serialization support for Apache Flink☆21Jun 1, 2021Updated 4 years ago
- Prometheus exporter which fetches JSON from a URL and exports one of the values as gauge metrics☆24Mar 16, 2019Updated 7 years ago
- Develop streaming applications for IBM Streams in Python, Java & Scala.☆28Jul 24, 2022Updated 3 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- UberScriptQuery, a SQL-like DSL to make writing Spark jobs super easy☆64Dec 17, 2023Updated 2 years ago
- Example code for building your own MemSQL Streamliner Pipelines☆23Apr 18, 2017Updated 8 years ago
- An open source enterprise data warehousing and analysis platform.☆22Nov 8, 2021Updated 4 years ago
- akkaflow是一个基于akka架构上构建的分布式高可用DAG工作流调度工具,可以把子节点分配在集群机器上并行执行,高效利用集群资源。☆107Sep 14, 2019Updated 6 years ago
- A Go wrapper for the CyberArk Vault API☆13Mar 9, 2017Updated 9 years ago
- 📝 A generic list implementation in Go for easy functional programming☆11Aug 8, 2024Updated last year
- Be a silentor,just focus on mark your words down!☆12Jul 18, 2015Updated 10 years ago