Maelstrom is an open source Kafka integration with Spark that is designed to be developer friendly, high performance (millisecond stream processing), scalable (consumes messges at Spark worker nodes), and is extremely reliable.
☆21Feb 6, 2017Updated 9 years ago
Alternatives and similar repositories for maelstrom
Users that are interested in maelstrom are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A parser for haproxy logs in Python☆14Feb 13, 2011Updated 15 years ago
- Distributed SQL base Realtime Streaming Computation Framework On Apache Storm, Spark☆12Mar 14, 2016Updated 10 years ago
- Library and a Framework for building fast, scalable, fault-tolerant Data APIs based on Akka, Avro, ZooKeeper and Kafka☆25Oct 16, 2020Updated 5 years ago
- Open source task scheduler with dependency management☆15Jul 1, 2018Updated 7 years ago
- A quotation-based Scala DSL for scalable data analysis.☆65Jul 7, 2022Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- This is a simple CEP Engine leveraging the Kafka Streams platform☆16Apr 25, 2017Updated 9 years ago
- Spark package to "plug" holes in data using SQL based rules ⚡️ 🔌☆28May 15, 2020Updated 6 years ago
- A file system backed by AntidoteDB.☆13Jun 10, 2021Updated 5 years ago
- Spark CEP is an extension of Spark Streaming to support SQL-based query processing☆58Apr 12, 2017Updated 9 years ago
- Open-source distribute workflow schedule tools, also support streaming task.☆40Nov 11, 2017Updated 8 years ago
- DataStax Enterprise (DSE) Deployment Guide for Google Cloud Platform (GCP)☆10Apr 10, 2020Updated 6 years ago
- Scala API for Apache Spark SQL high-order functions☆14Aug 4, 2023Updated 2 years ago
- A distributed generic query layer for Apache Kafka Interactive Queries☆26Nov 8, 2017Updated 8 years ago
- Demonstration of a Hive Input Format for Iceberg☆26Mar 12, 2021Updated 5 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A core AST and utilities to manipulate geographical data☆21Sep 30, 2022Updated 3 years ago
- Simple URI routing and generation in Javascript☆58Mar 26, 2014Updated 12 years ago
- Plot live-stats as graph from ApacheSpark application using Lightning-viz☆18Jul 3, 2017Updated 8 years ago
- Euphoria is an open source Java API for creating unified big-data processing flows. It provides an engine independent programming model w…☆81Nov 15, 2022Updated 3 years ago
- THIS REPOSITORY IS DEPRECATED☆19Jul 6, 2023Updated 2 years ago
- an elasticsearch plugin that allows to update a specify fileds of a document,avoid full reindex and reduce traffic costs☆39Mar 1, 2014Updated 12 years ago
- ☆51May 21, 2026Updated last month
- Parallel Streaming Transformation Loader☆10Apr 23, 2019Updated 7 years ago
- Presto connector for Apache Kudu☆48Mar 22, 2019Updated 7 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆12Dec 15, 2023Updated 2 years ago
- Framework for running macro benchmarks in a clustered environment☆25Aug 29, 2022Updated 3 years ago
- an open source dataworks platform☆20Jun 4, 2021Updated 5 years ago
- Monitoring cassandra cluster by ELK (Elasticsearch , logstash and Kibana)☆20Mar 16, 2017Updated 9 years ago
- Examples for upgrading HBase applications to 1.0 APIs☆13Jul 21, 2015Updated 10 years ago
- Protobuf serialization support for Apache Flink☆21Jun 1, 2021Updated 5 years ago
- Prometheus exporter which fetches JSON from a URL and exports one of the values as gauge metrics☆24Mar 16, 2019Updated 7 years ago
- Develop streaming applications for IBM Streams in Python, Java & Scala.☆29Jul 24, 2022Updated 3 years ago
- UberScriptQuery, a SQL-like DSL to make writing Spark jobs super easy☆65Dec 17, 2023Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Example code for building your own MemSQL Streamliner Pipelines☆23Apr 18, 2017Updated 9 years ago
- A Go wrapper for the CyberArk Vault API☆13Mar 9, 2017Updated 9 years ago
- 📝 A generic list implementation in Go for easy functional programming☆11May 6, 2026Updated last month
- Be a silentor,just focus on mark your words down!☆12Jul 18, 2015Updated 10 years ago
- Web Based Kafka Consumer and Producer☆70Jan 29, 2020Updated 6 years ago
- Node gadgets for Elastic Search☆55Oct 22, 2010Updated 15 years ago
- HashiCorp Vault Connector for Mule 4☆12May 23, 2024Updated 2 years ago