Maelstrom is an open source Kafka integration with Spark that is designed to be developer friendly, high performance (millisecond stream processing), scalable (consumes messges at Spark worker nodes), and is extremely reliable.
☆22Feb 6, 2017Updated 9 years ago
Alternatives and similar repositories for maelstrom
Users that are interested in maelstrom are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A parser for haproxy logs in Python☆14Feb 13, 2011Updated 15 years ago
- Distributed SQL base Realtime Streaming Computation Framework On Apache Storm, Spark☆12Mar 14, 2016Updated 10 years ago
- riemann tool for cassandra☆31May 19, 2016Updated 9 years ago
- Open source task scheduler with dependency management☆15Jul 1, 2018Updated 7 years ago
- A quotation-based Scala DSL for scalable data analysis.☆64Jul 7, 2022Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Spark CEP is an extension of Spark Streaming to support SQL-based query processing☆57Apr 12, 2017Updated 8 years ago
- Open-source distribute workflow schedule tools, also support streaming task.☆39Nov 11, 2017Updated 8 years ago
- DataStax Enterprise (DSE) Deployment Guide for Google Cloud Platform (GCP)☆10Apr 10, 2020Updated 5 years ago
- Scala API for Apache Spark SQL high-order functions☆14Aug 4, 2023Updated 2 years ago
- golang microservice prototype☆10Oct 23, 2016Updated 9 years ago
- Human-friendly Cron replacement in NodeJS☆96May 21, 2013Updated 12 years ago
- Ruby tools for Elastic Search☆29Mar 7, 2025Updated last year
- A distributed generic query layer for Apache Kafka Interactive Queries☆26Nov 8, 2017Updated 8 years ago
- Demonstration of a Hive Input Format for Iceberg☆26Mar 12, 2021Updated 5 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- A core AST and utilities to manipulate geographical data☆21Sep 30, 2022Updated 3 years ago
- Plot live-stats as graph from ApacheSpark application using Lightning-viz☆18Jul 3, 2017Updated 8 years ago
- Euphoria is an open source Java API for creating unified big-data processing flows. It provides an engine independent programming model w…☆81Nov 15, 2022Updated 3 years ago
- Parquet file generator☆22Apr 17, 2018Updated 7 years ago
- THIS REPOSITORY IS DEPRECATED☆19Jul 6, 2023Updated 2 years ago
- A set of tools to ease working with Zookeeper and Kafka.☆23Jan 22, 2016Updated 10 years ago
- A proper shell library client for the Librato API☆18Sep 21, 2015Updated 10 years ago
- an elasticsearch plugin that allows to update a specify fileds of a document,avoid full reindex and reduce traffic costs☆39Mar 1, 2014Updated 12 years ago
- ☆50Feb 11, 2020Updated 6 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- A minimal seed template for an Akka gRPC with Scala build☆19Jan 22, 2026Updated 2 months ago
- Parallel Streaming Transformation Loader☆10Apr 23, 2019Updated 6 years ago
- Standalone executable to read the tags of an EC2 instance☆11Jan 7, 2017Updated 9 years ago
- Presto connector for Apache Kudu☆48Mar 22, 2019Updated 7 years ago
- Framework for running macro benchmarks in a clustered environment☆25Aug 29, 2022Updated 3 years ago
- ☆12Dec 15, 2023Updated 2 years ago
- an open source dataworks platform☆21Jun 4, 2021Updated 4 years ago
- Monitoring cassandra cluster by ELK (Elasticsearch , logstash and Kibana)☆20Mar 16, 2017Updated 9 years ago
- Examples for upgrading HBase applications to 1.0 APIs☆13Jul 21, 2015Updated 10 years ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- Protobuf serialization support for Apache Flink☆21Jun 1, 2021Updated 4 years ago
- Concurrent streaming upload to Amazon S3☆15Feb 4, 2015Updated 11 years ago
- Develop streaming applications for IBM Streams in Python, Java & Scala.☆28Jul 24, 2022Updated 3 years ago
- UberScriptQuery, a SQL-like DSL to make writing Spark jobs super easy☆64Dec 17, 2023Updated 2 years ago
- Example code for building your own MemSQL Streamliner Pipelines☆23Apr 18, 2017Updated 8 years ago
- An open source enterprise data warehousing and analysis platform.☆22Nov 8, 2021Updated 4 years ago
- Sample Silex Based Rest API that uses Json Web Tokens for Authentication☆12Nov 3, 2016Updated 9 years ago