An extensible distributed system for reliable nearline data streaming at scale
☆958Mar 17, 2026Updated this week
Alternatives and similar repositories for brooklin
Users that are interested in brooklin are comparing it to the libraries listed below
Sorting:
- Cruise-control is the first of its kind to fully automate the dynamic workload rebalance and self-healing of a Kafka cluster. It provides…☆3,004Nov 6, 2025Updated 4 months ago
- Cruise Control Frontend (CCFE): Single Page Web Application to Manage Large Scale of Kafka Clusters☆371Aug 20, 2024Updated last year
- A distributed data integration framework that simplifies common aspects of big data integration such as data ingestion, replication, orga…☆2,260Updated this week
- Improvement of Apache Kafka Mirrormaker☆936Dec 16, 2023Updated 2 years ago
- Kafka Consumer Lag Checking☆3,944Jan 14, 2026Updated 2 months ago
- Apache Pinot - A realtime distributed OLAP datastore☆6,048Updated this week
- Pravega - Streaming as a new software defined storage primitive☆2,005Mar 2, 2025Updated last year
- Distributed object store☆1,782Updated this week
- Mirror of Apache Helix☆493Updated this week
- Mirus is a cross data-center data replication tool for Apache Kafka☆208Mar 1, 2026Updated 2 weeks ago
- Xinfra Monitor monitors the availability of Kafka clusters by producing synthetic workloads using end-to-end pipelines to obtain derived …☆2,056Mar 9, 2025Updated last year
- SQL-based streaming analytics platform at scale☆1,226Jun 21, 2020Updated 5 years ago
- Waltz is a quorum-based distributed write-ahead log for replicating transactions☆424Mar 29, 2023Updated 2 years ago
- Change data capture for a variety of databases. Please log issues at https://github.com/debezium/dbz/issues.☆12,514Updated this week
- Flink Controller implements a Kubernetes Custom Controller (aka Kubernetes Operator) for Apache Flink☆52Jan 26, 2026Updated last month
- Apache BookKeeper - a scalable, fault tolerant and low latency storage service optimized for append-only workloads☆1,988Updated this week
- A collection of open source Apache 2.0 Kafka Connector maintained by Lenses.io.☆1,059Feb 10, 2026Updated last month
- Apache Pulsar - distributed pub-sub messaging system☆15,168Updated this week
- ☆1,685Feb 27, 2026Updated 3 weeks ago
- An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Tr…☆8,635Mar 13, 2026Updated last week
- Upserts, Deletes And Incremental Processing on Big Data.☆6,118Updated this week
- Apache Iceberg☆8,636Updated this week
- Source-agnostic distributed change data capture system☆3,677Sep 28, 2023Updated 2 years ago
- Multi-hop declarative data pipelines☆125Updated this week
- A JVM-embeddable Distributed Database☆326Sep 1, 2025Updated 6 months ago
- A Relational Database Backed by Apache Kafka☆390Oct 15, 2025Updated 5 months ago
- Stream Processing and Complex Event Processing Engine☆1,579Mar 12, 2026Updated last week
- Distributed Big Data Orchestration Service☆1,763Jan 31, 2026Updated last month
- Change Data Capture (CDC) service☆451Jun 24, 2024Updated last year
- CMAK is a tool for managing Apache Kafka clusters☆11,942Aug 2, 2023Updated 2 years ago
- Apache Heron (Incubating) is a realtime, distributed, fault-tolerant stream processing engine from Twitter☆3,688Mar 1, 2023Updated 3 years ago
- Secor is a service implementing Kafka log persistence☆1,857Mar 10, 2026Updated last week
- Generic Data Ingestion & Dispersal Library for Hadoop☆482Mar 19, 2023Updated 3 years ago
- The Metadata Platform for your Data and AI Stack☆11,685Updated this week
- The live data layer for apps and AI agents. Create up-to-the-second views into your business, just using SQL☆6,251Updated this week
- A simplified, lightweight ETL Framework based on Apache Spark☆587Jan 24, 2024Updated 2 years ago
- li-apache-kafka-clients is a wrapper library for the Apache Kafka vanilla clients. It provides additional features such as large message …☆135Jul 7, 2023Updated 2 years ago
- Apache Druid: a high performance real-time analytics database.☆13,962Updated this week
- Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)☆12,645Updated this week