Change Data Capture (CDC) service
☆451Jun 24, 2024Updated last year
Alternatives and similar repositories for SpinalTap
Users that are interested in SpinalTap are comparing it to the libraries listed below
Sorting:
- Maxwell's daemon, a mysql-to-json kafka producer☆4,231Feb 16, 2026Updated 2 weeks ago
- Change data capture for a variety of databases. Please log issues at https://github.com/debezium/dbz/issues.☆12,472Updated this week
- A toolkit providing a uniform interface for connecting to and extracting data from a wide variety of (potentially remote) data stores (in…☆257Dec 8, 2025Updated 2 months ago
- MySQLStreamer is a database change data capture and publish system.☆411Aug 17, 2022Updated 3 years ago
- ReAir is a collection of easy-to-use tools for replicating tables and partitions between Hive data warehouses.☆281Feb 27, 2019Updated 7 years ago
- A scalable Netflix DBLog implementation for PostgreSQL☆280Dec 23, 2025Updated 2 months ago
- A distributed data integration framework that simplifies common aspects of big data integration such as data ingestion, replication, orga…☆2,261Updated this week
- Source-agnostic distributed change data capture system☆3,679Sep 28, 2023Updated 2 years ago
- StreamAlert is a serverless, realtime data analysis framework which empowers you to ingest, analyze, and alert on data from any environme…☆2,888Oct 23, 2023Updated 2 years ago
- An extensible distributed system for reliable nearline data streaming at scale☆956Feb 26, 2026Updated last week
- Collect, aggregate, and visualize a data ecosystem's metadata☆2,132Updated this week
- Apache Pinot - A realtime distributed OLAP datastore☆6,037Updated this week
- SQL-based streaming analytics platform at scale☆1,226Jun 21, 2020Updated 5 years ago
- Coral is a translation, analysis, and query rewrite engine for SQL and other relational languages.☆889Feb 9, 2026Updated 3 weeks ago
- A load balancer / proxy / gateway for prestodb☆358Jul 25, 2024Updated last year
- Egeria core☆898Updated this week
- Amundsen is a metadata driven application for improving the productivity of data analysts, data scientists and engineers when interacting…☆4,740Feb 19, 2026Updated 2 weeks ago
- Dremio - the missing link in modern data☆1,473Sep 26, 2025Updated 5 months ago
- A framework for rapid reporting API development; with out of the box support for high cardinality dimension lookups with druid.☆131Jan 17, 2025Updated last year
- A framework for writing performant user-defined functions (UDFs) that are portable across a variety of engines including Apache Spark, Ap…☆304Oct 30, 2025Updated 4 months ago
- Bullet is a streaming query engine that can be plugged into any singular data stream using a Stream Processing framework like Apache Stor…☆41Dec 14, 2022Updated 3 years ago
- ARCHIVED: A web UI for Debezium; Please log issues at https://issues.redhat.com/browse/DBZ.☆351Sep 17, 2025Updated 5 months ago
- Generic Data Ingestion & Dispersal Library for Hadoop☆482Mar 19, 2023Updated 2 years ago
- Iceberg is a table format for large, slow-moving tabular data☆490Apr 10, 2023Updated 2 years ago
- Cruise-control is the first of its kind to fully automate the dynamic workload rebalance and self-healing of a Kafka cluster. It provides…☆3,000Nov 6, 2025Updated 3 months ago
- Stream Processing and Complex Event Processing Engine☆1,577Aug 8, 2025Updated 6 months ago
- Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.☆3,588Feb 17, 2026Updated 2 weeks ago
- The live data layer for apps and AI agents. Create up-to-the-second views into your business, just using SQL☆6,239Updated this week
- A Relational Database Backed by Apache Kafka☆390Oct 15, 2025Updated 4 months ago
- Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.☆2,308Updated this week
- The Metadata Platform for your Data and AI Stack☆11,608Updated this week
- Kubernetes Operator for the Ververica Platform☆35Jan 19, 2023Updated 3 years ago
- An Open Standard for lineage metadata collection☆2,330Updated this week
- Privacy and Security focused Segment-alternative, in Golang and React☆4,369Updated this week
- Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)☆12,592Updated this week
- j.u.s.Stream alternative (synchronous only), reusable, faster, more operators, easier to use.☆18Feb 23, 2026Updated last week
- Simplified custom plugins for Trino☆16Jul 29, 2024Updated last year
- Apache Heron (Incubating) is a realtime, distributed, fault-tolerant stream processing engine from Twitter☆3,688Mar 1, 2023Updated 3 years ago
- Data Lineage Tracking And Visualization Solution☆656Updated this week