airbnb/SpinalTap

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/airbnb/SpinalTap)

airbnb / SpinalTap

Change Data Capture (CDC) service

☆450

Alternatives and similar repositories for SpinalTap

Users that are interested in SpinalTap are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

airbnb / omniduct
View on GitHub
A toolkit providing a uniform interface for connecting to and extracting data from a wide variety of (potentially remote) data stores (in…
☆257Apr 22, 2026Updated 3 months ago
YelpArchive / mysql_streamer
View on GitHub
MySQLStreamer is a database change data capture and publish system.
☆411Aug 17, 2022Updated 3 years ago
debezium / debezium
View on GitHub
Change data capture for a variety of databases. Please log issues at https://github.com/debezium/dbz/issues.
☆12,953Updated this week
zendesk / maxwell
View on GitHub
Maxwell's daemon, a mysql-to-json kafka producer
☆4,256Jul 18, 2026Updated last week
airbnb / reair
View on GitHub
ReAir is a collection of easy-to-use tools for replicating tables and partitions between Hive data warehouses.
☆282Feb 27, 2019Updated 7 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
linkedin / brooklin
View on GitHub
An extensible distributed system for reliable nearline data streaming at scale
☆963Jul 16, 2026Updated last week
airbnb / streamalert
View on GitHub
StreamAlert is a serverless, realtime data analysis framework which empowers you to ingest, analyze, and alert on data from any environme…
☆2,888Oct 23, 2023Updated 2 years ago
replicase / pgcapture
View on GitHub
A scalable Netflix DBLog implementation for PostgreSQL
☆283May 21, 2026Updated 2 months ago
apache / gobblin
View on GitHub
A distributed data integration framework that simplifies common aspects of big data integration such as data ingestion, replication, orga…
☆2,269Jun 24, 2026Updated last month
linkedin / databus
View on GitHub
Source-agnostic distributed change data capture system
☆3,679Sep 28, 2023Updated 2 years ago
MarquezProject / marquez
View on GitHub
Collect, aggregate, and visualize a data ecosystem's metadata
☆2,248Updated this week
lyft / presto-gateway
View on GitHub
A load balancer / proxy / gateway for prestodb
☆359Jul 25, 2024Updated 2 years ago
uber-archive / AthenaX
View on GitHub
SQL-based streaming analytics platform at scale
☆1,223Jun 21, 2020Updated 6 years ago
kjmrknsn / livy-manager
View on GitHub
Livy Manager - Web UI for Managing Apache Livy Sessions
☆16Dec 7, 2017Updated 8 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
apache / pinot
View on GitHub
Apache Pinot - A realtime distributed OLAP datastore
☆6,114Updated this week
debezium / debezium-ui
View on GitHub
ARCHIVED: A web UI for Debezium; Please log issues at https://issues.redhat.com/browse/DBZ.
☆352Sep 17, 2025Updated 10 months ago
amundsen-io / amundsen
View on GitHub
Amundsen is a metadata driven application for improving the productivity of data analysts, data scientists and engineers when interacting…
☆4,780Jul 1, 2026Updated 3 weeks ago
uber / queryparser
View on GitHub
Parsing and analysis of Vertica, Hive, and Presto SQL.
☆1,078Feb 16, 2022Updated 4 years ago
fintechstudios / ververica-platform-k8s-operator
View on GitHub
Kubernetes Operator for the Ververica Platform
☆36Jan 19, 2023Updated 3 years ago
uber / marmaray
View on GitHub
Generic Data Ingestion & Dispersal Library for Hadoop
☆483Mar 19, 2023Updated 3 years ago
yahoo / maha
View on GitHub
A framework for rapid reporting API development; with out of the box support for high cardinality dimension lookups with druid.
☆133Jan 17, 2025Updated last year
linkedin / coral
View on GitHub
Coral is a translation, analysis, and query rewrite engine for SQL and other relational languages.
☆907Jul 20, 2026Updated last week
uber / storagetapper
View on GitHub
StorageTapper is a scalable realtime MySQL change data streaming, logical backup and logical replication service
☆363Mar 19, 2023Updated 3 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
OpenLineage / OpenLineage
View on GitHub
An Open Standard for lineage metadata collection
☆2,562Updated this week
dbcli / sqlcomplete
View on GitHub
SQL Completion engine written in Python
☆23Feb 21, 2022Updated 4 years ago
Radico / trino-plugins
View on GitHub
Simplified custom plugins for Trino
☆16Jul 29, 2024Updated last year
MaterializeInc / materialize
View on GitHub
The live data layer for apps and AI agents. Create up-to-the-second views into your business, just using SQL
☆6,343Updated this week
airbnb / knowledge-repo
View on GitHub
A next-generation curated knowledge sharing platform for data scientists and other technical professions.
☆5,540Sep 4, 2024Updated last year
datahub-project / datahub
View on GitHub
The Context Platform for your Data and AI Stack
☆12,356Updated this week
rayokota / kareldb
View on GitHub
A Relational Database Backed by Apache Kafka
☆390Oct 15, 2025Updated 9 months ago
dremio / dremio-oss
View on GitHub
Dremio - the missing link in modern data
☆1,490Sep 26, 2025Updated 10 months ago
odpi / egeria
View on GitHub
Egeria core
☆918Updated this week
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
linkedin / cruise-control
View on GitHub
Cruise-control is the first of its kind to fully automate the dynamic workload rebalance and self-healing of a Kafka cluster. It provides…
☆3,035Updated this week
confluentinc / ksql
View on GitHub
The database purpose-built for stream processing applications.
☆311Updated this week
rudderlabs / rudder-server
View on GitHub
Privacy and Security focused Segment-alternative, in Golang and React
☆4,460Updated this week
Netflix / iceberg
View on GitHub
Iceberg is a table format for large, slow-moving tabular data
☆494Apr 10, 2023Updated 3 years ago
cadence-workflow / cadence
View on GitHub
Cadence is a distributed, scalable, durable, and highly available orchestration engine to execute asynchronous long-running business logi…
☆9,376Updated this week
apache / kyuubi
View on GitHub
Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.
☆2,353Updated this week
awslabs / deequ
View on GitHub
Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.
☆3,638Updated this week