memiiso / debezium-server-icebergView external linksLinks
Low Cost, Simple and Scalable Way of Data Replication to Apache Iceberg/Cloud/Data Lake
☆299Updated this week
Alternatives and similar repositories for debezium-server-iceberg
Users that are interested in debezium-server-iceberg are comparing it to the libraries listed below
Sorting:
- ☆81Apr 23, 2025Updated 9 months ago
- Replicates any database (CDC events) to Bigquery in real time☆23Dec 2, 2025Updated 2 months ago
- Lakekeeper is an Apache-Licensed, secure, fast and easy to use Apache Iceberg REST Catalog written in Rust.☆1,175Feb 8, 2026Updated last week
- Apache Amoro(incubating) is a Lakehouse management system built on open data lake formats.☆1,109Feb 9, 2026Updated last week
- Nessie: Transactional Catalog for Data Lakes with Git-like semantics☆1,413Updated this week
- Open Control Plane for Tables in Data Lakehouse☆380Updated this week
- Docker envinroment to stream data from Kafka to Iceberg tables☆30Feb 27, 2024Updated last year
- Apache Polaris, the interoperable, open source catalog for Apache Iceberg☆1,825Feb 8, 2026Updated last week
- Apache iceberg Spark s3 examples☆21Mar 1, 2024Updated last year
- Coral is a translation, analysis, and query rewrite engine for SQL and other relational languages.☆886Feb 9, 2026Updated last week
- ☆376Updated this week
- A highly efficient daemon for streaming data from Kafka into Delta Lake☆427May 5, 2025Updated 9 months ago
- 🌊 Continuously synchronize the systems where your data lives, to the systems where you _want_ it to live, with Estuary Flow. 🌊☆876Updated this week
- Apache XTable (incubating) is a cross-table converter for lakehouse table formats that facilitates interoperability across data processin…☆1,157Feb 1, 2026Updated 2 weeks ago
- Apache Paimon is a lake format that enables building a Realtime Lakehouse Architecture with Flink and Spark for both streaming and batch …☆3,182Updated this week
- Apache Iceberg☆8,534Updated this week
- A high-performance data streaming system using DuckDB and Apache Arrow Flight.☆96Feb 22, 2025Updated 11 months ago
- This repo demonstrates an Apache Arrow Flight server implementation in Kubernetes.☆12Oct 25, 2024Updated last year
- Apache Spark build compatible with AWS Glue Data Catalog.☆19Aug 9, 2021Updated 4 years ago
- The Amazon S3 Tables catalog is a client library that bridges control plane operations provided by S3 Tables to engines like Apache Spark…☆149Jan 26, 2026Updated 2 weeks ago
- Apache Paimon Rust The rust implementation of Apache Paimon.☆141Apr 22, 2025Updated 9 months ago
- Enables Python developers to leverage Debezium's CDC capabilities with custom event handlers and seamless integration.☆39Feb 3, 2026Updated last week
- Unofficial rust implementation of Apache Iceberg with integration for Datafusion☆233Updated this week
- A simplified, lightweight ETL Framework based on Apache Spark☆589Jan 24, 2024Updated 2 years ago
- Yet Another (Spark) ETL Framework☆21Oct 21, 2023Updated 2 years ago
- PyIceberg☆997Updated this week
- The Trino (https://trino.io/) adapter plugin for dbt (https://getdbt.com)☆253Updated this week
- Apache Iceberg☆1,220Updated this week
- This is a repo with links to everything you'd ever want to learn about data engineering☆11Dec 3, 2024Updated last year
- Turning PySpark Into a Universal DataFrame API☆486Updated this week
- API Framework heavily relying on the power of DuckDB and DuckDB extensions. Ready to build performant and cost-efficient APIs on top of B…☆65Feb 7, 2026Updated last week
- A CDC library in Rust.☆25Feb 17, 2024Updated last year
- Scalable and efficient data transformation framework - backwards compatible with dbt.☆2,891Updated this week
- The AWS Glue Data Catalog is a fully managed, Apache Hive Metastore compatible, metadata repository. Customers can use the Data Catalog a…☆227Mar 19, 2025Updated 10 months ago
- Open, Multi-modal Catalog for Data & AI☆3,304Feb 6, 2026Updated last week
- CLI tool to bulk migrate the tables from one catalog another without a data copy☆83Apr 12, 2025Updated 10 months ago
- ☆25Mar 15, 2024Updated last year
- Kubernetes operator for managing the lifecycle of Apache Spark applications on Kubernetes.☆3,103Updated this week
- Apache Kyuubi is a distributed and multi-tenant gateway to provide serverless SQL on data warehouses and lakehouses.☆2,300Feb 6, 2026Updated last week