Wuerike / kafka-iceberg-streamingLinks
Docker envinroment to stream data from Kafka to Iceberg tables
☆29Updated last year
Alternatives and similar repositories for kafka-iceberg-streaming
Users that are interested in kafka-iceberg-streaming are comparing it to the libraries listed below
Sorting:
- Presto Trino with Apache Hive Postgres metastore☆42Updated 10 months ago
- ☆58Updated this week
- Yet Another (Spark) ETL Framework☆21Updated last year
- Unity Catalog UI☆40Updated 10 months ago
- Apache Hive Metastore as a Standalone server in Docker☆79Updated 10 months ago
- Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake.☆29Updated 3 weeks ago
- ☆90Updated 5 months ago
- Example for article Running Spark 3 with standalone Hive Metastore 3.0☆99Updated 2 years ago
- ☆10Updated 2 years ago
- Trino dbt demo project to mix and load BigQuery data with and in a local PostgreSQL database☆75Updated 3 years ago
- ☆80Updated 2 months ago
- Dashboard for operating Flink jobs and deployments.☆37Updated 7 months ago
- Repository of helm charts for deploying DataHub on a Kubernetes cluster☆190Updated this week
- Adapter for dbt that executes dbt pipelines on Apache Flink☆95Updated last year
- Replicates any database (CDC events) to Bigquery in real time☆22Updated this week
- CLI tool to bulk migrate the tables from one catalog another without a data copy☆79Updated 3 months ago
- Sample code to collect Apache Iceberg metrics for table monitoring☆28Updated 10 months ago
- Support for generating modern platforms dynamically with services such as Kafka, Spark, Streamsets, HDFS, ....☆75Updated last week
- A Table format agnostic data sharing framework☆38Updated last year
- Iceberg Playground in a Box☆56Updated 2 weeks ago
- Python package for querying iceberg data through duckdb.☆70Updated last year
- dbt + Trino demo project, using TPC-H sample data☆19Updated last year
- Apache Flink (Pyflink) and Related Projects☆40Updated 2 months ago
- Sparglim✨ makes PySpark App Configurable and Deploy Spark Connect Server Easier!☆37Updated 4 months ago
- ☆40Updated 2 years ago
- ☆25Updated last year
- 🌟 Examples of use cases that utilize Decodable, as well as demos for related open-source projects such as Apache Flink, Debezium, and Po…☆78Updated 3 weeks ago
- Enables Python developers to leverage Debezium's CDC capabilities with custom event handlers and seamless integration.☆29Updated 2 months ago
- Operator for Apache Spark-on-Kubernetes for Stackable Data Platform☆64Updated this week
- Multi-hop declarative data pipelines☆117Updated last month