Wuerike / kafka-iceberg-streaming
Docker environment to stream data from Kafka to Iceberg tables
☆29 · Updated last year
Alternatives and similar repositories for kafka-iceberg-streaming
Users who are interested in kafka-iceberg-streaming are comparing it to the repositories listed below
- Sample code to collect Apache Iceberg metrics for table monitoring ☆28 · Updated 10 months ago
- Presto Trino with Apache Hive Postgres metastore ☆41 · Updated 9 months ago
- Yet Another (Spark) ETL Framework ☆21 · Updated last year
- Iceberg Playground in a Box ☆52 · Updated 2 weeks ago
- Pythonic Iceberg REST Catalog ☆1 · Updated this week
- Unity Catalog UI ☆40 · Updated 9 months ago
- ☆15 · Updated 2 years ago
- Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake. ☆29 · Updated last week
- Utility functions for dbt projects running on Trino ☆21 · Updated last year
- Adapter for dbt that executes dbt pipelines on Apache Flink ☆95 · Updated last year
- Apache Hive Metastore as a Standalone server in Docker ☆78 · Updated 9 months ago
- Terraform Provider for Airbyte API ☆55 · Updated last week
- Replicates any database (CDC events) to Bigquery in real time ☆22 · Updated last week
- ☆87 · Updated 5 months ago
- ☆18 · Updated last year
- Sparglim✨ makes PySpark App Configurable and Deploy Spark Connect Server Easier! ☆37 · Updated 3 months ago
- ☆25 · Updated last year
- Dashboard for operating Flink jobs and deployments. ☆36 · Updated 7 months ago
- ☆80 · Updated last month
- To provide a deeper understanding of how the modern, open-source data stack consisting of Iceberg, dbt, Trino, and Hive operates within a… ☆35 · Updated last year
- Streaming Synthetic Sales Data Generator: Streaming sales data generator for Apache Kafka, written in Python ☆44 · Updated 2 years ago
- A Table format agnostic data sharing framework ☆38 · Updated last year
- ☆10 · Updated 2 years ago
- dbt + Trino demo project, using TPC-H sample data ☆19 · Updated last year
- Python package for querying iceberg data through duckdb. ☆69 · Updated last year
- Trino dbt demo project to mix and load BigQuery data with and in a local PostgreSQL database ☆75 · Updated 3 years ago
- Use dbt to manage real-time data transformations in RisingWave. ☆27 · Updated this week
- Make simple storing test results and visualisation of these in a BI dashboard ☆45 · Updated this week
- Snowflake connector repository for the Apache Flink project ☆37 · Updated last month
- A sample implementation of stream writes to an Iceberg table on GCS using Flink and reading it using Trino ☆20 · Updated 3 years ago