tj--- / iceberg-demo
A sample implementation that streams writes to an Iceberg table on GCS using Flink and reads the table using Trino
☆19 Updated 2 years ago
Alternatives and similar repositories for iceberg-demo:
Users who are interested in iceberg-demo are comparing it to the repositories listed below
- Presto Trino with Apache Hive Postgres metastore ☆40 Updated 5 months ago
- ☆40 Updated last year
- Adapter for dbt that executes dbt pipelines on Apache Flink ☆90 Updated 11 months ago
- dbt-starrocks contains all of the code enabling dbt to work with StarRocks ☆23 Updated this week
- ☆47 Updated 6 months ago
- Examples for using Apache Flink® with DataStream API, Table API, Flink SQL and connectors such as MySQL, JDBC, CDC, Kafka. ☆62 Updated last year
- Example for article Running Spark 3 with standalone Hive Metastore 3.0 ☆97 Updated 2 years ago
- Trino Connector for Apache Paimon. ☆31 Updated 2 months ago
- Playground for Lakehouse (Iceberg, Hudi, Spark, Flink, Trino, DBT, Airflow, Kafka, Debezium CDC) ☆50 Updated last year
- ☆79 Updated last year
- Real-time Data Warehouse with Apache Flink & Apache Kafka & Apache Hudi ☆112 Updated last year
- Apache Hive Metastore as a Standalone server in Docker ☆68 Updated 6 months ago
- Examples of Spark 3.0 ☆47 Updated 4 years ago
- DataHub Actions is a framework for responding to changes to your DataHub Metadata Graph in real time. ☆43 Updated this week
- ☆25 Updated 5 months ago
- A playground to experience Gravitino ☆40 Updated this week
- Storage connector for Trino ☆103 Updated this week
- Smart Automation Tool for building modern Data Lakes and Data Pipelines ☆118 Updated last week
- A tool that makes it easy to run modular Trino environments locally. ☆32 Updated 2 months ago
- The Workload Analyzer collects Presto® and Trino workload statistics, and analyzes them ☆135 Updated last year
- Sample code to collect Apache Iceberg metrics for table monitoring ☆24 Updated 6 months ago
- Trino dbt demo project to mix and load BigQuery data with and in a local PostgreSQL database ☆72 Updated 3 years ago
- Unity Catalog UI ☆39 Updated 5 months ago
- Trino plugin for logging query events into a separate log file. ☆39 Updated 2 years ago
- Low Cost, Simple and Scalable Way of Data Replication to Apache Iceberg/Cloud/Data Lake ☆228 Updated this week
- Instructions for getting started with Ververica Platform on minikube. ☆91 Updated last month
- Flink dynamic CEP demo ☆18 Updated 2 years ago
- Streaming Synthetic Sales Data Generator: Streaming sales data generator for Apache Kafka, written in Python ☆42 Updated 2 years ago
- Apache Flink ☆46 Updated 2 weeks ago
- Spark-Dashboard is a solution for monitoring Apache Spark jobs. This repository provides the tooling and configuration for deploying an A… ☆118 Updated 3 weeks ago