developer-advocacy-dremio / flink-iceberg-nessie-environmentLinks
resources for trying out a nessie-flink-iceberg setup
☆11Updated last year
Alternatives and similar repositories for flink-iceberg-nessie-environment
Users that are interested in flink-iceberg-nessie-environment are comparing it to the libraries listed below
Sorting:
- A sample implementation of stream writes to an Iceberg table on GCS using Flink and reading it using Trino☆21Updated 3 years ago
- Presto Trino with Apache Hive Postgres metastore☆42Updated 10 months ago
- ☆58Updated 11 months ago
- Yet Another (Spark) ETL Framework☆21Updated last year
- Examples for using Apache Flink® with DataStream API, Table API, Flink SQL and connectors such as MySQL, JDBC, CDC, Kafka.☆64Updated last year
- ☆40Updated 2 years ago
- ☆17Updated 3 years ago
- Streaming Synthetic Sales Data Generator: Streaming sales data generator for Apache Kafka, written in Python☆44Updated 2 years ago
- [KAFKA-9774] Un-official Docker Image for Apache Kafka Connect☆44Updated last week
- Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake.☆29Updated last week
- This repository contains recipes for Apache Pinot.☆30Updated 4 months ago
- Docker images for Trino integration testing☆53Updated this week
- Python script to generate a docker-compose.yaml file based on templates and parameters☆72Updated 3 weeks ago
- BigQuery connector for Apache Flink☆32Updated this week
- Example for article Running Spark 3 with standalone Hive Metastore 3.0☆99Updated 2 years ago
- Aiven's S3 Sink Connector for Apache Kafka®☆70Updated 10 months ago
- dbt + Trino demo project, using TPC-H sample data☆19Updated last year
- Docker envinroment to stream data from Kafka to Iceberg tables☆29Updated last year
- ☆25Updated last year
- SQL CLI for Apache Flink® via docker-compose☆50Updated last year
- A testing framework for Trino☆26Updated 3 months ago
- Code snippets used in demos recorded for the blog.☆37Updated last month
- ☆14Updated last month
- ☆32Updated last week
- DataOps Observability is part of DataKitchen's Open Source Data Observability. DataOps Observability monitors every data journey from da…☆46Updated last month
- Scalable CDC Pattern Implemented using PySpark☆18Updated 6 years ago
- ☆80Updated 2 months ago
- Playground for Lakehouse (Iceberg, Hudi, Spark, Flink, Trino, DBT, Airflow, Kafka, Debezium CDC)☆59Updated last year
- Smart Automation Tool for building modern Data Lakes and Data Pipelines☆124Updated this week
- Apache flink☆14Updated this week