wirelessr / flink-iceberg-playgroundLinks
minio as local storage and DynamoDB as catalog
☆15Updated last year
Alternatives and similar repositories for flink-iceberg-playground
Users that are interested in flink-iceberg-playground are comparing it to the libraries listed below
Sorting:
- This is a basic Apache Pinot example for ingesting real-time MySQL change logs using Debezium☆27Updated 4 years ago
- ☆22Updated 6 years ago
- Dockerized runner, utilities, and functions for FlinkSQL applications☆17Updated this week
- Java implementation for performing operations on Apache Iceberg and Hive tables☆19Updated last month
- PySpark for ETL jobs including lineage to Apache Atlas in one script via code inspection☆18Updated 8 years ago
- A testing framework for Trino☆26Updated 3 months ago
- A Python Client for Hive Metastore☆12Updated last year
- Optimizing downstream data processing with Amazon Kinesis Data Firehose and Amazon EMR running Apache Spark☆14Updated 2 years ago
- This repository contains recipes for Apache Pinot.☆30Updated 3 months ago
- Apache iceberg Spark s3 examples☆20Updated last year
- Demos using Conduktor Gateway☆17Updated last year
- Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake.☆29Updated this week
- ARCHIVED: Run Debezium/KafkaConnect CDC components in Kubernetes☆24Updated 6 years ago
- ☆10Updated 2 years ago
- Dashboard for operating Flink jobs and deployments.☆36Updated 7 months ago
- Docker envinroment to stream data from Kafka to Iceberg tables☆29Updated last year
- Apache flink☆14Updated 7 months ago
- Using the Parquet file format (with Avro) to process data with Apache Flink☆14Updated 9 years ago
- Apache Hive Metastore in Standalone Mode With Docker☆13Updated 11 months ago
- Automatically loads new partitions in AWS Athena☆19Updated 4 years ago
- Demonstration of a Hive Input Format for Iceberg☆26Updated 4 years ago
- NiFi processors for Apache Pulsar☆10Updated 3 years ago
- Connect DBVisualizer to Hortonwork HiveServer2☆9Updated 10 years ago
- Data Profiler for AWS Glue Data Catalog application as described in the AWS Big Data Blog post "Build an automatic data profiling and rep…☆20Updated 5 years ago
- Sample code to collect Apache Iceberg metrics for table monitoring☆28Updated 10 months ago
- ☆58Updated 10 months ago
- a curated list of awesome lakehouse frameworks, applications, etc☆32Updated 4 months ago
- Kafka Connect playground☆10Updated 5 years ago
- Yet Another (Spark) ETL Framework☆21Updated last year
- Traditionally, engineers were needed to implement business logic via data pipelines before business users can start using it. Using this …☆12Updated this week