wirelessr / flink-iceberg-playground
MinIO as local storage and DynamoDB as catalog
☆15Updated last year
Alternatives and similar repositories for flink-iceberg-playground
Users that are interested in flink-iceberg-playground are comparing it to the libraries listed below
- Dashboard for operating Flink jobs and deployments.☆39Updated 2 weeks ago
- The Amazon S3 Tables catalog is a client library that bridges control plane operations provided by S3 Tables to engines like Apache Spark…☆137Updated 3 weeks ago
- Docker environment to stream data from Kafka to Iceberg tables☆30Updated last year
- ☆53Updated this week
- Snapshot manager for Amazon Kinesis Data Analytics for Apache Flink helps the users to generate a snapshot on a periodic basis.☆19Updated 2 years ago
- Python package for querying Iceberg data through DuckDB.☆70Updated last year
- Yet Another (Spark) ETL Framework☆21Updated last year
- Spark ETL example processing New York taxi rides public dataset on EKS☆44Updated 2 years ago
- Examples for using Apache Flink® with DataStream API, Table API, Flink SQL and connectors such as MySQL, JDBC, CDC, Kafka.☆64Updated last year
- Apache Iceberg Spark S3 examples☆20Updated last year
- Official repo for the Materialize + Redpanda + dbt Hack Day 2022, including a sample project to get everyone started!☆60Updated 2 years ago
- Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake.☆29Updated 2 weeks ago
- Multi-hop declarative data pipelines☆118Updated last week
- Amazon Managed Service for Apache Flink Benchmarking Utility helps with capacity planning, integration testing, and benchmarking of Amazo…☆20Updated 2 years ago
- ☆78Updated 4 months ago
- FUSE-based DuckDB file system 🦆☆47Updated 2 months ago
- Streaming data changes to a Data Lake with Debezium and Delta Lake pipeline☆75Updated 2 years ago
- Example for article Running Spark 3 with standalone Hive Metastore 3.0☆100Updated 2 years ago
- ☆28Updated 3 months ago
- Demos using Conduktor Gateway☆18Updated last year
- Data Sketches for Apache Spark☆22Updated 2 years ago
- An open-source, community-driven REST catalog for Apache Iceberg!☆29Updated last year
- Presto Trino with Apache Hive Postgres metastore☆43Updated 11 months ago
- A Spark-based data comparison tool at scale which facilitates software development engineers to compare a plethora of pair combinations o…☆52Updated 2 months ago
- ☆59Updated last year
- This project provides fully automated one-click experience to create Cloud and Kubernetes environment to run Data Analytics workload like…☆55Updated 2 years ago
- Replicates any database (CDC events) to Bigquery in real time☆22Updated last week
- Sample code to collect Apache Iceberg metrics for table monitoring☆28Updated last year
- Parquet Command-line Tools☆19Updated 8 years ago
- A testing framework for Trino☆26Updated 5 months ago