wirelessr / flink-iceberg-playgroundLinks
minio as local storage and DynamoDB as catalog
☆15Updated last year
Alternatives and similar repositories for flink-iceberg-playground
Users that are interested in flink-iceberg-playground are comparing it to the libraries listed below
Sorting:
- Dashboard for operating Flink jobs and deployments.☆42Updated 3 months ago
- Docker envinroment to stream data from Kafka to Iceberg tables☆30Updated last year
- Snapshot manager for Amazon Kinesis Data Analytics for Apache Flink helps the users to generate a snapshot on a periodic basis.☆19Updated 2 years ago
- Spark ETL example processing New York taxi rides public dataset on EKS☆44Updated 3 years ago
- For a series of posts on Amazon MSK, Amazon EKS, and Amazon EMR☆67Updated 4 years ago
- Examples for using Apache Flink® with DataStream API, Table API, Flink SQL and connectors such as MySQL, JDBC, CDC, Kafka.☆65Updated 2 years ago
- Paper: A Zero-rename committer for object stores☆20Updated 2 months ago
- Yet Another (Spark) ETL Framework☆21Updated 2 years ago
- Apache iceberg Spark s3 examples☆20Updated last year
- ☆86Updated 8 months ago
- Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake.☆30Updated this week
- Amazon Managed Service for Apache Flink Benchmarking Utility helps with capacity planning, integration testing, and benchmarking of Amazo…☆21Updated 2 years ago
- Example for article Running Spark 3 with standalone Hive Metastore 3.0☆103Updated 2 years ago
- Deploy Presto on the cloud easily, using Terraform and Packer☆45Updated 2 years ago
- ☆64Updated last year
- Spark on Kubernetes using Helm☆33Updated 5 years ago
- ☆58Updated last week
- Official repo for the Materialize + Redpanda + dbt Hack Day 2022, including a sample project to get everyone started!☆60Updated 3 years ago
- Replicates any database (CDC events) to Bigquery in real time☆23Updated last month
- Aiven's S3 Sink Connector for Apache Kafka®☆71Updated last year
- Docker image for Apache Hive Metastore☆73Updated 2 years ago
- The Amazon S3 Tables catalog is a client library that bridges control plane operations provided by S3 Tables to engines like Apache Spark…☆145Updated 5 months ago
- FUSE-based DuckDB file system 🦆☆49Updated 6 months ago
- A Spark-based data comparison tool at scale which facilitates software development engineers to compare a plethora of pair combinations o…☆52Updated 6 months ago
- PySpark for ETL jobs including lineage to Apache Atlas in one script via code inspection☆18Updated 9 years ago
- Apache Pinot Golang Client managed by StarTree☆33Updated 6 months ago
- Code for Apache Hudi, Apache Iceberg and Delta Lake analysis☆10Updated last year
- Tools for building, packaging, and OAP public cloud integrations such as AWS EMR, Google Dataproc and K8S.☆18Updated last year
- Rokku project. This project acts as a proxy on top of any S3 storage solution providing services like authentication, authorization, shor…☆70Updated 4 months ago
- Streaming ETL with Apache Flink and Amazon Kinesis Data Analytics☆65Updated 2 years ago