wirelessr / flink-iceberg-playground
MinIO as local storage and DynamoDB as the catalog
☆15Updated last year
Alternatives and similar repositories for flink-iceberg-playground
Users who are interested in flink-iceberg-playground are comparing it to the repositories listed below
- This is a basic Apache Pinot example for ingesting real-time MySQL change logs using Debezium☆27Updated 4 years ago
- Automatically loads new partitions in AWS Athena☆19Updated 4 years ago
- PySpark for ETL jobs including lineage to Apache Atlas in one script via code inspection☆18Updated 8 years ago
- Demos using Conduktor Gateway☆16Updated last year
- A testing framework for Trino☆26Updated 2 months ago
- Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake.☆29Updated this week
- Data Profiler for AWS Glue Data Catalog application as described in the AWS Big Data Blog post "Build an automatic data profiling and rep…☆20Updated 5 years ago
- Using the Parquet file format (with Avro) to process data with Apache Flink☆14Updated 9 years ago
- Streaming ETL with Apache Flink and Amazon Kinesis Data Analytics☆64Updated last year
- Yet Another (Spark) ETL Framework☆21Updated last year
- A curated list of awesome PrestoDB / Trino software, libraries, tools and resources☆17Updated 3 years ago
- Optimizing downstream data processing with Amazon Kinesis Data Firehose and Amazon EMR running Apache Spark☆13Updated 2 years ago
- ☆52Updated last week
- ☆14Updated 3 months ago
- Java implementation for performing operations on Apache Iceberg and Hive tables☆19Updated 3 weeks ago
- Connect DbVisualizer to Hortonworks HiveServer2☆9Updated 10 years ago
- Apache Iceberg Spark S3 examples☆20Updated last year
- An application that records stats about consumer group offset commits and reports them as Prometheus metrics☆14Updated 6 years ago
- Amazon EMR on EKS Custom Image CLI☆31Updated 8 months ago
- A Python Client for Hive Metastore☆12Updated last year
- Example setup for dbt Cloud using GitHub integrations☆11Updated 5 years ago
- ☆22Updated 6 years ago
- A curated list of Apache Pulsar resources☆13Updated 6 years ago
- Dashboard for operating Flink jobs and deployments.☆35Updated 6 months ago
- Apiary provides modules which can be combined to create a federated cloud data lake☆36Updated last year
- Herd-UI is a search and discovery tool for business and technical users. Everyone in your organization can use Herd-UI to browse and unde…☆16Updated 2 years ago
- Docker environment to stream data from Kafka to Iceberg tables☆29Updated last year
- Cloud Storage Connector integrates Apache Pulsar with cloud storage.☆28Updated 3 weeks ago
- Amundsen Gremlin☆21Updated 2 years ago
- Sample code to collect Apache Iceberg metrics for table monitoring☆27Updated 9 months ago