ismailsimsek / iceberg-examples
Apache Iceberg Spark S3 examples
☆20 Updated last year
Alternatives and similar repositories for iceberg-examples
Users that are interested in iceberg-examples are comparing it to the libraries listed below
- Examples for using Apache Flink® with DataStream API, Table API, Flink SQL and connectors such as MySQL, JDBC, CDC, Kafka. ☆65 Updated 2 years ago
- ☆61 Updated last year
- Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake. ☆30 Updated 2 weeks ago
- Multi-hop declarative data pipelines ☆122 Updated last week
- Example for article Running Spark 3 with standalone Hive Metastore 3.0 ☆102 Updated 2 years ago
- A temporary home for LinkedIn's changes to Apache Iceberg (incubating) ☆63 Updated last week
- Low Cost, Simple and Scalable Way of Data Replication to Apache Iceberg/Cloud/Data Lake ☆288 Updated last week
- Apache Flink Stateful Functions Playground ☆131 Updated 2 years ago
- ☆80 Updated 6 months ago
- ☆40 Updated 2 years ago
- Spark Connector to read and write with Pulsar ☆116 Updated last month
- Rewrite BigQuery, Redshift, Snowflake and Databricks queries into DuckDB compatible SQL (with deep transformation of functions, data type… ☆60 Updated last week
- Data Pipeline Automation Framework to build MCP servers, data APIs, and data lakes with SQL. ☆135 Updated last week
- In-Memory Analytics for Kafka using DuckDB ☆141 Updated last week
- Self-contained demo using Flink SQL and Debezium to build a CDC-based analytics pipeline. All you need is Docker! ☆25 Updated 4 years ago
- A sample implementation of stream writes to an Iceberg table on GCS using Flink and reading it using Trino ☆21 Updated 3 years ago
- 🌟 Examples of use cases that utilize Decodable, as well as demos for related open-source projects such as Apache Flink, Debezium, and Po… ☆84 Updated 4 months ago
- Yet Another (Spark) ETL Framework ☆21 Updated 2 years ago
- Storage connector for Trino ☆116 Updated last week
- Demonstration of a Hive Input Format for Iceberg ☆26 Updated 4 years ago
- The Internals of PySpark ☆26 Updated 10 months ago
- ☆200 Updated 3 months ago
- Smart Automation Tool for building modern Data Lakes and Data Pipelines ☆122 Updated last week
- The Workload Analyzer collects Presto® and Trino workload statistics, and analyzes them ☆135 Updated 2 years ago
- ☆104 Updated 9 months ago
- Instructions for getting started with Ververica Platform on minikube. ☆95 Updated 3 months ago
- CLI tool to bulk migrate the tables from one catalog to another without a data copy ☆81 Updated 6 months ago
- Code for Apache Hudi, Apache Iceberg and Delta Lake analysis ☆10 Updated last year
- Java binding to Apache DataFusion ☆83 Updated 6 months ago
- 📚 Tech blogs & talks by companies that run Apache Flink in production ☆177 Updated 2 months ago