ismailsimsek / iceberg-examplesLinks
Apache iceberg Spark s3 examples
☆20Updated last year
Alternatives and similar repositories for iceberg-examples
Users that are interested in iceberg-examples are comparing it to the libraries listed below
Sorting:
- Examples for using Apache Flink® with DataStream API, Table API, Flink SQL and connectors such as MySQL, JDBC, CDC, Kafka.☆64Updated last year
- Apache Flink Stateful Functions Playground☆130Updated last year
- Spark Connector to read and write with Pulsar☆116Updated 3 weeks ago
- ☆59Updated last year
- Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake.☆29Updated last week
- 🌟 Examples of use cases that utilize Decodable, as well as demos for related open-source projects such as Apache Flink, Debezium, and Po…☆83Updated 2 months ago
- Demonstration of a Hive Input Format for Iceberg☆26Updated 4 years ago
- Multi-hop declarative data pipelines☆118Updated last week
- A testing framework for Trino☆26Updated 5 months ago
- Smart Automation Tool for building modern Data Lakes and Data Pipelines☆124Updated this week
- Java binding to Apache DataFusion☆82Updated 5 months ago
- ☆199Updated 2 months ago
- ☆40Updated 2 years ago
- In-Memory Analytics for Kafka using DuckDB☆137Updated this week
- Low Cost, Simple and Scalable Way of Data Replication to Apache Iceberg/Cloud/Data Lake☆283Updated this week
- A playground to experience Gravitino☆56Updated last month
- Storage connector for Trino☆115Updated this week
- A temporary home for LinkedIn's changes to Apache Iceberg (incubating)☆63Updated 2 weeks ago
- Example for article Running Spark 3 with standalone Hive Metastore 3.0☆100Updated 2 years ago
- An example of using Flink for Fault-Tolerant Stream Processing☆12Updated 6 years ago
- Self-contained demo using Flink SQL and Debezium to build a CDC-based analytics pipeline. All you need is Docker!☆25Updated 4 years ago
- Apache datasketches☆99Updated 2 years ago
- ☆22Updated 6 years ago
- A sample implementation of stream writes to an Iceberg table on GCS using Flink and reading it using Trino☆21Updated 3 years ago
- Example setup to demonstrate Prometheus integration of Apache Flink☆94Updated last week
- Rewrite BigQuery, Redshift, Snowflake and Databricks queries into DuckDB compatible SQL (with deep transformation of functions, data type…☆60Updated last week
- Lab project to showcase Flink's performance differences between using a SQL query and implementing the same logic via the DataStream API☆14Updated 5 years ago
- Spark* plug-in for accelerating Spark* SQL performance by using cache and index at SQL data source layer.☆37Updated 2 years ago
- Apache Kafka is an open-source distributed event streaming platform used by thousands of companies. uForwarder aims to address several pa…☆86Updated 6 months ago
- The Workload Analyzer collects Presto® and Trino workload statistics, and analyzes them☆135Updated last year