dipankarmazumdar / iceberg-in-production
A repository of blogs and videos that present how Apache Iceberg is used in production by various organizations
☆16 Updated last year
Alternatives and similar repositories for iceberg-in-production:
Users who are interested in iceberg-in-production are comparing it to the repositories listed below
- Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake. ☆29 Updated this week
- A Table format agnostic data sharing framework ☆38 Updated last year
- Repo for everything open table formats (Iceberg, Hudi, Delta Lake) and the overall Lakehouse architecture ☆60 Updated 3 months ago
- Docker environment to stream data from Kafka to Iceberg tables ☆27 Updated last year
- ☆18 Updated last year
- Quick Guides from Dremio on Several topics ☆70 Updated 3 months ago
- Adapter for dbt that executes dbt pipelines on Apache Flink ☆95 Updated last year
- Makes it simple to store test results and visualise them in a BI dashboard ☆43 Updated last month
- This repository contains the tpcds queries together with the code required to run this benchmark for dbt and duckdb ☆18 Updated last year
- Delta Lake Documentation ☆49 Updated 10 months ago
- Example for article Running Spark 3 with standalone Hive Metastore 3.0 ☆98 Updated 2 years ago
- ☆75 Updated 3 months ago
- Data-aware orchestration with dagster, dbt, and airbyte ☆31 Updated 2 years ago
- Cost Efficient Data Pipelines with DuckDB ☆51 Updated 8 months ago
- Yet Another (Spark) ETL Framework ☆20 Updated last year
- Official dbt adapter for Vertica ☆24 Updated 4 months ago
- Dry run capability for dbt projects using BigQuery ☆96 Updated 2 weeks ago
- Sample project that uses Dagster, dbt, DuckDB and Dash to visualize the Spanish car and motorcycle market ☆58 Updated 2 years ago
- A write-audit-publish implementation on a data lake without the JVM ☆46 Updated 8 months ago
- Unity Catalog UI ☆40 Updated 7 months ago
- A GitHub Action that makes it easy to use Great Expectations to validate your data pipelines in your CI workflows. ☆80 Updated 11 months ago
- Evaluation Matrix for Change Data Capture ☆25 Updated 8 months ago
- The shared semantic layer definitions that dbt-core and MetricFlow use. ☆77 Updated last month
- Trino dbt demo project to mix and load BigQuery data with and in a local PostgreSQL database ☆74 Updated 3 years ago
- A CLI tool to streamline getting started with Apache Airflow™ and managing multiple Airflow projects ☆219 Updated this week
- Code snippets for Data Engineering Design Patterns book ☆80 Updated last month
- ☆77 Updated 6 months ago
- Delta Acceptance Testing ☆20 Updated 9 months ago
- Apache Hive Metastore as a Standalone server in Docker ☆73 Updated 8 months ago
- Weekly Data Engineering Newsletter ☆95 Updated 9 months ago