cuebook / cuelake
Use SQL to build ELT pipelines on a data lakehouse.
☆285Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for cuelake
- Generate and Visualize Data Lineage from query history☆311Updated last year
- The Workload Analyzer collects Presto® and Trino workload statistics, and analyzes them☆135Updated last year
- Open-source metadata collector based on ODD Specification☆42Updated last year
- Replicates any database (CDC events) to Apache Iceberg (To Cloud Storage)☆191Updated this week
- Storage connector for Trino☆93Updated this week
- Repository of helm charts for deploying DataHub on a Kubernetes cluster☆163Updated last week
- A Spark UI and Spark History Server alternative with CPU and Memory metrics! Delight is free, cross-platform, and open-source.☆342Updated 5 months ago
- Flowman is an ETL framework powered by Apache Spark. With its declarative approach, Flowman simplifies the development of complex data pi…☆92Updated 2 weeks ago
- Multiple node presto cluster on docker container☆121Updated 2 years ago
- A simple Spark-powered ETL framework that just works 🍺☆178Updated 11 months ago
- Example for article Running Spark 3 with standalone Hive Metastore 3.0☆96Updated last year
- Schema modelling framework for decentralised domain-driven ownership of data.☆247Updated 11 months ago
- The Trino (https://trino.io/) adapter plugin for dbt (https://getdbt.com)☆213Updated this week
- A simplified, lightweight ETL Framework based on Apache Spark☆584Updated 9 months ago
- DataHub Actions is a framework for responding to changes to your DataHub Metadata Graph in real time.☆42Updated this week
- A load balancer / proxy / gateway for prestodb☆358Updated 3 months ago
- Tool to automate data quality checks on data pipelines☆249Updated 2 years ago
- dbt (data build tool) adapter for the Dremio☆43Updated 2 months ago
- The Clickhouse plugin for dbt (data build tool)☆248Updated this week
- dbt-spark contains all of the code enabling dbt to work with Apache Spark and Databricks☆400Updated this week
- A temporary home for LinkedIn's changes to Apache Iceberg (incubating)☆62Updated 6 months ago
- Open Control Plane for Tables in Data Lakehouse☆306Updated this week
- ☆252Updated 2 weeks ago
- dbt-starrocks contains all of the code enabling dbt to work with StarRocks☆18Updated 3 weeks ago
- Adapter for dbt that executes dbt pipelines on Apache Flink☆83Updated 7 months ago
- A library that provides useful extensions to Apache Spark and PySpark.☆195Updated this week
- Serverless HTAP cloud data platform powered by Arrow × DuckDB × Iceberg☆303Updated last year