minio / openlake
Build Data Lake using Open Source tools
☆84, updated 3 weeks ago
Related projects
Alternatives and complementary repositories for openlake
- Docker environment to stream data from Kafka to Iceberg tables (☆24, updated 8 months ago)
- Example for article Running Spark 3 with standalone Hive Metastore 3.0 (☆96, updated last year)
- Trino dbt demo project to mix and load BigQuery data with and in a local PostgreSQL database (☆70, updated 3 years ago)
- Repository of helm charts for deploying DataHub on a Kubernetes cluster (☆165, updated last week)
- A portable Datamart and Business Intelligence suite built with Docker, Dagster, dbt, DuckDB, PostgreSQL and Superset (☆183, updated last week)
- A curated list of open source tools used in analytics platforms and data engineering ecosystem (☆147, updated 2 weeks ago)
- (☆252, updated 3 weeks ago)
- Apache Hive Metastore as a Standalone server in Docker (☆67, updated 3 months ago)
- The Trino (https://trino.io/) adapter plugin for dbt (https://getdbt.com) (☆217, updated this week)
- Minimal example to run Trino, Minio, and Hive standalone metastore on docker (☆47, updated 2 years ago)
- Operator for Apache Spark-on-Kubernetes for Stackable Data Platform (☆58, updated this week)
- (☆22, updated 8 months ago)
- (☆40, updated last year)
- Open Control Plane for Tables in Data Lakehouse (☆312, updated this week)
- Streaming Synthetic Sales Data Generator: Streaming sales data generator for Apache Kafka, written in Python (☆44, updated last year)
- Sparglim✨ makes PySpark App Configurable and Deploy Spark Connect Server Easier! (☆35, updated last month)
- Creation of a data lakehouse and an ELT pipeline to enable the efficient analysis and use of data (☆40, updated 11 months ago)
- Playground for Lakehouse (Iceberg, Hudi, Spark, Flink, Trino, DBT, Airflow, Kafka, Debezium CDC) (☆44, updated last year)
- To provide a deeper understanding of how the modern, open-source data stack consisting of Iceberg, dbt, Trino, and Hive operates within a… (☆26, updated 8 months ago)
- A Microsoft Power BI Custom Connector allowing you to import Trino data into Power BI. (☆50, updated last week)
- (☆151, updated this week)
- (☆40, updated 3 years ago)
- New generation opensource data stack (☆61, updated 2 years ago)
- Building a Data Pipeline with an Open Source Stack (☆38, updated 4 months ago)
- Presto Trino with Apache Hive Postgres metastore (☆37, updated 2 months ago)
- A Table format agnostic data sharing framework (☆38, updated 9 months ago)
- Delta Lake Documentation (☆46, updated 5 months ago)