minio / openlake
Build Data Lake using Open Source tools
☆80Updated 2 weeks ago
Related projects ⓘ
Alternatives and complementary repositories for openlake
- Docker envinroment to stream data from Kafka to Iceberg tables☆24Updated 8 months ago
- Data product portal created by Dataminded☆146Updated this week
- A curated list of open source tools used in analytics platforms and data engineering ecosystem☆133Updated this week
- Open Control Plane for Tables in Data Lakehouse☆308Updated this week
- Sparglim✨ makes PySpark App Configurable and Deploy Spark Connect Server Easier!☆35Updated last month
- Apache Flink (Pyflink) and Related Projects☆29Updated 5 months ago
- Collection of assets used for various articles at https://blogs.min.io☆25Updated last week
- Example for article Running Spark 3 with standalone Hive Metastore 3.0☆96Updated last year
- Trino dbt demo project to mix and load BigQuery data with and in a local PostgreSQL database☆68Updated 3 years ago
- The Open-Source Enterprise Data Platform in a single Portal☆214Updated this week
- A portable Datamart and Business Intelligence suite built with Docker, Dagster, dbt, DuckDB, PostgreSQL and Superset☆177Updated this week
- How to use Presto (with Hive metastore) and MinIO?☆24Updated last year
- ☆40Updated last year
- Apache Hive Metastore as a Standalone server in Docker☆65Updated 2 months ago
- To provide a deeper understanding of how the modern, open-source data stack consisting of Iceberg, dbt, Trino, and Hive operates within a…☆25Updated 8 months ago
- Repo for CDC with debezium blog post☆26Updated last month
- The Trino (https://trino.io/) adapter plugin for dbt (https://getdbt.com)☆214Updated this week
- The Lakehouse Engine is a configuration driven Spark framework, written in Python, serving as a scalable and distributed engine for sever…☆221Updated last week
- ☆44Updated this week
- Operator for Apache Spark-on-Kubernetes for Stackable Data Platform☆57Updated this week
- Sample Data Lakehouse deployed in Docker containers using Apache Iceberg, Minio, Trino and a Hive Metastore. Can be used for local testin…☆55Updated last year
- ☆21Updated this week
- A curated list of awesome DataOps tools☆155Updated 3 weeks ago
- Data Quality and Observability platform for the whole data lifecycle, from profiling new data sources to full automation with Data Observ…☆111Updated this week
- A Micosoft Power BI Custom Connector allowing you to import Trino data into Power BI.☆50Updated last month
- PyAirbyte brings the power of Airbyte to every Python developer.☆229Updated this week
- Building a Data Pipeline with an Open Source Stack☆38Updated 4 months ago