kemonoske / spark-minio-delta-lakehouse-docker
A minimal Docker Compose setup for experimenting with cloud-agnostic lakehouse architectures: Apache Spark with Hive Metastore + Delta Lake + MinIO
☆19Updated 10 months ago
Alternatives and similar repositories for spark-minio-delta-lakehouse-docker:
Users who are interested in spark-minio-delta-lakehouse-docker are comparing it to the repositories listed below
- Query Iceberg in Trino, Nessie as Catalog, and use minio to replace AWS S3☆17Updated 9 months ago
- Apache Hive Metastore as a Standalone server in Docker☆68Updated 6 months ago
- Presto Trino with Apache Hive Postgres metastore☆40Updated 5 months ago
- Docker environment to stream data from Kafka to Iceberg tables☆25Updated 11 months ago
- Python wrapper for the Sling CLI tool☆45Updated this week
- ☆13Updated last year
- dbt (data build tool) adapter for Dremio☆49Updated last week
- A tool that makes it easy to run modular Trino environments locally.☆32Updated 2 months ago
- Trino dbt demo project to mix and load BigQuery data with and in a local PostgreSQL database☆72Updated 3 years ago
- Library to convert DBT manifest metadata to Airflow tasks☆48Updated 11 months ago
- The Trino (https://trino.io/) adapter plugin for dbt (https://getdbt.com)☆226Updated 2 months ago
- Sample Data Lakehouse deployed in Docker containers using Apache Iceberg, Minio, Trino and a Hive Metastore. Can be used for local testin…☆60Updated last year
- Alto is a versatile data integration tool that allows you to easily run Singer plugins, build and cache PEX files encapsulating those plu…☆59Updated last year
- A Python package that creates fine-grained dbt tasks on Apache Airflow☆64Updated 4 months ago
- ☆67Updated last week
- Proof-of-concept extension combining the delta extension with Unity Catalog☆73Updated this week
- Repo for CDC with debezium blog post☆27Updated 5 months ago
- dbt + Trino demo project, using TPC-H sample data☆19Updated 10 months ago
- Minimal example to run Trino, Minio, and Hive standalone metastore on docker☆49Updated 2 years ago
- Utility functions for dbt projects running on Trino☆21Updated last year
- The bridge to effortless multi-engine data applications, currently supports Snowflake ❄️ and DuckDB 🦆☆157Updated this week
- Official Dockerfile for Delta Lake☆42Updated 8 months ago
- ☆15Updated last year
- A Microsoft Power BI Custom Connector allowing you to import Trino data into Power BI.☆62Updated last month
- Starburst Metabase driver☆66Updated last week
- A portable Datamart and Business Intelligence suite built with Docker, Dagster, dbt, DuckDB and Superset☆213Updated this week
- Quick Guides from Dremio on Several topics☆67Updated last month
- ☆74Updated 4 months ago
- A dbt-core python package that automates the management and creation of dbt groups, contracts, access, and versions.☆115Updated last month
- ☆258Updated 3 months ago