dgkatz / trino-hive-superset-dockerLinks
Cloud-native Trino (prestosql) + Hive + Minio + Superset
☆23Updated 3 years ago
Alternatives and similar repositories for trino-hive-superset-docker
Users that are interested in trino-hive-superset-docker are comparing it to the libraries listed below
Sorting:
- Example for article Running Spark 3 with standalone Hive Metastore 3.0☆99Updated 2 years ago
- Apache Hive Metastore as a Standalone server in Docker☆79Updated 10 months ago
- Quick Guides from Dremio on Several topics☆71Updated this week
- Adapter for dbt that executes dbt pipelines on Apache Flink☆95Updated last year
- Sample Data Lakehouse deployed in Docker containers using Apache Iceberg, Minio, Trino and a Hive Metastore. Can be used for local testin…☆72Updated last year
- ☆265Updated 8 months ago
- Library to convert DBT manifest metadata to Airflow tasks☆48Updated last year
- Minimal example to run Trino, Minio, and Hive standalone metastore on docker☆52Updated 3 years ago
- ☆25Updated last year
- Playground for Lakehouse (Iceberg, Hudi, Spark, Flink, Trino, DBT, Airflow, Kafka, Debezium CDC)☆58Updated last year
- Scalefree's dbt package for a Data Vault 2.0 implementation congruent to the original Data Vault 2.0 definition by Dan Linstedt including…☆157Updated last week
- A repository of sample code to accompany our blog post on Airflow and dbt.☆174Updated last year
- Support for generating modern platforms dynamically with services such as Kafka, Spark, Streamsets, HDFS, ....☆75Updated this week
- A Python package that creates fine-grained dbt tasks on Apache Airflow☆70Updated 9 months ago
- Code snippets used in demos recorded for the blog.☆37Updated 2 weeks ago
- ☆80Updated 2 months ago
- Flowman is an ETL framework powered by Apache Spark. With its declarative approach, Flowman simplifies the development of complex data pi…☆95Updated last week
- Delta-Lake, ETL, Spark, Airflow☆47Updated 2 years ago
- ☆19Updated 2 years ago
- ☆15Updated 2 years ago
- A repository of sample code to show data quality checking best practices using Airflow.☆77Updated 2 years ago
- dbt (data build tool) adapter for the Dremio☆52Updated last week
- Trino dbt demo project to mix and load BigQuery data with and in a local PostgreSQL database☆75Updated 3 years ago
- ☆40Updated 2 years ago
- Streaming Synthetic Sales Data Generator: Streaming sales data generator for Apache Kafka, written in Python☆44Updated 2 years ago
- Smart Automation Tool for building modern Data Lakes and Data Pipelines☆124Updated this week
- To provide a deeper understanding of how the modern, open-source data stack consisting of Iceberg, dbt, Trino, and Hive operates within a…☆35Updated last year
- dbt + Trino demo project, using TPC-H sample data☆19Updated last year
- ☆80Updated 8 months ago
- A collection of Airflow operators, hooks, and utilities to elevate dbt to a first-class citizen of Airflow.☆200Updated last week