dgkatz / trino-hive-superset-dockerLinks
Cloud-native Trino (prestosql) + Hive + Minio + Superset
☆24Updated 3 years ago
Alternatives and similar repositories for trino-hive-superset-docker
Users that are interested in trino-hive-superset-docker are comparing it to the libraries listed below
Sorting:
- ☆268Updated 11 months ago
- Example for article Running Spark 3 with standalone Hive Metastore 3.0☆102Updated 2 years ago
- A simplified, lightweight ETL Framework based on Apache Spark☆588Updated last year
- Apache Hive Metastore as a Standalone server in Docker☆80Updated last year
- The Trino (https://trino.io/) adapter plugin for dbt (https://getdbt.com)☆248Updated last month
- Playground for Lakehouse (Iceberg, Hudi, Spark, Flink, Trino, DBT, Airflow, Kafka, Debezium CDC)☆61Updated 2 years ago
- Multi-container environment with Hadoop, Spark and Hive☆224Updated 5 months ago
- ☆25Updated last year
- Multiple node presto cluster on docker container☆126Updated 3 years ago
- New Generation Opensource Data Stack Demo☆449Updated 2 years ago
- Trino dbt demo project to mix and load BigQuery data with and in a local PostgreSQL database☆76Updated 4 years ago
- Flowman is an ETL framework powered by Apache Spark. With its declarative approach, Flowman simplifies the development of complex data pi…☆96Updated 2 weeks ago
- Repository of helm charts for deploying DataHub on a Kubernetes cluster☆195Updated last week
- Build Data Lake using Open Source tools☆113Updated 4 months ago
- Creation of a data lakehouse and an ELT pipeline to enable the efficient analysis and use of data☆48Updated last year
- Quick Guides from Dremio on Several topics☆78Updated 2 weeks ago
- dbt (data build tool) adapter for the Dremio☆52Updated last month
- REST API for Apache Spark on K8S or YARN☆104Updated last month
- To provide a deeper understanding of how the modern, open-source data stack consisting of Iceberg, dbt, Trino, and Hive operates within a…☆40Updated last year
- Sample Data Lakehouse deployed in Docker containers using Apache Iceberg, Minio, Trino and a Hive Metastore. Can be used for local testin…☆74Updated 2 years ago
- A simple Spark-powered ETL framework that just works 🍺☆182Updated 2 weeks ago
- Support for generating modern platforms dynamically with services such as Kafka, Spark, Streamsets, HDFS, ....☆77Updated this week
- Grafana dashboards and StatsD exporter config for Airflow monitoring☆288Updated last year
- Adapter for dbt that executes dbt pipelines on Apache Flink☆95Updated last year
- Spark-Dashboard is a solution for monitoring Apache Spark jobs. This repository provides the tooling and configuration for deploying an A…☆129Updated last month
- Learn Apache Spark in Scala, Python (PySpark) and R (SparkR) by building your own cluster with a JupyterLab interface on Docker.☆494Updated 2 years ago
- This is a GitHub for all of my NiFi Templates☆46Updated 5 years ago
- The most popular ClickHouse plugin for Airflow. 🔝 Top-1% downloads on PyPI: https://pypi.org/project/airflow-clickhouse-plugin! Based on…☆166Updated last week
- Low Cost, Simple and Scalable Way of Data Replication to Apache Iceberg/Cloud/Data Lake☆287Updated this week
- CSD for Apache Airflow☆20Updated 6 years ago