arsenvlad / docker-presto-adls-wasb
Example of a single node Presto with Azure Data Lake Store (ADLS) and Azure Storage Blob (WASB) access via Hive metastore
☆19 Updated 4 years ago
Alternatives and similar repositories for docker-presto-adls-wasb:
Users who are interested in docker-presto-adls-wasb are comparing it to the libraries listed below
- Databricks Migration Tools ☆43 Updated 3 years ago
- TPCDS benchmark for various engines ☆18 Updated 2 years ago
- Tools for Deploying Databricks Solutions in Azure ☆99 Updated last year
- Builds Airflow DAGs from configuration files. Powers all DAGs on the Etsy Data Platform ☆261 Updated last year
- Pytest plugin for writing Azure Data Factory Integration Tests ☆25 Updated 3 years ago
- AZTK powered by Azure Batch: On-demand, Dockerized, Spark Jobs on Azure ☆151 Updated 3 years ago
- A proof of concept of how to integrate Spark Lineage in Azure Purview ☆22 Updated 3 years ago
- Circus Train is a dataset replication tool that copies Hive tables between clusters and clouds. ☆87 Updated 10 months ago
- Connect your Spark Databricks clusters Log4J output to the Application Insights Appender ☆19 Updated 4 years ago
- How DevOps principles can be applied to Data Pipeline Solution built with Azure Databricks, Data Factory and ADL Gen2. Moved to: https://… ☆59 Updated 3 months ago
- Generate big TPC-DS datasets with Databricks ☆18 Updated 3 years ago
- A Spark connector for the Azure Common Data Model ☆15 Updated last year
- Airflow on Kubernetes Operator ☆89 Updated last year
- dbt adapter for Azure Synapse Dedicated SQL Pools ☆70 Updated 2 months ago
- Kubernetes custom controller and CRDs for managing Airflow ☆299 Updated 4 years ago
- Streaming data changes to a Data Lake with Debezium and Delta Lake pipeline ☆74 Updated last year
- Dremio Container Tools ☆158 Updated last month
- Use Azure Monitor to track your Spark jobs in Azure Databricks ☆9 Updated 4 years ago
- Spark on Kubernetes infrastructure Docker images repo ☆37 Updated 2 years ago
- Ingest data originating from Prometheus to Kusto ☆19 Updated 5 months ago
- Monitoring Azure Databricks jobs ☆220 Updated 3 months ago
- Setup for running Trino with Hive Metastore on Kubernetes ☆99 Updated 2 years ago
- Apiary provides modules which can be combined to create a federated cloud data lake ☆36 Updated 9 months ago
- An Azure Function which allows Azure Data Factory (ADF) to connect to Snowflake in a flexible way. ☆26 Updated last year
- Example code for doing DataOps ☆47 Updated 4 years ago
- A lightweight helper utility which allows developers to do interactive pipeline development by having a unified source code for both DLT … ☆47 Updated 2 years ago
- HDInsight Kafka Tools ☆21 Updated last year
- Prometheus Exporter for Airflow ☆160 Updated 7 months ago
- Airflow support for Marquez ☆32 Updated 4 years ago
- Mirus is a cross data-center data replication tool for Apache Kafka ☆203 Updated last month