arsenvlad / docker-presto-adls-wasbLinks
Example of a single node Presto with Azure Data Lake Store (ADLS) and Azure Storage Blob (WASB) access via Hive metastore
☆19Updated 5 years ago
Alternatives and similar repositories for docker-presto-adls-wasb
Users that are interested in docker-presto-adls-wasb are comparing it to the libraries listed below
Sorting:
- Connect your Spark Databricks clusters Log4J output to the Application Insights Appender☆19Updated 5 years ago
- Pytest plugin for writing Azure Data Factory Integration Tests☆25Updated 3 years ago
- Databricks Migration Tools☆43Updated 4 years ago
- AZTK powered by Azure Batch: On-demand, Dockerized, Spark Jobs on Azure☆151Updated 4 years ago
- How DevOps principles can be applied to Data Pipeline Solution built with Azure Databricks, Data Factory and ADL Gen2. Moved to: https://…☆61Updated 11 months ago
- Apache Spark Connector for Azure Kusto☆78Updated this week
- A proof of concept of how to integrate Spark Lineage in Azure Purview☆21Updated 4 years ago
- HDInsight Kafka Tools☆21Updated last year
- Tools for Deploying Databricks Solutions in Azure☆98Updated last year
- Anomaly Detection Pipeline on Azure Databricks☆28Updated 6 years ago
- Enabling Continuous Data Processing with Apache Spark and Azure Event Hubs☆238Updated 7 months ago
- Use Azure Monitor to track your Spark jobs in Azure Databricks☆10Updated 5 years ago
- Kafka sink for Kusto☆51Updated 3 weeks ago
- A Spark connector for the Azure Common Data Model☆15Updated 2 years ago
- Example code for doing DataOps☆47Updated 4 years ago
- This project provides a client library that allows Azure SQL DB or SQL Server to act as an input source or output sink for Spark jobs.☆76Updated 5 years ago
- Setup for running Trino with Hive Metastore on Kubernetes☆103Updated 3 years ago
- A set of Build and Release tasks for Building, Deploying and Testing Databricks notebooks☆26Updated last year
- A unit test framework for Databricks notebooks☆12Updated 4 years ago
- An Azure Function which allows Azure Data Factory (ADF) to connect to Snowflake in a flexible way.☆26Updated 2 years ago
- OpenTelemetry Demo with Azure Databricks and Azure Monitor☆24Updated last year
- Python logging handlers to send logs to Microsoft Azure Storage☆27Updated 3 years ago
- Different ways to connect to storage in Azure Databricks☆11Updated 6 years ago
- TPCDS benchmark for various engines☆18Updated 3 years ago
- Apache Spark Connector for SQL Server and Azure SQL☆287Updated 7 months ago
- Spark on Kubernetes infrastructure Docker images repo☆38Updated 2 years ago
- A Helm chart to install Apache Airflow on Kubernetes☆289Updated this week
- Autoscaler for DC/OS hosted in a cloud provider☆11Updated 8 years ago
- Azure Data Architecture Guide☆33Updated 7 years ago
- Circus Train is a dataset replication tool that copies Hive tables between clusters and clouds.☆90Updated last year