BlueGranite / tpc-ds-dataset-generatorLinks
Generate big TPC-DS datasets with Databricks
☆19Updated 3 years ago
Alternatives and similar repositories for tpc-ds-dataset-generator
Users that are interested in tpc-ds-dataset-generator are comparing it to the libraries listed below
Sorting:
- TPCDS benchmark for various engines☆18Updated 3 years ago
- Enabling Continuous Data Processing with Apache Spark and Azure Event Hubs☆237Updated 5 months ago
- An Azure Databricks workshop leveraging the New York Taxi and Limousine Commission Trip Records dataset☆110Updated 2 years ago
- Databricks Migration Tools☆43Updated 4 years ago
- Apache Spark Connector for Azure Kusto☆77Updated last week
- Databricks Platform - Architecture, Security, Automation and much more!!☆51Updated last week
- dbt adapter for Azure Synapse Dedicated SQL Pools☆73Updated last month
- How DevOps principles can be applied to Data Pipeline Solution built with Azure Databricks, Data Factory and ADL Gen2. Moved to: https://…☆60Updated 8 months ago
- This project provides a client library that allows Azure SQL DB or SQL Server to act as an input source or output sink for Spark jobs.☆77Updated 4 years ago
- AZTK powered by Azure Batch: On-demand, Dockerized, Spark Jobs on Azure☆152Updated 4 years ago
- This repo is a collection of tools to deploy, manage and operate a Databricks based Lakehouse.☆45Updated 5 months ago
- Apache Spark Connector for SQL Server and Azure SQL☆287Updated 4 months ago
- Apache Spark Connector for Azure Cosmos DB☆204Updated 4 months ago
- ☆18Updated last year
- Demo of using the Nutter for testing of Databricks notebooks in the CI/CD pipeline☆153Updated 11 months ago
- ☆76Updated last year
- Databricks Implementation of the TPC-DI Specification using Traditional Notebooks and/or Delta Live Tables☆87Updated last week
- Snowflake Data Source for Apache Spark.☆226Updated 3 weeks ago
- Example code for doing DataOps☆47Updated 4 years ago
- Examples surrounding Databricks.☆59Updated last year
- End-to-end Machine Learning Pipeline demo using Delta Lake, MLflow and AzureML in Azure Databricks☆18Updated 5 years ago
- A proof of concept of how to integrate Spark Lineage in Azure Purview☆22Updated 4 years ago
- A Spark connector for the Azure Common Data Model☆15Updated 2 years ago
- How do to CI/CD with Azure Data Factory☆41Updated 4 years ago
- Release notes for Apache Spark based Runtime for Azure Synapse Analytics and Microsoft Fabric☆27Updated 2 weeks ago
- ☆95Updated 2 years ago
- Azure Databricks Workshops☆94Updated 4 years ago
- Pytest plugin for writing Azure Data Factory Integration Tests☆25Updated 3 years ago
- Lab environment deployments for the Microsoft data engineering (DP-203) ILT learning content.☆28Updated 4 years ago
- Two-day level 300 Azure Synapse Analytics workshop☆11Updated 4 years ago