BlueGranite / tpc-ds-dataset-generator
Generate big TPC-DS datasets with Databricks
☆21 · Updated 3 years ago
Alternatives and similar repositories for tpc-ds-dataset-generator
Users who are interested in tpc-ds-dataset-generator are comparing it to the repositories listed below.
- TPCDS benchmark for various engines ☆18 · Updated 3 years ago
- Databricks Migration Tools ☆43 · Updated 4 years ago
- Enabling Continuous Data Processing with Apache Spark and Azure Event Hubs ☆238 · Updated 9 months ago
- Apache Spark Connector for SQL Server and Azure SQL ☆286 · Updated 8 months ago
- This repo is a collection of tools to deploy, manage and operate a Databricks based Lakehouse. ☆46 · Updated 9 months ago
- ☆18 · Updated last year
- ☆76 · Updated last year
- This project provides a client library that allows Azure SQL DB or SQL Server to act as an input source or output sink for Spark jobs. ☆76 · Updated 5 years ago
- Databricks Implementation of the TPC-DI Specification using Traditional Notebooks and/or Delta Live Tables ☆89 · Updated 4 months ago
- An Azure Databricks workshop leveraging the New York Taxi and Limousine Commission Trip Records dataset ☆109 · Updated 2 years ago
- Databricks Platform - Architecture, Security, Automation and much more!! ☆51 · Updated 2 weeks ago
- AZTK powered by Azure Batch: On-demand, Dockerized, Spark Jobs on Azure ☆151 · Updated 4 years ago
- dbt adapter for Azure Synapse Dedicated SQL Pools ☆75 · Updated 2 months ago
- Apache Spark Connector for Azure Cosmos DB ☆203 · Updated 8 months ago
- Snowflake Data Source for Apache Spark. ☆230 · Updated last month
- SQL Queries & Alerts for Databricks System Tables access.audit Logs ☆39 · Updated 3 months ago
- ☆95 · Updated 3 years ago
- Demo of using the Nutter for testing of Databricks notebooks in the CI/CD pipeline ☆153 · Updated last year
- A tool to validate data, built around Apache Spark. ☆100 · Updated this week
- Example code for doing DataOps ☆48 · Updated 4 years ago
- Tools for Deploying Databricks Solutions in Azure ☆97 · Updated last year
- Apache Spark Connector for Azure Kusto ☆78 · Updated last week
- Delta Lake examples ☆231 · Updated last year
- A Spark connector for the Azure Common Data Model ☆15 · Updated 2 years ago
- Testing framework for Databricks notebooks ☆309 · Updated last year
- Version 1 of Technical Best Practices of Azure Databricks based on real world Customer and Technical SME inputs ☆464 · Updated 2 years ago
- Repository of sample Databricks notebooks ☆271 · Updated last year
- How DevOps principles can be applied to Data Pipeline Solution built with Azure Databricks, Data Factory and ADL Gen2. Moved to: https://… ☆61 · Updated last year
- A lightweight helper utility which allows developers to do interactive pipeline development by having a unified source code for both DLT … ☆49 · Updated 2 years ago
- VSCode extension to work with Databricks ☆131 · Updated 3 weeks ago