jplane / pyspark-devcontainer
A simple VS Code devcontainer setup for local PySpark development
☆50Updated last year
Alternatives and similar repositories for pyspark-devcontainer:
Users that are interested in pyspark-devcontainer are comparing it to the libraries listed below
- dbt adapter for Azure Synapse Dedicated SQL Pools☆71Updated 3 weeks ago
- Fabric Python Notebooks examples☆73Updated this week
- ☆102Updated last week
- Step-by-step tutorial on building a Kimball dimensional model with dbt☆139Updated 9 months ago
- Examples surrounding Databricks.☆58Updated 10 months ago
- PDF DataSource for Apache Spark, allow to read PDF files directly to the DataFrame and ocr it☆63Updated last week
- Delta Lake examples☆224Updated 7 months ago
- ☆114Updated 9 months ago
- This repo is a collection of tools to deploy, manage and operate a Databricks based Lakehouse.☆45Updated 3 months ago
- Personal project for setting up an open source data warehouse.☆29Updated 3 months ago
- Delta-Lake, ETL, Spark, Airflow☆47Updated 2 years ago
- Databricks Implementation of the TPC-DI Specification using Traditional Notebooks and/or Delta Live Tables☆83Updated 2 months ago
- Scalefree's dbt package for a Data Vault 2.0 implementation congruent to the original Data Vault 2.0 definition by Dan Linstedt including…☆153Updated 2 weeks ago
- SQL Queries & Alerts for Databricks System Tables access.audit Logs☆27Updated 6 months ago
- dbt-utils for the dbt-msft family of packages☆27Updated 7 months ago
- DBSQL SME Repo contains demos, tutorials, blog code, advanced production helper functions and more!☆60Updated 3 weeks ago
- Databricks CI/CD using Azure DevOps☆20Updated 2 years ago
- Showcase of advanced use cases relating to CI in dbt☆79Updated last week
- Companion repository for the book 'Delta Lake Up and Running'☆46Updated last month
- Databricks Platform - Architecture, Security, Automation and much more!!☆51Updated 3 weeks ago
- 🧱 A collection of supplementary utilities and helper notebooks to perform admin tasks on Databricks☆54Updated 4 months ago
- ☆22Updated 2 years ago
- ☆30Updated 10 months ago
- Execution of DBT models using Apache Airflow through Docker Compose☆116Updated 2 years ago
- Sample Data Lakehouse deployed in Docker containers using Apache Iceberg, Minio, Trino and a Hive Metastore. Can be used for local testin…☆69Updated last year
- Code snippets for Data Engineering Design Patterns book☆101Updated last month
- dbt adapter for SQL Server and Azure SQL☆227Updated this week
- Demo of using the Nutter for testing of Databricks notebooks in the CI/CD pipeline☆152Updated 8 months ago
- Modern serverless lakehouse implementing HOOK methodology, Unified Star Schema (USS), and Analytical Data Storage System (ADSS) principle…☆114Updated last month
- Stream processing with Azure Databricks☆138Updated 4 months ago