jamesshocking / collapse-spark-dataframe
Python code that will collapse structured columns separating out the attributes into new columns
☆11Updated 3 years ago
Alternatives and similar repositories for collapse-spark-dataframe:
Users that are interested in collapse-spark-dataframe are comparing it to the libraries listed below
- dbt adapter for Azure Synapse Dedicated SQL Pools☆71Updated last week
- Yet Another (Spark) ETL Framework☆20Updated last year
- Delta Lake Documentation☆49Updated 10 months ago
- Generates bundles of verified adapters + core☆17Updated this week
- This repo is a collection of tools to deploy, manage and operate a Databricks based Lakehouse.☆45Updated 2 months ago
- ☆99Updated 2 weeks ago
- A cloud data platform product to accelerate time to insights. Our open-source framework is designed for the real world. Stripping away th…☆19Updated last week
- devops-for-databricks☆60Updated 10 months ago
- Delta Lake Website☆25Updated 3 weeks ago
- Custom PySpark Data Sources☆42Updated this week
- Example project using DBT, Databricks and AdventureWorks sample database☆11Updated 2 years ago
- A DataOps framework for building a lakehouse.☆50Updated this week
- A bunch of hacks developed around dbt☆48Updated 5 years ago
- dbt-utils for the dbt-msft family of packages☆26Updated 7 months ago
- Fabric Python Notebooks examples☆68Updated 2 weeks ago
- An example dbt project using AutomateDV to create a Data Vault 2.0 Data Warehouse based on the Snowflake TPC-H dataset.☆46Updated last year
- A lightweight helper utility which allows developers to do interactive pipeline development by having a unified source code for both DLT …☆49Updated 2 years ago
- Delta lake and filesystem helper methods☆51Updated last year
- ☆14Updated 4 years ago
- Demonstration of using Files in Repos with Databricks Delta Live Tables☆32Updated 9 months ago
- Azure Deployments using Terraform☆30Updated 2 years ago
- Unity Catalog UI☆40Updated 7 months ago
- Scalefree's dbt package for a Data Vault 2.0 implementation congruent to the original Data Vault 2.0 definition by Dan Linstedt including…☆152Updated last week
- Delta Lake examples☆221Updated 6 months ago
- Demo of using the Nutter for testing of Databricks notebooks in the CI/CD pipeline☆152Updated 8 months ago
- Rules based grant management for Snowflake☆40Updated 6 years ago
- ☆22Updated 3 years ago
- Example of how to leverage Apache Spark distributed capabilities to call REST-API using a UDF☆50Updated 2 years ago
- PDF DataSource for Apache Spark, allow to read PDF files directly to the DataFrame and ocr it☆49Updated last week
- A DBT package to perform DataOps & administrative CI/CD on your data warehouse.☆16Updated 3 years ago