jamesshocking / collapse-spark-dataframe
Python code that will collapse structured columns separating out the attributes into new columns
☆11Updated 3 years ago
Alternatives and similar repositories for collapse-spark-dataframe:
Users that are interested in collapse-spark-dataframe are comparing it to the libraries listed below
- Yet Another (Spark) ETL Framework☆20Updated last year
- Delta Lake Documentation☆49Updated 9 months ago
- dbt adapter for Azure Synapse Dedicated SQL Pools☆70Updated 4 months ago
- Generates bundles of verified adapters + core☆16Updated this week
- Delta Lake Website☆25Updated 2 weeks ago
- Delta Lake examples☆218Updated 5 months ago
- Example project using DBT, Databricks and AdventureWorks sample database☆11Updated 2 years ago
- Delta lake and filesystem helper methods☆51Updated last year
- Rules based grant management for Snowflake☆40Updated 6 years ago
- ☆14Updated 4 years ago
- A lightweight helper utility which allows developers to do interactive pipeline development by having a unified source code for both DLT …☆49Updated 2 years ago
- Spark app to merge different schemas☆23Updated 4 years ago
- ☆94Updated last week
- A bunch of hacks developed around dbt☆48Updated 5 years ago
- Example of how to leverage Apache Spark distributed capabilities to call REST-API using a UDF☆50Updated 2 years ago
- This repo is a collection of tools to deploy, manage and operate a Databricks based Lakehouse.☆44Updated 2 months ago
- A DBT package to perform DataOps & administrative CI/CD on your data warehouse.☆16Updated 3 years ago
- devops-for-databricks☆60Updated 9 months ago
- A template DBT project for BigQuery on Google Cloud☆12Updated 3 years ago
- Demonstration of using Files in Repos with Databricks Delta Live Tables☆31Updated 8 months ago
- A cloud data platform product to accelerate time to insights. Our open-source framework is designed for the real world. Stripping away th…☆19Updated last week
- An example dbt project using AutomateDV to create a Data Vault 2.0 Data Warehouse based on the Snowflake TPC-H dataset.☆46Updated last year
- M3D Engine is a Spark application for the development of scalable data transformations and ingestions in data lakes.☆18Updated 3 years ago
- Custom PySpark Data Sources☆41Updated 2 months ago
- Utility functions for dbt projects running on Spark☆31Updated last month
- An Azure Databricks workshop leveraging the New York Taxi and Limousine Commission Trip Records dataset☆108Updated last year
- dbt (data build tool) adapter for the Dremio☆50Updated this week
- A DataOps framework for building a lakehouse.☆47Updated this week
- Fabric Python Notebooks examples☆67Updated this week
- dbt-utils for the dbt-msft family of packages☆27Updated 6 months ago