JuanARojasA / dataflat
☆12 Updated 6 months ago
Alternatives and similar repositories for dataflat
Users who are interested in dataflat are comparing it to the repositories listed below.
- ☆15 Updated 3 years ago
- Run Apache Airflow on OpenShift ☆14 Updated 4 years ago
- This repository contains NiFi processors for interacting with Snowflake Cloud Data Platform. ☆12 Updated 8 months ago
- Writing PySpark logs in Apache Spark and Databricks ☆17 Updated 3 years ago
- An example CI/CD pipeline using GitHub Actions for doing continuous deployment of AWS Glue jobs built on PySpark and Jupyter Notebooks. ☆12 Updated 4 years ago
- Companion repository for the book 'Delta Lake Up and Running' ☆47 Updated 5 months ago
- A tool to generate PySpark schema from JSON. ☆28 Updated last year
- ☆24 Updated 2 years ago
- A series of workshop modules introducing Feast feature store. ☆19 Updated 3 years ago
- PowerShell scripts for Power BI ☆12 Updated last year
- ☆14 Updated 4 years ago
- ☆13 Updated last year
- ☆18 Updated last month
- This repo is a collection of tools to deploy, manage and operate a Databricks based Lakehouse. ☆45 Updated 7 months ago
- ☆34 Updated this week
- DBSQL SME Repo contains demos, tutorials, blog code, advanced production helper functions and more! ☆67 Updated last week
- Playing with different packages of Apache Spark ☆30 Updated last year
- ☆97 Updated 2 years ago
- Managing Data as a Product, published by Packt ☆17 Updated 9 months ago
- Data Profiler for AWS Glue Data Catalog application as described in the AWS Big Data Blog post "Build an automatic data profiling and rep… ☆20 Updated 5 years ago
- This repo contains live examples to build Databricks' Lakehouse and recommended best practices from the field. ☆20 Updated 10 months ago
- ADB Essentials Demos used in the webinars: https://databricks.com/p/webinar/azure-databricks-essentials-series ☆61 Updated 3 years ago
- Demo of using the Nutter for testing of Databricks notebooks in the CI/CD pipeline ☆153 Updated last year
- A Python package to help Databricks Unity Catalog users read and query Delta Lake tables with Polars, DuckDB, or PyArrow. ☆26 Updated last year
- [DEPRECATED] Demo repository implementing an end-to-end MLOps workflow on Databricks. Project derived from dbx basic python template ☆114 Updated 2 years ago
- ☆17 Updated last year
- Full stack data engineering tools and infrastructure set-up ☆56 Updated 4 years ago
- Examples surrounding Databricks. ☆60 Updated last year
- ☆19 Updated 7 months ago
- Spark data pipeline that processes movie ratings data. ☆29 Updated this week