ThomazRossito / Data-Quality-Pyspark
☆9 Updated last week
Alternatives and similar repositories for Data-Quality-Pyspark:
Users who are interested in Data-Quality-Pyspark are comparing it to the repositories listed below.
- Demo DAGs that show how to run dbt Core in Airflow using Cosmos ☆59 Updated 6 months ago
- Data Engineering with Apache Spark ☆42 Updated 3 years ago
- Spark development environment for Kubernetes, spark-submit and Jupyter notebook ☆19 Updated 3 years ago
- ☆46 Updated 5 months ago
- An ETL Orchestration using Apache Airflow to extract CSV files from a Google Drive, validate, transform, and load into a PostgreSQL datab… ☆24 Updated 9 months ago
- OSS data stack project ☆13 Updated 2 weeks ago
- A self-contained, ready to run Airflow ELT project. Can be run locally or within codespaces. ☆67 Updated last year
- Retail data pipeline using Airflow, dbt, Soda, GCP (GCS and BigQuery) and Metabase ☆37 Updated 9 months ago
- Notebooks and tips about Databricks ☆20 Updated 5 months ago
- Ravi Azure ADB ADF Repository ☆66 Updated 3 months ago
- Delta-Lake, ETL, Spark, Airflow ☆47 Updated 2 years ago
- ☆11 Updated last month
- ☆23 Updated last year
- ☆14 Updated last year
- ☆114 Updated 8 months ago
- Step-by-step guide to installing on Linux/WSL Ubuntu ☆9 Updated last year
- Repo for saving cheat sheets ☆49 Updated 10 months ago
- Workshop 03: how to build a data warehouse on a low budget ☆32 Updated last year
- Companion repository for the book 'Delta Lake Up and Running' ☆46 Updated 2 weeks ago
- This repo guides you through a step-by-step method for creating a star schema dimensional model. ☆25 Updated 3 years ago
- Code for dbt tutorial ☆156 Updated 10 months ago
- Hey, this is the repo that has all the queries and data for my video game training series! ☆142 Updated 2 years ago
- Resources for the preparation course for the Databricks Data Engineer Professional certification exam ☆112 Updated 2 weeks ago
- ☆23 Updated last year
- ☆87 Updated 2 years ago
- Code for my "Efficient Data Processing in SQL" book. ☆56 Updated 8 months ago
- The Lakehouse Engine is a configuration driven Spark framework, written in Python, serving as a scalable and distributed engine for sever… ☆244 Updated 2 months ago
- Found a data engineering challenge or participated in a selection process? Share with us! ☆65 Updated 2 years ago
- ☆128 Updated 2 months ago
- ☆13 Updated last year