ThomazRossito / Data-Quality-Pyspark
☆9 Updated 2 months ago
Alternatives and similar repositories for Data-Quality-Pyspark
Users who are interested in Data-Quality-Pyspark are comparing it to the repositories listed below
- Demo DAGs that show how to run dbt Core in Airflow using Cosmos ☆62 Updated last month
- Spark development environment for Kubernetes, spark-submit and Jupyter notebook ☆19 Updated 3 years ago
- Data Engineering with Apache Spark ☆42 Updated 3 years ago
- An ETL Orchestration using Apache Airflow to extract CSV files from a Google Drive, validate, transform, and load into a PostgreSQL datab… ☆24 Updated 11 months ago
- Notebooks and tips about Databricks ☆20 Updated 7 months ago
- ☆11 Updated 3 months ago
- ☆39 Updated 11 months ago
- Retail data pipeline using Airflow, dbt, Soda, GCP (GCS and BigQuery) and Metabase ☆37 Updated 11 months ago
- ☆24 Updated last year
- workshop 03 - how to build a data warehouse at low cost ☆33 Updated last year
- ☆133 Updated 4 months ago
- ☆131 Updated 10 months ago
- OSS data stack project ☆12 Updated 2 months ago
- Code snippets for Data Engineering Design Patterns book ☆119 Updated 3 months ago
- Modern Data Stack ☆62 Updated 10 months ago
- Local Environment to Practice Data Engineering ☆142 Updated 5 months ago
- Code for dbt tutorial ☆156 Updated 3 weeks ago
- ☆13 Updated last year
- This is an ETL application on AWS with general open sales and customer data that you can find here: https://github.com/camposvinicius/dat… ☆17 Updated 3 years ago
- Code for blog at: https://www.startdataengineering.com/post/docker-for-de/