marclamberti / webinar-airflow-chart
Materials of the Official Helm Chart Webinar
☆27 · Updated 3 years ago
Alternatives and similar repositories for webinar-airflow-chart:
Users who are interested in webinar-airflow-chart are comparing it to the repositories listed below.
- Streaming Synthetic Sales Data Generator: Streaming sales data generator for Apache Kafka, written in Python ☆43 · Updated 2 years ago
- ☆20 · Updated 3 years ago
- ☆71 · Updated 2 months ago
- Delta Lake Documentation ☆49 · Updated 9 months ago
- Code snippets for Data Engineering Design Patterns book ☆74 · Updated last week
- A repository of sample code to show data quality checking best practices using Airflow. ☆74 · Updated 2 years ago
- Code for my "Efficient Data Processing in SQL" book. ☆56 · Updated 7 months ago
- Spark on Kubernetes using Helm ☆34 · Updated 4 years ago
- Delta Lake examples ☆218 · Updated 5 months ago
- PySpark data-pipeline testing and CICD ☆28 · Updated 4 years ago
- Enforce Best Practices for all your Airflow DAGs. ⭐ ☆97 · Updated this week
- Building Big Data Pipelines with Apache Beam, published by Packt ☆86 · Updated 2 years ago
- Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validatio… ☆53 · Updated last year
- Orchestrate Spark Jobs from Kubeflow Pipelines and poll for the status. ☆53 · Updated 2 years ago
- Data Engineering with Spark and Delta Lake ☆96 · Updated 2 years ago
- A Python Library to support running data quality rules while the spark job is running⚡ ☆180 · Updated last week
- Full stack data engineering tools and infrastructure set-up ☆50 · Updated 4 years ago
- Simple stream processing pipeline ☆99 · Updated 9 months ago
- A Table format agnostic data sharing framework ☆38 · Updated last year
- ☆75 · Updated 5 months ago
- Learn how to add data validation and documentation to a data pipeline built with dbt and Airflow. ☆167 · Updated last year
- A repository of sample code to accompany our blog post on Airflow and dbt. ☆170 · Updated last year
- Delta-Lake, ETL, Spark, Airflow ☆46 · Updated 2 years ago
- Trino dbt demo project to mix and load BigQuery data with and in a local PostgreSQL database ☆72 · Updated 3 years ago
- Dockerizing an Apache Spark Standalone Cluster ☆43 · Updated 2 years ago
- A Python package that creates fine-grained dbt tasks on Apache Airflow ☆65 · Updated 6 months ago
- (project & tutorial) dag pipeline tests + ci/cd setup ☆86 · Updated 4 years ago
- Pyspark boilerplate for running prod ready data pipeline ☆28 · Updated 4 years ago
- Quick Guides from Dremio on Several topics ☆69 · Updated 2 months ago
- Airflow Providers containing Deferrable Operators & Sensors from Astronomer ☆146 · Updated last week