mrafayaleem / etl-series
☆12Updated 4 years ago
Alternatives and similar repositories for etl-series:
Users interested in etl-series are comparing it to the repositories listed below.
- rb_status_plugin : Data confidence tool for Airflow☆12Updated 2 years ago
- Fast and scalable Airflow on Kubernetes setup.☆28Updated last year
- Pylint plugin for static code analysis on Airflow code☆93Updated 4 years ago
- Airflow training for the crunch conf☆105Updated 6 years ago
- Learn how to add data validation and documentation to a data pipeline built with dbt and Airflow.☆167Updated last year
- Materials for the next course☆24Updated 2 years ago
- Quickly get a Kubernetes executor Airflow environment provisioned on GKE. Azure Kubernetes Service instructions are also included, as are inst…☆36Updated 4 years ago
- A repository of sample code to show data quality checking best practices using Airflow.☆76Updated 2 years ago
- Makes it simple to store test results and visualise them in a BI dashboard☆43Updated 3 weeks ago
- pytest plugin to run the tests with support of pyspark☆86Updated 3 weeks ago
- Astronomer Core Docker Images☆106Updated 10 months ago
- REST-like API exposing Airflow data and operations☆61Updated 6 years ago
- Airflow helm chart for AWS EKS☆18Updated 4 years ago
- Fast iterative local development and testing of Apache Airflow workflows☆200Updated this week
- Quickstart PySpark with Anaconda on AWS/EMR using Terraform☆47Updated 3 months ago
- A Terraform template for provisioning Apache Airflow workflows on AWS ECS Fargate☆14Updated 4 years ago
- Great Expectations Airflow operator☆162Updated this week
- ☆199Updated last year
- dbt (data build tool) projects targeting AWS analytics services (redshift, glue, emr, athena) and open table formats☆29Updated 2 years ago
- re_data - fix data issues before your users & CEO would discover them 😊☆98Updated 11 months ago
- 🐋 Docker image for AWS Glue Spark/Python☆23Updated last year
- This code demonstrates the architecture featured on the AWS Big Data blog (https://aws.amazon.com/blogs/big-data/) which creates a concu…☆75Updated 6 years ago
- Terraform module to deploy an Apache Airflow cluster on AWS, backed by RDS PostgreSQL for metadata, S3 for logs and SQS as message broker…☆84Updated 2 years ago
- Pipeline definitions for managing data flows to power analytics at MIT Open Learning☆43Updated this week
- Visualize dependencies between Airflow DAGs☆49Updated 3 years ago
- Enforce Best Practices for all your Airflow DAGs. ⭐☆98Updated this week
- Airflow Providers containing Deferrable Operators & Sensors from Astronomer☆147Updated this week
- On day 1 of any Airflow project, you can spin up a local desktop Kubernetes Airflow environment AND one in Google Cloud Composer with tested da…☆111Updated last year
- Utility functions for dbt projects running on Spark☆32Updated 2 months ago
- Example orchestration pipeline for Fivetran + dbt managed by Airflow☆21Updated 4 years ago