mrafayaleem / etl-seriesLinks
☆12Updated 4 years ago
Alternatives and similar repositories for etl-series
Users that are interested in etl-series are comparing it to the libraries listed below
Sorting:
- Airflow training for the crunch conf☆105Updated 6 years ago
- Pylint plugin for static code analysis on Airflow code☆95Updated 4 years ago
- scaffold of Apache Airflow executing Docker containers☆85Updated 2 years ago
- 🐋 Docker image for AWS Glue Spark/Python☆23Updated last year
- Bare minimal Airflow on Kubernetes (Local, EKS, AKS)☆53Updated 5 years ago
- Example orchestration pipeline for Fivetran + dbt managed by Airflow☆22Updated 4 years ago
- Project files for the post: Running PySpark Applications on Amazon EMR using Apache Airflow: Using the new Amazon Managed Workflows for A…☆41Updated 2 years ago
- A complete development environment setup for working with Airflow☆128Updated 2 years ago
- Enforce Best Practices for all your Airflow DAGs. ⭐☆102Updated this week
- (project & tutorial) dag pipeline tests + ci/cd setup☆88Updated 4 years ago
- A repository of sample code to show data quality checking best practices using Airflow.☆77Updated 2 years ago
- fast and scalable Airflow on Kubernetes Setup.☆28Updated 2 years ago
- Code to be contributed to the Apache Airflow (incubating) project for ETL workflow management for integrating with the Snowflake Data War…☆25Updated 7 years ago
- dbt (data build tool) projects targeting AWS analytics services (redshift, glue, emr, athena) and open table formats☆29Updated 2 years ago
- Astronomer Core Docker Images☆107Updated last year
- rb_status_plugin : Data confidence tool for Airflow☆12Updated 2 years ago
- Data Warehousing Made Easy with Google BigQuery and Apache Airflow☆19Updated 6 years ago
- re_data - fix data issues before your users & CEO would discover them 😊☆98Updated last year
- Learn how to add data validation and documentation to a data pipeline built with dbt and Airflow.☆169Updated last year
- A VS Code Extension to make it easier to manage and develop Spark jobs on EMR☆38Updated 4 months ago
- Terraform module to deploy an Apache Airflow cluster on AWS, backed by RDS PostgreSQL for metadata, S3 for logs and SQS as message broker…☆84Updated 2 years ago
- DBT Cloud Plugin for Airflow☆38Updated last year
- triggering a DAG run multiple times☆88Updated last year
- CICD pipeline that deploys a dbt image on a GKE cluster☆11Updated 3 years ago
- Quickstart PySpark with Anaconda on AWS/EMR using Terraform☆47Updated 5 months ago
- Great Expectations Airflow operator☆166Updated last week
- Big Data Demystified meetup and blog examples☆31Updated 10 months ago
- ☆38Updated 4 years ago
- Data-aware orchestration with dagster, dbt, and airbyte☆31Updated 2 years ago
- Glue VSCode devcontainer setup☆14Updated 2 years ago