mrafayaleem / etl-series
☆12 · Updated 4 years ago
Alternatives and similar repositories for etl-series:
Users who are interested in etl-series are comparing it to the repositories listed below:
- pytest plugin to run the tests with support of pyspark ☆84 · Updated 10 months ago
- Airflow training for the crunch conf ☆104 · Updated 6 years ago
- Bare minimal Airflow on Kubernetes (Local, EKS, AKS) ☆52 · Updated 4 years ago
- Pylint plugin for static code analysis on Airflow code ☆91 · Updated 4 years ago
- fast and scalable Airflow on Kubernetes Setup. ☆28 · Updated last year
- How to deploy airflow on Kubernetes ☆39 · Updated last year
- Learn how to add data validation and documentation to a data pipeline built with dbt and Airflow. ☆166 · Updated last year
- Astronomer Core Docker Images ☆106 · Updated 8 months ago
- 🐋 Docker image for AWS Glue Spark/Python ☆23 · Updated last year
- A repository of sample code to show data quality checking best practices using Airflow. ☆74 · Updated last year
- Enforce Best Practices for all your Airflow DAGs. ⭐ ☆94 · Updated this week
- Fake Pandas / PySpark DataFrame creator ☆44 · Updated 10 months ago
- Airflow Backfill UI based plugin for existing / new Airflow environment ☆66 · Updated 4 years ago
- Quickstart PySpark with Anaconda on AWS/EMR using Terraform ☆47 · Updated 3 weeks ago
- rb_status_plugin : Data confidence tool for Airflow ☆12 · Updated 2 years ago
- Example orchestration pipeline for Fivetran + dbt managed by Airflow ☆21 · Updated 3 years ago
- An example dbt project using AutomateDV to create a Data Vault 2.0 Data Warehouse based on the Snowflake TPC-H dataset. ☆45 · Updated 10 months ago
- Fast iterative local development and testing of Apache Airflow workflows ☆195 · Updated last month
- Utility functions for dbt projects running on Spark ☆31 · Updated last week
- dbt (data build tool) projects targeting AWS analytics services (redshift, glue, emr, athena) and open table formats ☆29 · Updated last year
- The Python fake data producer for Apache Kafka® is a complete demo app allowing you to quickly produce JSON fake streaming datasets and … ☆82 · Updated 9 months ago
- Creates simple data models on Snowflake to report dbt source freshness and tests ☆23 · Updated last year
- Airflow Providers containing Deferrable Operators & Sensors from Astronomer ☆142 · Updated this week
- Make simple storing test results and visualisation of these in a BI dashboard ☆40 · Updated last month
- Spark app to merge different schemas ☆23 · Updated 4 years ago
- Quickly get a kubernetes executor airflow environment provisioned on GKE. Azure Kubernetes Service instructions included also as are inst… ☆36 · Updated 4 years ago
- A CLI tool to streamline getting started with Apache Airflow™ and managing multiple Airflow projects ☆206 · Updated this week
- ☆127 · Updated 4 years ago
- Data-aware orchestration with dagster, dbt, and airbyte ☆31 · Updated 2 years ago