godatadriven / airflow-training-skeletonLinks
Skeleton project for Apache Airflow training participants to work on.
☆17Updated 5 years ago
Alternatives and similar repositories for airflow-training-skeleton
Users that are interested in airflow-training-skeleton are comparing it to the libraries listed below
Sorting:
- ☆23Updated 4 years ago
- Utility functions for dbt projects running on Spark☆34Updated 2 weeks ago
- Repo for holding the dbt project used to make sense of cloud cost data from the major cloud platforms☆38Updated 5 years ago
- Airflow declarative DAGs via YAML☆133Updated 2 years ago
- A Getting Started Guide for developing and using Airflow Plugins☆93Updated 7 years ago
- Full stack data engineering tools and infrastructure set-up☆57Updated 4 years ago
- ☆48Updated 4 years ago
- Fast iterative local development and testing of Apache Airflow workflows☆202Updated last week
- Airflow workflow management platform chef cookbook.☆70Updated 6 years ago
- Airflow Unit Tests and Integration Tests☆261Updated 3 years ago
- The sane way of building a data layer in Airflow☆24Updated 6 years ago
- event-triggered plugins for airflow☆21Updated 6 years ago
- Rules based grant management for Snowflake☆41Updated 6 years ago
- locopy: Loading/Unloading to Redshift and Snowflake using Python.☆115Updated last week
- Visualize dependencies between Airflow DAGs☆49Updated 4 years ago
- Weekly Data Engineering Newsletter☆96Updated last year
- The dbt adapter for Firebolt☆30Updated this week
- Data validation library for PySpark 3.0.0☆33Updated 3 years ago
- Quickstart PySpark with Anaconda on AWS/EMR using Terraform☆47Updated 11 months ago
- Apache Airflow CI pipeline☆19Updated 6 years ago
- An Airflow docker image preconfigured to work well with Spark and Hadoop/EMR☆175Updated 7 months ago
- Soda Spark is a PySpark library that helps you with testing your data in Spark Dataframes☆63Updated 3 years ago
- A CLI to manage and monitor permissions in AWS Lake Formation☆25Updated 2 years ago
- PySpark data-pipeline testing and CICD☆28Updated 5 years ago
- Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validatio…☆55Updated 2 years ago
- A curated list of dagster code snippets for data engineers☆56Updated last year
- ☆99Updated 2 years ago
- Soda SQL and Soda Spark have been deprecated and replaced by Soda Core. docs.soda.io/soda-core/overview.html☆62Updated 3 years ago
- The open source version of the Amazon Redshift Cluster Management Guide.☆48Updated 2 years ago
- The go to demo for public and private dbt Learn☆80Updated 9 months ago