godatadriven / airflow-training-skeleton
Skeleton project for Apache Airflow training participants to work on.
☆16 Updated 4 years ago
Alternatives and similar repositories for airflow-training-skeleton
Users who are interested in airflow-training-skeleton are comparing it to the repositories listed below.
- event-triggered plugins for airflow ☆21 Updated 5 years ago
- Big Data Demystified meetup and blog examples ☆31 Updated 10 months ago
- This is a real-life, high throughput streaming ELT data pipeline for ecommerce ☆13 Updated 2 years ago
- Analytics engineering with dbt - projects and developer environment ☆18 Updated 8 months ago
- The sane way of building a data layer in Airflow ☆24 Updated 5 years ago
- Examples for High Performance Spark ☆16 Updated 7 months ago
- ☆11 Updated 7 months ago
- Example Set up For DBT Cloud using Github Integrations ☆11 Updated 5 years ago
- Visualize dependencies between Airflow DAGs ☆49 Updated 4 years ago
- DataHub on AWS demonstration resources ☆10 Updated 2 years ago
- Full stack data engineering tools and infrastructure set-up ☆53 Updated 4 years ago
- ☕⛵WIP PySpark dependency management ☆22 Updated 6 years ago
- 📆 Run, schedule, and manage your dbt jobs using Kubernetes. ☆24 Updated 6 years ago
- Data validation library for PySpark 3.0.0 ☆33 Updated 2 years ago
- Utility functions for dbt projects running on Spark ☆34 Updated 4 months ago
- Profiles the data, validates the schema and runs data quality checks and produces a report ☆20 Updated 6 years ago
- ☆21 Updated 4 years ago
- Weekly Data Engineering Newsletter ☆96 Updated 11 months ago
- Ibis analytics, with Ibis (and more!) ☆22 Updated 9 months ago
- AWS Big Data Certification ☆25 Updated 5 months ago
- Airflow workflow management platform chef cookbook. ☆71 Updated 5 years ago
- Fake Pandas / PySpark DataFrame creator ☆47 Updated last year
- Source code for 'PySpark Recipes' by Raju Kumar Mishra ☆25 Updated 5 years ago
- Repo for holding the dbt project used to make sense of cloud cost data from the major cloud platforms ☆37 Updated 5 years ago
- Quickstart PySpark with Anaconda on AWS/EMR using Terraform ☆47 Updated 5 months ago
- dbt / Amazon Redshift Demonstration Project ☆34 Updated 2 years ago
- A repository of sample code to show data quality checking best practices using Airflow. ☆77 Updated 2 years ago
- A modern ELT demo using airbyte, dbt, snowflake and dagster ☆28 Updated 2 years ago
- A guide for leading a data (engineering) team ☆64 Updated last year
- Sample project that use Dagster, dbt, DuckDB and Dash to visualize car and motorcycle Spanish market ☆57 Updated 2 years ago