godatadriven / airflow-training-skeleton
Skeleton project for Apache Airflow training participants to work on.
☆16 Updated 4 years ago
Alternatives and similar repositories for airflow-training-skeleton:
Users who are interested in airflow-training-skeleton are comparing it to the repositories listed below.
- Profiles the data, validates the schema and runs data quality checks and produces a report ☆20 Updated 5 years ago
- event-triggered plugins for airflow ☆21 Updated 5 years ago
- ⚠️ MAINTENANCE-ONLY MODE: Snowplow maintained SQL data models for working with Snowplow web and mobile behavioral data. ☆41 Updated 2 months ago
- DataHub on AWS demonstration resources ☆10 Updated 2 years ago
- Airflow code accompanying blog post. ☆21 Updated 6 years ago
- ☆11 Updated 4 months ago
- A serverless duckDB deployment at GCP ☆38 Updated 2 years ago
- Rules based grant management for Snowflake ☆40 Updated 6 years ago
- Personal Finance Project to automatically collect swiss banking transaction into a DWH and visualise it ☆26 Updated last year
- Example Set up For DBT Cloud using Github Integrations ☆11 Updated 5 years ago
- An example PySpark project with pytest ☆17 Updated 7 years ago
- ☆24 Updated 4 years ago
- Visualize dependencies between Airflow DAGs ☆49 Updated 3 years ago
- Big Data Demystified meetup and blog examples ☆31 Updated 7 months ago
- Full stack data engineering tools and infrastructure set-up ☆50 Updated 4 years ago
- Curated list of resources about Apache Airflow ☆19 Updated 3 years ago
- The sane way of building a data layer in Airflow ☆24 Updated 5 years ago
- A Getting Started Guide for developing and using Airflow Plugins ☆93 Updated 6 years ago
- ☆20 Updated 3 years ago
- Postgres utility package for dbt (getdbt.com) ☆18 Updated last month
- Guide on how to setup Apache Airflow containers using Docker and IBM Bluemix ☆11 Updated 7 years ago
- Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validatio… ☆53 Updated last year
- Utility functions for dbt projects running on Spark ☆31 Updated last month
- Make simple storing test results and visualisation of these in a BI dashboard ☆43 Updated last week
- ☕⛵WIP PySpark dependency management ☆22 Updated 6 years ago
- Weekly Data Engineering Newsletter ☆94 Updated 8 months ago
- A repository of sample code to show data quality checking best practices using Airflow. ☆74 Updated 2 years ago
- A kind data platform on your local machine. 🤗 ☆10 Updated this week
- Composable filesystem hooks and operators for Apache Airflow. ☆17 Updated 3 years ago
- This is a real-life, high throughput streaming ELT data pipeline for ecommerce ☆13 Updated last year