godatadriven / airflow-training-skeleton
Skeleton project for Apache Airflow training participants to work on.
★16 · Updated 4 years ago
Alternatives and similar repositories for airflow-training-skeleton:
Users who are interested in airflow-training-skeleton are comparing it to the libraries listed below.
- The sane way of building a data layer in Airflow ★24 · Updated 5 years ago
- Profiles the data, validates the schema and runs data quality checks and produces a report ★20 · Updated 5 years ago
- A kind data platform on your local machine. ★10 · Updated 3 weeks ago
- Examples for High Performance Spark ★15 · Updated 5 months ago
- Data validation library for PySpark 3.0.0 ★33 · Updated 2 years ago
- Event-triggered plugins for Airflow ★21 · Updated 5 years ago
- Run, schedule, and manage your dbt jobs using Kubernetes. ★24 · Updated 6 years ago
- Personal finance project to automatically collect Swiss banking transactions into a DWH and visualise them ★26 · Updated last year
- DataHub on AWS demonstration resources ★10 · Updated 2 years ago
- Utility functions for dbt projects running on Spark ★32 · Updated 2 months ago
- Big Data Demystified meetup and blog examples ★31 · Updated 8 months ago
- Airflow workflow management platform Chef cookbook. ★71 · Updated 5 years ago
- A Getting Started Guide for developing and using Airflow Plugins ★93 · Updated 6 years ago
- Data Catalog for Databases and Data Warehouses ★34 · Updated last year
- Full stack data engineering tools and infrastructure set-up ★51 · Updated 4 years ago
- Evaluation Matrix for Change Data Capture ★25 · Updated 8 months ago
- Fake Pandas / PySpark DataFrame creator ★46 · Updated last year
- ★11 · Updated 5 months ago
- A serverless DuckDB deployment at GCP ★39 · Updated 2 years ago
- Arcane Insight is a data analytics project designed to harness the power of SQLMesh & DuckDB to collect, transform, and analyze data from… ★33 · Updated 3 months ago
- ★24 · Updated 5 years ago
- ★21 · Updated 3 years ago
- Debussy is an opinionated Data Architecture and Engineering framework, enabling data analysts and engineers to build better platforms and… ★28 · Updated 2 years ago
- Yet Another (Spark) ETL Framework ★20 · Updated last year
- Delta reader for the Ray open-source toolkit for building ML applications ★45 · Updated last year
- ★49 · Updated 3 years ago
- Weekly Data Engineering Newsletter ★95 · Updated 9 months ago
- Dask integration for Snowflake ★30 · Updated 5 months ago
- Quickstart PySpark with Anaconda on AWS/EMR using Terraform ★47 · Updated 3 months ago
- ★74 · Updated last week