godatadriven / airflow-training-skeleton
Skeleton project for Apache Airflow training participants to work on.
☆16 · Updated 4 years ago
Alternatives and similar repositories for airflow-training-skeleton
Users who are interested in airflow-training-skeleton are comparing it to the repositories listed below.
- The sane way of building a data layer in Airflow ☆24 · Updated 5 years ago
- event-triggered plugins for airflow ☆21 · Updated 5 years ago
- Profiles the data, validates the schema and runs data quality checks and produces a report ☆20 · Updated 6 years ago
- Visualize dependencies between Airflow DAGs ☆49 · Updated 4 years ago
- ☆11 · Updated 6 months ago
- Data validation library for PySpark 3.0.0 ☆33 · Updated 2 years ago
- Big Data Demystified meetup and blog examples ☆31 · Updated 9 months ago
- Example Set up For DBT Cloud using Github Integrations ☆11 · Updated 5 years ago
- Amundsen Gremlin ☆21 · Updated 2 years ago
- Airflow workflow management platform chef cookbook. ☆71 · Updated 5 years ago
- Evaluation Matrix for Change Data Capture ☆25 · Updated 10 months ago
- Using the Parquet file format with Python ☆15 · Updated last year
- PySpark for ETL jobs including lineage to Apache Atlas in one script via code inspection ☆18 · Updated 8 years ago
- Fully unit tested utility functions for data engineering. Python 3 only. ☆17 · Updated 9 months ago
- Utility functions for dbt projects running on Spark ☆34 · Updated 3 months ago
- Curated list of resources about Apache Airflow ☆19 · Updated 4 years ago
- Examples for High Performance Spark ☆15 · Updated 7 months ago
- Weekly Data Engineering Newsletter ☆95 · Updated 10 months ago
- Blog post on ETL pipelines with Airflow ☆23 · Updated 4 years ago
- Personal Finance Project to automatically collect swiss banking transaction into a DWH and visualise it ☆26 · Updated last year
- A Getting Started Guide for developing and using Airflow Plugins ☆93 · Updated 6 years ago
- ☆10 · Updated 3 years ago
- A library that brings useful functions from various modern database management systems to Apache Spark ☆59 · Updated last year
- A kind data platform on your local machine. 🤗 ☆10 · Updated last week
- ⚠️ MAINTENANCE-ONLY MODE: Snowplow maintained SQL data models for working with Snowplow web and mobile behavioral data. ☆41 · Updated 4 months ago
- Composable filesystem hooks and operators for Apache Airflow. ☆17 · Updated 3 years ago
- 🐋 Docker image for AWS Glue Spark/Python ☆23 · Updated last year
- Discover the simplicity and strength of Duckdb, dbt, and Iceberg in this project. Create an efficient, versatile data analytics solution … ☆34 · Updated last year
- Yet Another (Spark) ETL Framework ☆21 · Updated last year
- Analytics engineering with dbt - projects and developer environment ☆18 · Updated 8 months ago