benjigoldberg / udacity-airflow
Udacity Data Pipeline Exercises
☆15Updated 4 years ago
Alternatives and similar repositories for udacity-airflow:
Users that are interested in udacity-airflow are comparing it to the libraries listed below
- Helping you get Airflow running in production.☆9Updated 5 years ago
- Developed a data pipeline to automate data warehouse ETL by building custom airflow operators that handle the extraction, transformation,…☆90Updated 3 years ago
- Just a boilerplate for PySpark and Flask☆35Updated 6 years ago
- Quickstart PySpark with Anaconda on AWS/EMR using Terraform☆47Updated 2 months ago
- Sharing interesting and noteworthy Data Engineering content☆67Updated 8 years ago
- AWS Big Data Certification☆25Updated 2 months ago
- Udacity Data Engineering Nanodegree Projects☆11Updated 5 years ago
- ☆10Updated 6 years ago
- A curated list of all the awesome examples, articles, tutorials and videos for Apache Airflow.☆96Updated 4 years ago
- 🚨 Simple, self-contained fraud detection system built with Apache Kafka and Python☆86Updated 5 years ago
- My solutions for the Udacity Data Engineering Nanodegree☆33Updated 5 years ago
- Code to build a simple analytics data pipeline with Python☆102Updated 8 years ago
- Blog post on ETL pipelines with Airflow☆23Updated 4 years ago
- Airflow training for the crunch conf☆105Updated 6 years ago
- A repo to track data engineering projects☆13Updated 2 years ago
- 🐍💨 Airflow tutorial for PyCon 2019☆85Updated 2 years ago
- Data Engineering pipeline hosted entirely in the AWS ecosystem utilizing DocumentDB as the database☆13Updated 3 years ago
- Simple alert system implemented in Kafka and Python☆95Updated 6 years ago
- Code to be contributed to the Apache Airflow (incubating) project for ETL workflow management for integrating with the Snowflake Data War…☆25Updated 7 years ago
- Example of an ETL Pipeline using Airflow☆34Updated 7 years ago
- Use Airflow to move data from multiple MySQL databases to BigQuery☆100Updated 4 years ago
- Slowly Changing Dimension type 2 using Hive query language using exclusive join technique with ORC Hive tables, partitioned and clustered…☆16Updated 5 years ago
- ☆16Updated 7 years ago
- Challenge for those applying to the Software Engineer, Big Data position☆34Updated 13 years ago
- Airflow helm chart for AWS EKS☆18Updated 4 years ago
- ☆26Updated 4 years ago
- An API to Analyze Cab GeoLocation Data and a Simulated App for finding an available cab in Real-Time☆63Updated 10 years ago
- ELT Code for your Data Warehouse☆26Updated last year
- This repo demonstrates how to load a sample Parquet formatted file from an AWS S3 Bucket. A python job will then be submitted to a Apach…☆19Updated 8 years ago
- (project & tutorial) dag pipeline tests + ci/cd setup☆86Updated 4 years ago