brfulu / airflow-data-pipeline
Udacity Data Engineer Nanodegree - Airflow data pipeline
☆10Updated 4 years ago
Related projects ⓘ
Alternatives and complementary repositories for airflow-data-pipeline
- Udacity Data Engineer Nano Degree - Project-3 (Data Warehouse)☆22Updated 5 years ago
- Code to build a simple analytics data pipeline with Python☆102Updated 7 years ago
- Udacity Data Engineer Nanodegree - Capstone project☆10Updated 4 years ago
- ELT Code for your Data Warehouse☆26Updated last year
- My solutions for the Udacity Data Engineering Nanodegree☆33Updated 5 years ago
- Udacity Data Engineering Nanodegree Projects☆11Updated 5 years ago
- Airflow training for the crunch conf☆105Updated 6 years ago
- Quickstart PySpark with Anaconda on AWS/EMR using Terraform☆47Updated 11 months ago
- Example project for consuming AWS Kinesis streamming and save data on Amazon Redshift using Apache Spark☆11Updated 6 years ago
- Source code for the MC technical blog post "Data Observability in Practice Using SQL"☆36Updated 4 months ago
- Data lake, data warehouse on GCP☆54Updated 2 years ago
- Repository used for Spark Trainings☆53Updated last year
- Data models, build data warehouses and data lakes, automate data pipelines, and worked with massive datasets.☆13Updated 5 years ago
- Project files for the post: Running PySpark Applications on Amazon EMR using Apache Airflow: Using the new Amazon Managed Workflows for A…☆41Updated 2 years ago
- (project & tutorial) dag pipeline tests + ci/cd setup☆85Updated 3 years ago
- Twitter Sentiment Analysis using Spark and Kafka☆114Updated 5 years ago
- Blog post on ETL pipelines with Airflow☆23Updated 4 years ago
- Sentiment Analysis of a Twitter Topic with Spark Structured Streaming☆55Updated 5 years ago
- Rules based grant management for Snowflake☆40Updated 5 years ago
- AWS Big Data Certification☆25Updated last year
- Use Airflow to move data from multiple MySQL databases to BigQuery☆99Updated 4 years ago
- ☆22Updated 4 years ago
- GCP-Data-Engineer-Study-Guide☆118Updated 5 years ago
- A _simple_ starter template for Snowflake Cloud Data Platform☆39Updated 2 years ago
- Code for my blogs on Data Engineering☆15Updated 4 years ago
- Demonstration of using Apache Spark to build robust ETL pipelines while taking advantage of open source, general purpose cluster computin…☆24Updated last year
- Example custom model image trainable and distributable via AWS SageMaker☆36Updated last year