gfelot / DEND-Data_Pipeline_Airflow
Learn to build a data pipeline with Airflow to automate wrangling data - An Udacity Data Engineer Nano Degree Project
☆8Updated 5 years ago
Related projects ⓘ
Alternatives and complementary repositories for DEND-Data_Pipeline_Airflow
- My solutions for the Udacity Data Engineering Nanodegree☆33Updated 5 years ago
- Udacity Data Engineering Nanodegree Projects☆11Updated 5 years ago
- Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validatio…☆53Updated last year
- ☆19Updated 6 years ago
- Data engineering interviews Q&A for data community by data community☆61Updated 4 years ago
- Developed an ETL pipeline for a Data Lake that extracts data from S3, processes the data using Spark, and loads the data back into S3 as …☆16Updated 5 years ago
- ETL (Extract, Transform and Load) with the Spark Python API (PySpark) and Hadoop Distributed File System (HDFS)☆14Updated 5 years ago
- A production-grade data pipeline has been designed to automate the parsing of user search patterns to analyze user engagement. Extract d…☆24Updated 2 years ago
- Udacity Data Engineer Nanodegree - Capstone project☆10Updated 4 years ago
- Python Notes on IPython Notebook files.☆37Updated 3 years ago
- Get started scripts with Snowflake - Build for the Cloud Data Warehouse☆7Updated 2 years ago
- Data Engineering pipeline hosted entirely in the AWS ecosystem utilizing DocumentDB as the database☆13Updated 3 years ago
- PySpark Cheatsheet☆35Updated last year
- A repo to track data engineering projects☆13Updated 2 years ago
- ☆15Updated 3 years ago
- ☆47Updated 2 years ago
- An example CI/CD pipeline using GitHub Actions for doing continuous deployment of AWS Glue jobs built on PySpark and Jupyter Notebooks.☆12Updated 4 years ago
- Source code for 'Building a Data Warehouse' by Vincent Rainardi☆28Updated 7 years ago
- Analyzing and calculating key marketing metrics with SQL and Python☆14Updated 5 years ago
- Developed a data pipeline to automate data warehouse ETL by building custom airflow operators that handle the extraction, transformation,…☆89Updated 2 years ago
- Predicting customer churn using scikit-learn☆9Updated 6 years ago
- Course on Udemy by Jose Portilla☆97Updated 6 years ago
- ☆18Updated 3 years ago
- Udacity Data Engineer Nano Degree - Project-3 (Data Warehouse)☆22Updated 5 years ago
- notebooks produced throughout the Udacity's Nanodegree Data Engineering Course☆72Updated 4 years ago
- Cloned by the `dbt init` task☆59Updated 6 months ago
- A modern ELT demo using airbyte, dbt, snowflake and dagster☆24Updated last year
- Data models, build data warehouses and data lakes, automate data pipelines, and worked with massive datasets.☆13Updated 5 years ago
- All the Snowflake Virtual Warehouse - Example☆11Updated 4 years ago
- Data Quest - Data Engineer Learning and Projects☆24Updated 5 years ago