dai-dao / udacity-data-engineering-capstone
Capstone Project for Udacity Data Engineering Nanodegree
☆9Updated 5 years ago
Alternatives and similar repositories for udacity-data-engineering-capstone
Users that are interested in udacity-data-engineering-capstone are comparing it to the libraries listed below
Sorting:
- Data Streaming Nanodegree (from Udacity) exercises, projects and their solutions☆17Updated last year
- ☆143Updated last year
- This is the starter code for both the course and the project for Data Streaming with Spark☆17Updated 2 years ago
- Udacity Data Engineering Nanodegree Projects☆11Updated 5 years ago
- Udacity Data Engineering Nano Degree (DEND)☆185Updated 5 years ago
- Projects done in the Data Engineering Nanodegree by Udacity.com☆272Updated 5 years ago
- ☆150Updated 7 years ago
- My Udacity Data Engineer Nano Degree Projects aka Udacity DEND☆16Updated 5 years ago
- Udacity Data Engineer Nano Degree - Project-3 (Data Warehouse)☆22Updated 5 years ago
- Jupyter notebooks for pyspark tutorials given at University☆107Updated 5 months ago
- Udacity Data Streaming Nanodegree Program☆22Updated 4 years ago
- notebooks produced throughout the Udacity's Nanodegree Data Engineering Course☆73Updated 4 years ago
- Airflow training for the crunch conf☆105Updated 6 years ago
- Data Engineering Capstone Project: ETL Pipelines and Data Warehouse Development☆21Updated 5 years ago
- Udacity Data Engineering Nanodegree Capstone Project☆36Updated 5 years ago
- A repository for a PySpark Cookbook by Tomasz Drabas and Denny Lee☆59Updated 6 years ago
- Developed a data pipeline to automate data warehouse ETL by building custom airflow operators that handle the extraction, transformation,…☆90Updated 3 years ago
- Udacity Data Engineer Nanodegree - Capstone project☆11Updated 5 years ago
- My solutions for the Udacity Data Engineering Nanodegree☆34Updated 5 years ago
- Repository used for Spark Trainings☆53Updated 2 years ago
- Big Data Engineering practice project, including ETL with Airflow and Spark using AWS S3 and EMR☆83Updated 5 years ago
- Apache Spark 3 - Structured Streaming Course Material☆122Updated last year
- A repo to track data engineering projects☆13Updated 2 years ago
- Repo to migrate old wiki to, esp for devs and code examples☆185Updated 8 years ago
- Partly lecture and partly a hands-on tutorial and workshop, this is a three part series on how to get started with MLflow. In this four p…☆39Updated 4 years ago
- ETL pipeline using pyspark (Spark - Python)☆114Updated 5 years ago
- Data lake, data warehouse on GCP☆56Updated 3 years ago
- A real-time streaming ETL pipeline for streaming and performing sentiment analysis on Twitter data using Apache Kafka, Apache Spark and D…☆30Updated 4 years ago
- Educational notes,Hands on problems w/ solutions for hadoop ecosystem☆87Updated 6 years ago
- Create Data Lake on AWS S3 to store dimensional tables after processing data using Spark on AWS EMR cluster☆9Updated 5 years ago