NFLX-WIBD / WIBD-Workshops-2018
☆199Updated 3 years ago
Alternatives and similar repositories for WIBD-Workshops-2018:
Users that are interested in WIBD-Workshops-2018 are comparing it to the libraries listed below
- Projects done in the Data Engineering Nanodegree by Udacity.com☆272Updated 5 years ago
- Udacity Data Engineering Nano Degree (DEND)☆185Updated 5 years ago
- Airflow ETL for Meetup API☆46Updated 6 years ago
- Learning from multiple companies in Silicon Valley. Netflix, Facebook, Google, Startups☆897Updated 2 years ago
- Tracking and measuring neighborhood and district-level eviction rates in the city of San Francisco.☆140Updated 4 years ago
- notebooks produced throughout the Udacity's Nanodegree Data Engineering Course☆73Updated 4 years ago
- A way for home buyers to know about factors affecting a state☆47Updated 6 years ago
- How to build an awesome data engineering team☆100Updated 5 years ago
- GCP-Data-Engineer-Study-Guide☆121Updated 5 years ago
- A book describing how to set up and maintain Data Engineering infrastructure using Google Cloud Platform.☆123Updated 4 years ago
- A full data warehouse infrastructure with ETL pipelines running inside docker on Apache Airflow for data orchestration, AWS Redshift for …☆136Updated 5 years ago
- Airflow training for the crunch conf☆105Updated 6 years ago
- Data Engineering on Google Cloud Platform☆372Updated 9 months ago
- ☆150Updated 7 years ago
- Code to build a simple analytics data pipeline with Python☆102Updated 8 years ago
- My solutions for the Udacity Data Engineering Nanodegree☆33Updated 5 years ago
- Databricks - Apache Spark™ - 2X Certified Developer☆267Updated 4 years ago
- Apache Spark (PySpark) Practice on Real Data☆273Updated 5 years ago
- Beginner data engineering project - batch edition☆516Updated 3 months ago
- Repo to migrate old wiki to, esp for devs and code examples☆185Updated 8 years ago
- Fundamentals of Spark with Python (using PySpark), code examples☆344Updated 2 years ago
- Public source code for the Udemy online course Apache Airflow: Complete Hands-On Beginner to Advanced Class.☆63Updated 4 years ago
- (project & tutorial) dag pipeline tests + ci/cd setup☆87Updated 4 years ago
- Repository used for Spark Trainings☆53Updated 2 years ago
- My Udacity Data Engineer Nano Degree Projects aka Udacity DEND☆16Updated 5 years ago
- ETL pipeline using pyspark (Spark - Python)☆114Updated 5 years ago
- Data pipeline performing ETL to AWS Redshift using Spark, orchestrated with Apache Airflow☆144Updated 4 years ago
- PySpark Code for Hands-on Learners☆116Updated 5 years ago
- An end-to-end GoodReads Data Pipeline for Building Data Lake, Data Warehouse and Analytics Platform.☆1,373Updated 5 years ago
- Code snippets and tutorials for working with social science data in PySpark☆420Updated 7 years ago