NFLX-WIBD / WIBD-Workshops-2018Links
☆199Updated 3 years ago
Alternatives and similar repositories for WIBD-Workshops-2018
Users that are interested in WIBD-Workshops-2018 are comparing it to the libraries listed below
Sorting:
- Projects done in the Data Engineering Nanodegree by Udacity.com☆273Updated 5 years ago
- Airflow ETL for Meetup API☆45Updated 6 years ago
- A way for home buyers to know about factors affecting a state☆47Updated 6 years ago
- Udacity Data Engineering Nano Degree (DEND)☆185Updated 5 years ago
- Learning from multiple companies in Silicon Valley. Netflix, Facebook, Google, Startups☆898Updated 3 years ago
- Apache Spark (PySpark) Practice on Real Data☆274Updated 5 years ago
- Repo to migrate old wiki to, esp for devs and code examples☆185Updated 8 years ago
- Airflow training for the crunch conf☆105Updated 6 years ago
- Airflow basics tutorial☆397Updated 3 years ago
- Code snippets and tutorials for working with social science data in PySpark☆421Updated 7 years ago
- Data Engineering on Google Cloud Platform☆375Updated 11 months ago
- Fundamentals of Spark with Python (using PySpark), code examples☆350Updated 2 years ago
- notebooks produced throughout the Udacity's Nanodegree Data Engineering Course☆73Updated 4 years ago
- Notes on Apache Spark (pyspark)☆298Updated 6 years ago
- ☆143Updated 2 years ago
- LearningApacheSpark☆245Updated last year
- Tracking and measuring neighborhood and district-level eviction rates in the city of San Francisco.☆139Updated 5 years ago
- ☆150Updated 7 years ago
- GCP-Data-Engineer-Study-Guide☆120Updated 5 years ago
- Course materials for my data pipeline video course with O'Reilly☆199Updated 7 years ago
- ☆179Updated 2 years ago
- How to build an awesome data engineering team☆100Updated 5 years ago
- PySpark Code for Hands-on Learners☆116Updated 5 years ago
- A book describing how to set up and maintain Data Engineering infrastructure using Google Cloud Platform.☆124Updated 4 years ago
- (project & tutorial) dag pipeline tests + ci/cd setup☆88Updated 4 years ago
- A full data warehouse infrastructure with ETL pipelines running inside docker on Apache Airflow for data orchestration, AWS Redshift for …☆137Updated 5 years ago
- Code to build a simple analytics data pipeline with Python☆102Updated 8 years ago
- Repository used for Spark Trainings☆53Updated 2 years ago
- My Udacity Data Engineer Nano Degree Projects aka Udacity DEND☆16Updated 5 years ago
- ☆763Updated 5 years ago