NFLX-WIBD / WIBD-Workshops-2018
☆199Updated 3 years ago
Alternatives and similar repositories for WIBD-Workshops-2018:
Users that are interested in WIBD-Workshops-2018 are comparing it to the libraries listed below
- Projects done in the Data Engineering Nanodegree by Udacity.com☆270Updated 5 years ago
- Learning from multiple companies in Silicon Valley. Netflix, Facebook, Google, Startups☆902Updated 2 years ago
- Udacity Data Engineering Nano Degree (DEND)☆184Updated 5 years ago
- Airflow ETL for Meetup API☆46Updated 6 years ago
- A way for home buyers to know about factors affecting a state☆47Updated 6 years ago
- Apache Spark (PySpark) Practice on Real Data☆274Updated 5 years ago
- notebooks produced throughout the Udacity's Nanodegree Data Engineering Course☆73Updated 4 years ago
- Code snippets and tutorials for working with social science data in PySpark☆418Updated 7 years ago
- How to build an awesome data engineering team☆100Updated 5 years ago
- Airflow training for the crunch conf☆105Updated 6 years ago
- Fundamentals of Spark with Python (using PySpark), code examples☆343Updated 2 years ago
- My Udacity Data Engineer Nano Degree Projects aka Udacity DEND☆16Updated 5 years ago
- Data Engineering on Google Cloud Platform☆371Updated 7 months ago
- Repository used for Spark Trainings☆53Updated last year
- A full data warehouse infrastructure with ETL pipelines running inside docker on Apache Airflow for data orchestration, AWS Redshift for …☆134Updated 4 years ago
- GCP-Data-Engineer-Study-Guide☆120Updated 5 years ago
- ☆148Updated 6 years ago
- Public source code for the Udemy online course Apache Airflow: Complete Hands-On Beginner to Advanced Class.☆63Updated 4 years ago
- Tracking and measuring neighborhood and district-level eviction rates in the city of San Francisco.☆139Updated 4 years ago
- ☆143Updated last year
- A book describing how to set up and maintain Data Engineering infrastructure using Google Cloud Platform.☆121Updated 4 years ago
- (project & tutorial) dag pipeline tests + ci/cd setup☆86Updated 4 years ago
- ☆181Updated 2 years ago
- Repo to migrate old wiki to, esp for devs and code examples☆185Updated 8 years ago
- PySpark Code for Hands-on Learners☆116Updated 5 years ago
- Readings for Analytics Engineers☆241Updated 2 years ago
- Notes on Apache Spark (pyspark)☆299Updated 6 years ago
- Databricks - Apache Spark™ - 2X Certified Developer☆266Updated 4 years ago
- Educational project on how to build an ETL (Extract, Transform, Load) data pipeline, orchestrated with Airflow.☆314Updated 3 years ago
- Udacity Data Engineer Nanodegree - Capstone project☆11Updated 5 years ago