NFLX-WIBD / WIBD-Workshops-2018Links
☆199Updated 3 years ago
Alternatives and similar repositories for WIBD-Workshops-2018
Users that are interested in WIBD-Workshops-2018 are comparing it to the libraries listed below
Sorting:
- Projects done in the Data Engineering Nanodegree by Udacity.com☆273Updated 5 years ago
- Udacity Data Engineering Nano Degree (DEND)☆185Updated 5 years ago
- Learning from multiple companies in Silicon Valley. Netflix, Facebook, Google, Startups☆898Updated 3 years ago
- Airflow ETL for Meetup API☆46Updated 6 years ago
- How to build an awesome data engineering team☆100Updated 5 years ago
- A way for home buyers to know about factors affecting a state☆48Updated 6 years ago
- Tracking and measuring neighborhood and district-level eviction rates in the city of San Francisco.☆139Updated 4 years ago
- Repo to migrate old wiki to, esp for devs and code examples☆185Updated 8 years ago
- Apache Spark (PySpark) Practice on Real Data☆274Updated 5 years ago
- notebooks produced throughout the Udacity's Nanodegree Data Engineering Course☆73Updated 4 years ago
- Fundamentals of Spark with Python (using PySpark), code examples☆350Updated 2 years ago
- Airflow training for the crunch conf☆105Updated 6 years ago
- A full data warehouse infrastructure with ETL pipelines running inside docker on Apache Airflow for data orchestration, AWS Redshift for …☆137Updated 5 years ago
- ☆150Updated 7 years ago
- Data Engineering on Google Cloud Platform☆373Updated 10 months ago
- Course materials for my data pipeline video course with O'Reilly☆198Updated 7 years ago
- Code snippets and tutorials for working with social science data in PySpark☆421Updated 7 years ago
- GCP-Data-Engineer-Study-Guide☆120Updated 5 years ago
- Code to build a simple analytics data pipeline with Python☆102Updated 8 years ago
- Readings for Analytics Engineers☆249Updated 2 years ago
- A book describing how to set up and maintain Data Engineering infrastructure using Google Cloud Platform.☆124Updated 4 years ago
- Repository used for Spark Trainings☆53Updated 2 years ago
- ☆143Updated 2 years ago
- Databricks - Apache Spark™ - 2X Certified Developer☆266Updated 4 years ago
- (project & tutorial) dag pipeline tests + ci/cd setup☆88Updated 4 years ago
- Repository created to host udacity data engineer exercises☆11Updated 5 years ago
- Event data simulator. Generates a stream of pseudo-random events from a set of users, designed to simulate web traffic.☆524Updated 3 years ago
- Public source code for the Udemy online course Apache Airflow: Complete Hands-On Beginner to Advanced Class.☆63Updated 4 years ago
- Educational project on how to build an ETL (Extract, Transform, Load) data pipeline, orchestrated with Airflow.☆325Updated 3 years ago
- Data pipeline performing ETL to AWS Redshift using Spark, orchestrated with Apache Airflow☆147Updated 5 years ago