aaronstone007 / Udacity-Data-StreamingLinks
Projects from Udacity Data Streaming Nanodegree
☆15Updated last year
Alternatives and similar repositories for Udacity-Data-Streaming
Users that are interested in Udacity-Data-Streaming are comparing it to the libraries listed below
Sorting:
- Developed a data pipeline to automate data warehouse ETL by building custom airflow operators that handle the extraction, transformation,…☆90Updated 3 years ago
- Repository used for Spark Trainings☆53Updated 2 years ago
- A code-based tutorial for production level data streaming with PySpark plus Optimus for data cleaning, Confluent Kafka, & Apache Drill u…☆26Updated 5 years ago
- A production-grade data pipeline has been designed to automate the parsing of user search patterns to analyze user engagement. Extract d…☆24Updated 3 years ago
- Realtime social media data analytics with Apache Spark, Python, Kafka, Pandas, etc☆51Updated 8 years ago
- This repo contains all materials regarding Udacity's data streaming nanodegree☆8Updated 5 years ago
- Various data stream/batch process demo with Apache Scala Spark 🚀☆11Updated 5 years ago
- AWS Big Data Certification☆25Updated 4 months ago
- ☆150Updated 7 years ago
- Apache Spark Interview Question and Answers☆21Updated 4 years ago
- Educational notes,Hands on problems w/ solutions for hadoop ecosystem☆87Updated 6 years ago
- Simplified ETL process in Hadoop using Apache Spark. Has complete ETL pipeline for datalake. SparkSession extensions, DataFrame validatio…☆55Updated 2 years ago
- Learning from multiple companies in Silicon Valley. Netflix, Facebook, Google, Startups☆16Updated 6 years ago
- Demonstration of using Apache Spark to build robust ETL pipelines while taking advantage of open source, general purpose cluster computin…☆24Updated last year
- Udacity Data Engineering Nanodegree Projects☆11Updated 5 years ago
- Quickstart PySpark with Anaconda on AWS/EMR using Terraform☆47Updated 4 months ago
- Repo for all my code on the articles I post on medium☆107Updated 2 years ago
- PySpark Code for Hands-on Learners☆116Updated 5 years ago
- Batch Processing , orchestration using Apache Airflow and Google Workflows, spark structured Streaming and a lot more☆18Updated 2 years ago
- Mastering Spark for Data Science, published by Packt☆47Updated 2 years ago
- Spark and Python (PySpark) Examples☆39Updated 3 years ago
- code, labs and lectures for the course☆47Updated 2 years ago
- A repo to track data engineering projects☆13Updated 2 years ago
- ☆16Updated 2 years ago
- PyConDE & PyData Berlin 2019 Airflow Workshop: Airflow for machine learning pipelines.☆47Updated last year
- My Git Repo for Csv Data☆21Updated 4 years ago
- My Study guide used to pass the CRT020 Spark Certification exam☆33Updated 5 years ago
- Airflow training for the crunch conf☆105Updated 6 years ago
- Source code for the MC technical blog post "Data Observability in Practice Using SQL"☆38Updated 10 months ago
- ☆18Updated 7 years ago