shabie / streaming_nd
Data Streaming Nanodegree (from Udacity) exercises, projects and their solutions
☆16Updated last year
Alternatives and similar repositories for streaming_nd:
Users that are interested in streaming_nd are comparing it to the libraries listed below
- This is the starter code for both the course and the project for Data Streaming with Spark☆16Updated 2 years ago
- A repo to track data engineering projects☆13Updated 2 years ago
- ☆11Updated 3 years ago
- Udacity Data Streaming Nanodegree Program☆22Updated 4 years ago
- My solutions for the Udacity Data Engineering Nanodegree☆33Updated 5 years ago
- plan, design and implement enterprise data infrastructure solutions and create the blueprints for an organization’s data management syste…☆11Updated last year
- Data engineering interviews Q&A for data community by data community☆63Updated 4 years ago
- Udacity Data Engineering Nanodegree Program☆51Updated 3 years ago
- ( These solutions tested on 4 node Hortonwork cluster on my laptop. Do not test on your production environment until you test... :)☆21Updated 4 years ago
- Data models, build data warehouses and data lakes, automate data pipelines, and worked with massive datasets.☆13Updated 5 years ago
- Machine Learning DevOps Engineer Nanodegree☆10Updated 3 years ago
- Because its never late to start taking notes and 'public' it...☆60Updated 3 months ago
- Lecture notes, lab notes, and links to helpful resources to pass Google Certification Exam for Professional Data Engineer.☆17Updated 2 years ago
- PySpark Code for Hands-on Learners☆116Updated 5 years ago
- PySpark-ETL☆23Updated 5 years ago
- Apache Spark using SQL☆14Updated 3 years ago
- Projects from Udacity Data Streaming Nanodegree☆15Updated last year
- ☆10Updated 4 years ago
- A real-time streaming ETL pipeline for streaming and performing sentiment analysis on Twitter data using Apache Kafka, Apache Spark and D…☆30Updated 4 years ago
- Unit testing using databricks connect☆30Updated 3 years ago
- Data Engineering Capstone Project: ETL Pipelines and Data Warehouse Development☆21Updated 5 years ago
- ☆87Updated 2 years ago
- Udacity Data Engineer Nano Degree - Project-3 (Data Warehouse)☆22Updated 5 years ago
- Repository for Spark using Python material. It is popularly known as PySpark.☆19Updated 3 years ago
- Repository for Data Engineering Interview Series☆28Updated 4 months ago
- ☆31Updated 6 years ago
- ☆19Updated 6 years ago
- Repository used for Spark Trainings☆53Updated last year
- Demonstration of using Apache Spark to build robust ETL pipelines while taking advantage of open source, general purpose cluster computin…☆24Updated last year
- 😈Complete End to End ETL Pipeline with Spark, Airflow, & AWS☆43Updated 5 years ago