udacity / nd029-c2-apache-spark-and-spark-streaming-starter
This is the starter code for both the course and the project for Data Streaming with Spark
☆17Updated 2 years ago
Alternatives and similar repositories for nd029-c2-apache-spark-and-spark-streaming-starter
Users that are interested in nd029-c2-apache-spark-and-spark-streaming-starter are comparing it to the libraries listed below
Sorting:
- ☆143Updated last year
- AWS Big Data Certification☆25Updated 4 months ago
- Developed a data pipeline to automate data warehouse ETL by building custom airflow operators that handle the extraction, transformation,…☆90Updated 3 years ago
- An example MLFlow project☆48Updated 4 months ago
- Example Github Actions Directory☆46Updated 4 months ago
- Udacity Data Engineering Nanodegree Projects☆11Updated 5 years ago
- Some recipes for doing with serverless technologies☆38Updated 4 months ago
- [Video]AWS Certified Machine Learning-Specialty (ML-S) Guide☆121Updated 4 months ago
- ☆84Updated 2 years ago
- Partly lecture and partly a hands-on tutorial and workshop, this is a three part series on how to get started with MLflow. In this four p…☆39Updated 4 years ago
- A production-grade data pipeline has been designed to automate the parsing of user search patterns to analyze user engagement. Extract d…☆24Updated 3 years ago
- A repo to track data engineering projects☆13Updated 2 years ago
- Code Repository for AWS Certified Big Data Specialty 2019 - In Depth and Hands On!, published by Packt☆39Updated last year
- My solutions for the Udacity Data Engineering Nanodegree☆34Updated 5 years ago
- Udacity Data Streaming Nanodegree Program☆22Updated 4 years ago
- Data Streaming Nanodegree (from Udacity) exercises, projects and their solutions☆17Updated last year
- Automated Machine Learning on AWS, published by Packt☆45Updated last year
- ☆27Updated 3 years ago
- Data Engineering with Spark and Delta Lake☆98Updated 2 years ago
- Capstone Project for Udacity Data Engineering Nanodegree☆9Updated 5 years ago
- A series of Jupyter notebooks that walk you through Machine Learning with Apache Spark ecosystem using Spark MLlib, PyTorch and TensorFlo…☆82Updated last year
- Udacity Data Engineer Nano Degree - Project-3 (Data Warehouse)☆22Updated 5 years ago
- This repository shows a sample example to build, manage and orchestrate Machine Learning workflows using Amazon Sagemaker and Apache Airf…☆136Updated 3 years ago
- ☆53Updated 4 years ago
- ☆115Updated 4 years ago
- ☆15Updated 3 years ago
- GitHub repository related to the course Mastering Elastic Map Reduce for Data Engineers☆24Updated 2 years ago
- Batch Processing , orchestration using Apache Airflow and Google Workflows, spark structured Streaming and a lot more☆19Updated 2 years ago
- Code for my blogs on Data Engineering☆15Updated 4 years ago
- PyConDE & PyData Berlin 2019 Airflow Workshop: Airflow for machine learning pipelines.☆47Updated last year