Gigitsu / Big-Data-Analysis-with-Scala-and-Spark
Homeworks repository for the Big Data Analysis with Scala and Spark Coursera course
☆15 · Updated 2 months ago
Related projects:
- Essential PySpark for Scalable Data Analytics, published by Packt ☆43 · Updated last year
- Databricks Certified Associate Spark Developer preparation toolkit to setup single node Standalone Spark Cluster along with material in t… ☆28 · Updated 5 months ago
- PySpark-ETL ☆23 · Updated 4 years ago
- Apache Spark using SQL ☆14 · Updated 3 years ago
- ☆19 · Updated 6 years ago
- Because its never late to start taking notes and 'public' it... ☆59 · Updated 5 months ago
- Learning PySpark video series ☆11 · Updated 6 years ago
- Data Engineering Capstone Project: ETL Pipelines and Data Warehouse Development ☆20 · Updated 5 years ago
- ⭕️ Data Engineering for Data Scientists ☆77 · Updated last year
- Reference code base for ML Engineering, Manning Publications ☆120 · Updated 3 years ago
- A series of Jupyter notebooks that walk you through Machine Learning with Apache Spark ecosystem using Spark MLlib, PyTorch and TensorFlo… ☆73 · Updated 11 months ago
- An example MLFlow project ☆47 · Updated 2 years ago
- Scaling Machine Learning in Three Week course in a collaboration with O'Reilly following the guidance of Adi Polak's book - Scaling Machi… ☆22 · Updated last year
- ☆84 · Updated 2 years ago
- Machine Learning Engineering on AWS, published by Packt ☆66 · Updated 5 months ago
- Data Engineering with Databricks Cookbook, published by Packt ☆26 · Updated 3 months ago
- PySpark Cheatsheet ☆35 · Updated last year
- Source Code for 'Applied Data Science Using PySpark' by Ramcharan Kakarla, Sundar Krishnan, and Sridhar Alla ☆43 · Updated 3 years ago
- Jupyter notebooks for pyspark tutorials given at University ☆102 · Updated 3 weeks ago
- ☆27 · Updated 4 years ago
- [Video] AWS Certified Machine Learning-Specialty (ML-S) Guide ☆119 · Updated last year
- Example MLOps using BentoML & mlFlow ☆37 · Updated 3 years ago
- Explore tips and tricks to deploy machine learning models with Docker. ☆13 · Updated last year
- Learn Amazon SageMaker - Second Edition, published by Packt ☆51 · Updated last year
- Data Engineering with Spark and Delta Lake ☆86 · Updated last year
- PySpark functions and utilities with examples. Assists ETL process of data modeling ☆98 · Updated 3 years ago
- Automated Machine Learning on AWS, published by Packt ☆44 · Updated 8 months ago
- Simplify Big Data Analytics with Amazon EMR, published by Packt ☆14 · Updated last year
- This repo is mostly created for pyspark and hive related interview questions. ☆46 · Updated 2 years ago
- Simplifying Data Engineering and Analytics with Delta, published by Packt ☆20 · Updated last year