Gigitsu / Big-Data-Analysis-with-Scala-and-SparkLinks
Homeworks repository for the Big Data Analysis with Scala and Spark Coursera course
☆15Updated last year
Alternatives and similar repositories for Big-Data-Analysis-with-Scala-and-Spark
Users that are interested in Big-Data-Analysis-with-Scala-and-Spark are comparing it to the libraries listed below
Sorting:
- PySpark-ETL☆22Updated 6 years ago
- Learning PySpark video series☆11Updated 7 years ago
- Repository for medium article☆21Updated 2 years ago
- ☆152Updated 7 years ago
- Reference code base for ML Engineering, Manning Publications☆135Updated 4 years ago
- Jupyter notebooks for pyspark tutorials given at University☆110Updated 3 weeks ago
- ☆88Updated 3 years ago
- This repository contains code for Spark Streaming☆26Updated 4 years ago
- Code repository for the "PySpark in Action" book☆211Updated 7 months ago
- A repository for a PySpark Cookbook by Tomasz Drabas and Denny Lee☆61Updated 7 years ago
- This is repository of my YouTube Course on End to End Apache Spark in AIEngineering YouTube Channel☆188Updated 4 years ago
- ☆79Updated this week
- Spark Databricks Notebooks☆14Updated 5 years ago
- Frank Kane's Taming Big Data with Apache Spark and Python, published by Packt☆125Updated 3 years ago
- Apache Spark 3 - Structured Streaming Course Material☆126Updated 2 years ago
- ☆39Updated 2 years ago
- [Course-2020-2023] taught at Duke MIDS. This is also a Coursera Course that covers MLOps, ML Engineering and the foundations of Cloud Co…☆144Updated last year
- Essential PySpark for Scalable Data Analytics, published by Packt☆46Updated 3 years ago
- Data Engineering with Spark and Delta Lake☆106Updated 3 years ago
- Data models, build data warehouses and data lakes, automate data pipelines, and worked with massive datasets.☆13Updated 6 years ago
- ☆66Updated 3 years ago
- Notebook on finding fraud in credit card transactions☆14Updated 6 years ago
- Contains source files used in the Spark with Python course☆18Updated 6 years ago
- Deploy Flask Machine Learning Application on Azure App Services☆116Updated last year
- Partly lecture and partly a hands-on tutorial and workshop, this is a three part series on how to get started with MLflow. In this four p…☆39Updated 4 years ago
- PySpark Cheatsheet☆36Updated 3 years ago
- ⭕️ Data Engineering for Data Scientists☆78Updated 2 years ago
- GitHub repository related to the course Mastering Elastic Map Reduce for Data Engineers☆24Updated 3 years ago
- Mastering Big Data Analytics with PySpark, Published by Packt☆165Updated last year
- Some recipes for doing with serverless technologies☆39Updated last year