Gigitsu / Big-Data-Analysis-with-Scala-and-Spark
Homeworks repository for the Big Data Analysis with Scala and Spark Coursera course
☆15Updated 7 months ago
Alternatives and similar repositories for Big-Data-Analysis-with-Scala-and-Spark:
Users that are interested in Big-Data-Analysis-with-Scala-and-Spark are comparing it to the libraries listed below
- PySpark-ETL☆23Updated 5 years ago
- Learning PySpark video series☆11Updated 6 years ago
- A repository for a PySpark Cookbook by Tomasz Drabas and Denny Lee☆60Updated 6 years ago
- Simplifying Data Engineering and Analytics with Delta, published by Packt☆21Updated last year
- ☆9Updated 3 years ago
- A series of Jupyter notebooks that walk you through Machine Learning with Apache Spark ecosystem using Spark MLlib, PyTorch and TensorFlo…☆81Updated last year
- Essential PySpark for Scalable Data Analytics, published by Packt☆43Updated 2 years ago
- ☆23Updated 2 years ago
- Scaling Machine Learning in Three Week course in a collaboration with O'Reilly following the guidance of Adi Polak's book - Scaling Machi…☆23Updated last year
- Because its never late to start taking notes and 'public' it...☆60Updated 3 months ago
- Partly lecture and partly a hands-on tutorial and workshop, this is a three part series on how to get started with MLflow. In this four p…☆34Updated 4 years ago
- For Udemy students: the official repository of Rock the JVM's Spark Streaming course☆26Updated 2 years ago
- GitHub repository related to the course Mastering Elastic Map Reduce for Data Engineers☆25Updated 2 years ago
- Batch Processing , orchestration using Apache Airflow and Google Workflows, spark structured Streaming and a lot more☆19Updated 2 years ago
- Databricks ML in Action, Published by Packt☆27Updated 9 months ago
- Optimizing Databricks Workload, published by Packt☆16Updated 2 years ago
- ☆19Updated 6 years ago
- ☆114Updated 4 years ago
- Using Polars and Pandas on AWS Lambda to process data.☆9Updated last year
- Nested Data (JSON/AVRO/XML) Parsing and Flattening in Spark☆16Updated last year
- Building Big Data Pipelines with Apache Beam, published by Packt☆86Updated last year
- The official repository for the Rock the JVM Spark Optimization with Scala course☆57Updated last year
- ☆20Updated last year
- An example MLFlow project☆48Updated last month
- Data Engineering with Spark and Delta Lake☆95Updated 2 years ago
- The official repository for the Rock the JVM Spark Optimization 2 course☆38Updated last year
- A Python PySpark Projet with Poetry☆22Updated 5 months ago
- This repository contains code for Spark Streaming☆21Updated 3 years ago
- Databricks Certified Associate Spark Developer preparation toolkit to setup single node Standalone Spark Cluster along with material in t…☆28Updated 10 months ago
- PySpark Cheatsheet☆36Updated 2 years ago