Gigitsu / Big-Data-Analysis-with-Scala-and-SparkLinks
Homeworks repository for the Big Data Analysis with Scala and Spark Coursera course
☆15Updated last year
Alternatives and similar repositories for Big-Data-Analysis-with-Scala-and-Spark
Users that are interested in Big-Data-Analysis-with-Scala-and-Spark are comparing it to the libraries listed below
Sorting:
- PySpark-ETL☆22Updated 5 years ago
- Learning PySpark video series☆11Updated 7 years ago
- ☆11Updated last year
- ☆13Updated 5 years ago
- Contains source files used in the Spark with Python course☆18Updated 6 years ago
- Selección de predictores mediante algoritmo genético python☆12Updated 6 years ago
- ☆14Updated 5 years ago
- Apache Spark 3 - Structured Streaming Course Material☆125Updated 2 years ago
- Spark Databricks Notebooks☆14Updated 4 years ago
- ☆39Updated 2 years ago
- Hands-On Data Analysis with Scala, published by Packt☆20Updated 2 years ago
- ☆88Updated 3 years ago
- Repository for medium article☆21Updated last year
- ☆37Updated 5 years ago
- Apache Spark 3 for Data Engineering and Analytics with Python , By Packt publishing☆24Updated 2 years ago
- Building Big Data Pipelines with Apache Beam, published by Packt☆87Updated 2 years ago
- Essential PySpark for Scalable Data Analytics, published by Packt☆45Updated 2 years ago
- PySpark functions and utilities with examples. Assists ETL process of data modeling☆104Updated 4 years ago
- It's a Github Repo to get an understanding on various pre-processing steps required in Machine Learning before we build Machine Learning …☆28Updated 6 years ago
- [Course-2020-2023] taught at Duke MIDS. This is also a Coursera Course that covers MLOps, ML Engineering and the foundations of Cloud Co…☆141Updated 10 months ago
- A repository for a PySpark Cookbook by Tomasz Drabas and Denny Lee☆60Updated 7 years ago
- A Placeholder of all the scripts & notebooks which I'll be using for GA Sessions☆29Updated 6 years ago
- Data Engineering with Spark and Delta Lake☆105Updated 2 years ago
- Frank Kane's Taming Big Data with Apache Spark and Python, published by Packt☆124Updated 2 years ago
- Notebook on finding fraud in credit card transactions☆14Updated 6 years ago
- Source Code for 'Applied Data Science Using PySpark' by Ramcharan Kakarla, Sundar Krishnan, and Sridhar Alla☆48Updated 4 years ago
- A series of Jupyter notebooks that walk you through Machine Learning with Apache Spark ecosystem using Spark MLlib, PyTorch and TensorFlo…☆86Updated 2 years ago
- Spark in Action, 2nd edition - chapter 2☆29Updated 2 years ago
- Spark in Action, 2nd edition - chapter 1 - Introduction☆107Updated 2 years ago
- Code Repository for AWS Certified Big Data Specialty 2019 - In Depth and Hands On!, published by Packt☆43Updated 2 years ago