Essential PySpark for Scalable Data Analytics, published by Packt
☆46Jan 30, 2023Updated 3 years ago
Alternatives and similar repositories for Essential-PySpark-for-Scalable-Data-Analytics
Users that are interested in Essential-PySpark-for-Scalable-Data-Analytics are comparing it to the libraries listed below
Sorting:
- Simplify Big Data Analytics with Amazon EMR, published by Packt☆13Jan 18, 2023Updated 3 years ago
- ☆14Dec 15, 2025Updated 2 months ago
- Serverless ETL and Analytics with AWS Glue, published by Packt☆53Oct 2, 2023Updated 2 years ago
- Python Advanced Predictive Analytics, by Packt☆12Jan 30, 2023Updated 3 years ago
- Data Statistics with Full Stack Python, published by Packt☆11Jan 30, 2023Updated 3 years ago
- The Regularization Cookbook, published by Packt☆16Feb 5, 2026Updated 3 weeks ago
- Feature Store for Machine Learning, published by Packt☆13Feb 5, 2026Updated 3 weeks ago
- Code Repository for Mastering Unsupervised learning with Python, Published by Packt☆27Feb 15, 2023Updated 3 years ago
- [Intemarché] Sales forecasting challenge☆11Jun 23, 2021Updated 4 years ago
- Analytics of Movielens dataset (100k) along with recomendation based on the user preference☆13Apr 9, 2017Updated 8 years ago
- The Statistics and Machine Learning with R Workshop, published by Packt☆11Feb 5, 2026Updated 3 weeks ago
- Mastering Big Data Analytics with PySpark, Published by Packt☆166Aug 20, 2024Updated last year
- ☆18May 11, 2023Updated 2 years ago
- Automated Machine Learning with Auto-Keras, Published by Packt☆40Jan 30, 2023Updated 3 years ago
- Machine Learning Engineering on AWS, published by Packt☆72Feb 5, 2026Updated 3 weeks ago
- ☆20Dec 15, 2025Updated 2 months ago
- Learn GO by Building Three Simple Golang Projects, published by Packt☆18Jan 18, 2023Updated 3 years ago
- Machine Learning Techniques for Text, Published by Packt☆39Feb 5, 2026Updated 3 weeks ago
- Apache Spark in 7 Days [Video], by Packt Publishing☆18Jan 30, 2023Updated 3 years ago
- PySpark for Beginners by Packt Pyblishing☆15Jan 30, 2023Updated 3 years ago
- Machine Learning Engineering with MLflow, published by Packt☆123Feb 5, 2026Updated 3 weeks ago
- Source Code for 'Applied Data Science Using PySpark' by Ramcharan Kakarla, Sundar Krishnan, and Sridhar Alla☆48May 18, 2021Updated 4 years ago
- The Art of Data-Driven Business Decisions, published by Packt☆18Feb 5, 2026Updated 3 weeks ago
- Machine Learning Automation with TPOT, published by Packt☆23Jan 18, 2023Updated 3 years ago
- Serverless Analytics with Amazon Athena, published by packt☆26Feb 5, 2026Updated 3 weeks ago
- Source Code for 'Advanced R Statistical Programming and Data Models' by Matt Wiley and Joshua F. Wiley☆23Aug 7, 2019Updated 6 years ago
- Generate descriptions of Snowflake tables and views with LLMs☆26May 22, 2025Updated 9 months ago
- Training Systems Using Python Statistical Modeling, Published by Packt☆20Jan 30, 2023Updated 3 years ago
- Simplifying Data Engineering and Analytics with Delta, published by Packt☆21Sep 20, 2023Updated 2 years ago
- Computer Vision Theory and Projects in Python for Beginners, by Packt Publishing☆27Dec 15, 2025Updated 2 months ago
- Mastering Pandas Second Edition, published by Packt☆27Jan 30, 2023Updated 3 years ago
- Microsoft Certified: Azure Data Scientist Associate Certification Guide, published by Packt☆60Feb 5, 2026Updated 3 weeks ago
- 10 Machine Learning Blueprints You Should Know for Cybersecurity, published by Packt☆30Feb 5, 2026Updated 3 weeks ago
- Example for ETL process with R, Docker, and Github Actions (WIP...).☆25Oct 1, 2022Updated 3 years ago
- ☆28Feb 14, 2026Updated 2 weeks ago
- This is a custom project for WGU, the original project repo is https://github.com/udacity/nd0821-c2-build-model-workflow-starter☆12Feb 1, 2026Updated last month
- Polars IO plugin to read SAS (sas7bdat), Stata (dta), and SPSS (sav) files☆22Updated this week
- Apache Spark Deep Learning Cookbook, published by Packt☆37Jan 30, 2023Updated 3 years ago
- Building Statistical Models in Python, Published by Packt☆37Feb 5, 2026Updated 3 weeks ago