Essential PySpark for Scalable Data Analytics, published by Packt
☆46Mar 2, 2026Updated 3 weeks ago
Alternatives and similar repositories for Essential-PySpark-for-Scalable-Data-Analytics
Users that are interested in Essential-PySpark-for-Scalable-Data-Analytics are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Simplify Big Data Analytics with Amazon EMR, published by Packt☆13Jan 18, 2023Updated 3 years ago
- Data Statistics with Full Stack Python, published by Packt☆11Jan 30, 2023Updated 3 years ago
- Python Advanced Predictive Analytics, by Packt☆12Jan 30, 2023Updated 3 years ago
- ☆15Dec 15, 2025Updated 3 months ago
- Serverless ETL and Analytics with AWS Glue, published by Packt☆53Mar 2, 2026Updated 3 weeks ago
- The Regularization Cookbook, published by Packt☆16Mar 2, 2026Updated 3 weeks ago
- Analytics of Movielens dataset (100k) along with recomendation based on the user preference☆13Apr 9, 2017Updated 8 years ago
- Apache Spark in 7 Days [Video], by Packt Publishing☆18Jan 30, 2023Updated 3 years ago
- PySpark for Beginners by Packt Pyblishing☆15Jan 30, 2023Updated 3 years ago
- Code Repository for Mastering Unsupervised learning with Python, Published by Packt☆27Feb 15, 2023Updated 3 years ago
- Mastering Big Data Analytics with PySpark, Published by Packt☆166Aug 20, 2024Updated last year
- Machine Learning Engineering on AWS, published by Packt☆73Mar 2, 2026Updated 3 weeks ago
- ☆20Dec 15, 2025Updated 3 months ago
- Automated Machine Learning with Auto-Keras, Published by Packt☆40Jan 30, 2023Updated 3 years ago
- Training Systems Using Python Statistical Modeling, Published by Packt☆20Jan 30, 2023Updated 3 years ago
- Generate descriptions of Snowflake tables and views with LLMs☆26May 22, 2025Updated 10 months ago
- Feature Store for Machine Learning, published by Packt☆13Mar 2, 2026Updated 3 weeks ago
- Mastering Pandas Second Edition, published by Packt☆27Jan 30, 2023Updated 3 years ago
- Source Code for 'Applied Data Science Using PySpark' by Ramcharan Kakarla, Sundar Krishnan, and Sridhar Alla☆48May 18, 2021Updated 4 years ago
- ☆18May 11, 2023Updated 2 years ago
- Building Statistical Models in Python, Published by Packt☆37Mar 2, 2026Updated 3 weeks ago
- Example for ETL process with R, Docker, and Github Actions (WIP...).☆25Oct 1, 2022Updated 3 years ago
- Learning PySpark video series☆11Mar 5, 2018Updated 8 years ago
- The Statistics and Machine Learning with R Workshop, published by Packt☆13Mar 2, 2026Updated 3 weeks ago
- Medication Extraction and Reconciliation Knowledge Instrument☆13Dec 5, 2013Updated 12 years ago
- Interpretable, intuitive outlier detector intended for categorical and numeric data.☆12Jun 19, 2024Updated last year
- Data Analysis with IBM SPSS Statistics, published by Packt☆17Jan 30, 2023Updated 3 years ago
- This is the repository containing machine learning and deep learning projects, as well as some presentation slides on these topics.☆11May 20, 2024Updated last year
- Serverless Analytics with Amazon Athena, published by packt☆26Mar 2, 2026Updated 3 weeks ago
- Practical Data Wrangling, published by Packt☆18Jan 30, 2023Updated 3 years ago
- ☆15Oct 19, 2023Updated 2 years ago
- The Art of Data-Driven Business Decisions, published by Packt☆19Mar 2, 2026Updated 3 weeks ago
- Source Code for 'Advanced R Statistical Programming and Data Models' by Matt Wiley and Joshua F. Wiley☆23Aug 7, 2019Updated 6 years ago
- ☆11Mar 27, 2024Updated last year
- ☆27Feb 14, 2026Updated last month
- Machine Learning Engineering with MLflow, published by Packt☆123Mar 2, 2026Updated 3 weeks ago
- 10 Machine Learning Blueprints You Should Know for Cybersecurity, published by Packt☆30Mar 2, 2026Updated 3 weeks ago
- ☕⛵WIP PySpark dependency management☆22Jul 8, 2018Updated 7 years ago
- Data Wrangling with R, Published by Packt☆17May 3, 2023Updated 2 years ago