Big Data Modeling, MapReduce, Spark, PySpark @ Santa Clara University
☆165Dec 4, 2025Updated 5 months ago
Alternatives and similar repositories for big-data-mapreduce-course
Users that are interested in big-data-mapreduce-course are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- O'Reilly Book: [Data Algorithms with Spark] by Mahmoud Parsian☆231Jun 26, 2023Updated 2 years ago
- MapReduce, Spark, Java, and Scala for Data Algorithms Book☆1,081Oct 14, 2024Updated last year
- PySpark Algorithms Book: https://www.amazon.com/dp/B07X4B2218/ref=sr_1_2☆89Jan 3, 2020Updated 6 years ago
- PySpark-Tutorial provides basic algorithms using PySpark☆1,277May 26, 2025Updated last year
- ☆14Sep 14, 2021Updated 4 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A collection of data analysis projects done using PySpark via Jupyter notebooks.☆10Oct 8, 2022Updated 3 years ago
- Apache Spark (PySpark) Practice on Real Data☆272Jan 31, 2020Updated 6 years ago
- Analytics projects using Big Data eco-systems (Hadoop, Spark, Storm)☆17Dec 27, 2021Updated 4 years ago
- Hands-On-Big-Data-Modeling, Published by Packt☆33Jan 30, 2023Updated 3 years ago
- This repository focuses on providing interview scenario questions that I have encountered during interviews. The questions are designed t…☆52Feb 11, 2025Updated last year
- ETL (Extract, Transform and Load) with the Spark Python API (PySpark) and Hadoop Distributed File System (HDFS)☆17Dec 18, 2018Updated 7 years ago
- Examples for High Performance Spark☆16Oct 25, 2025Updated 7 months ago
- Programs with word vectors, RNN, NLP stuff, etc☆18Feb 28, 2017Updated 9 years ago
- A Scalable Data Cleaning Library for PySpark.☆29Apr 4, 2019Updated 7 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Kafka-Notes☆15Jun 20, 2021Updated 4 years ago
- Basic TensorFlow mechanics, operations, class definitions, and neural networks building. Examples from deeplearning.ai Tensorflow course …☆35Apr 12, 2019Updated 7 years ago
- Example of implementing Isolation Forest in Python☆26Jul 3, 2018Updated 7 years ago
- Fundamentals of Spark with Python (using PySpark), code examples☆363Oct 29, 2022Updated 3 years ago
- 🐍 Quick reference guide to common patterns & functions in PySpark.☆679Feb 21, 2023Updated 3 years ago
- PySpark Cheat Sheet - example code to help you learn PySpark and develop apps faster☆494Oct 15, 2024Updated last year
- Git/Github Intro☆13Jun 17, 2015Updated 10 years ago
- Jupyter notebooks for pyspark tutorials given at University☆110Jan 7, 2026Updated 4 months ago
- Automatically exported from code.google.com/p/password-mixer☆29May 4, 2015Updated 11 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆10Jun 28, 2015Updated 10 years ago
- Hive,Pig,Hbase,Sqoop examples☆15Apr 24, 2017Updated 9 years ago
- Apache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks☆1,663Mar 16, 2024Updated 2 years ago
- Deploy Your First Deep Learning Model On Kubernetes With Python, Keras, Flask, and Docker 使用 Kubernetes 轻松部署深度学习模型☆11Oct 15, 2018Updated 7 years ago
- A Binder-compatibible repo with a Dockerfile☆11Aug 18, 2017Updated 8 years ago
- Data-Intensive Text Processing with MapReduce☆629Mar 3, 2021Updated 5 years ago
- Model to accurately forecast inventory demand based on historical sales data.☆67Jul 6, 2016Updated 9 years ago
- FlaskRestful + Swagger UI + Docker Compose + Unit Test | How to organize Python Code for REST API☆14Jun 5, 2022Updated 3 years ago
- LearningApacheSpark☆250Jan 3, 2024Updated 2 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- In this work, we compared the predictive capabilities of six different machine learning algorithms - linear regression, random forest, ex…☆16Sep 21, 2020Updated 5 years ago
- Easier way to use reflect to set and get values in Go☆14Jul 24, 2024Updated last year
- Examples for learning spark☆332Nov 9, 2015Updated 10 years ago
- Real-world Spark pipelines examples☆82Feb 27, 2018Updated 8 years ago
- Materials and Jekyll website for the Wednesday software working group.☆10Feb 17, 2017Updated 9 years ago
- spark MLlib机器学习实践源码☆10Oct 28, 2016Updated 9 years ago
- Structured Streaming Machine Learning example with Spark 2.0☆95Apr 24, 2017Updated 9 years ago