Big Data Modeling, MapReduce, Spark, PySpark @ Santa Clara University
☆165Dec 4, 2025Updated 5 months ago
Alternatives and similar repositories for big-data-mapreduce-course
Users who are interested in big-data-mapreduce-course are comparing it to the libraries listed below.
- O'Reilly Book: [Data Algorithms with Spark] by Mahmoud Parsian☆229Jun 26, 2023Updated 2 years ago
- MapReduce, Spark, Java, and Scala for Data Algorithms Book☆1,080Oct 14, 2024Updated last year
- PySpark Algorithms Book: https://www.amazon.com/dp/B07X4B2218/ref=sr_1_2☆89Jan 3, 2020Updated 6 years ago
- PySpark-Tutorial provides basic algorithms using PySpark☆1,273May 26, 2025Updated 11 months ago
- ☆14Sep 14, 2021Updated 4 years ago
- A collection of data analysis projects done using PySpark via Jupyter notebooks.☆10Oct 8, 2022Updated 3 years ago
- Apache Spark (PySpark) Practice on Real Data☆272Jan 31, 2020Updated 6 years ago
- Automated Trading Bot using GCP & TD Ameritrade☆14Dec 20, 2020Updated 5 years ago
- Examples for High Performance Spark☆16Oct 25, 2025Updated 6 months ago
- Programs with word vectors, RNN, NLP stuff, etc☆18Feb 28, 2017Updated 9 years ago
- A Scalable Data Cleaning Library for PySpark.☆29Apr 4, 2019Updated 7 years ago
- Kafka-Notes☆15Jun 20, 2021Updated 4 years ago
- This data project can be used as a take-home assignment to learn PySpark and Data Engineering.☆18Feb 19, 2023Updated 3 years ago
- Basic TensorFlow mechanics, operations, class definitions, and neural networks building. Examples from deeplearning.ai Tensorflow course …☆35Apr 12, 2019Updated 7 years ago
- Demonstrating the efficiency of pmdarima’s auto_arima() function compared to implementing a traditional ARIMA model.☆12Feb 16, 2021Updated 5 years ago
- Fundamentals of Spark with Python (using PySpark), code examples☆363Oct 29, 2022Updated 3 years ago
- 🐍 Quick reference guide to common patterns & functions in PySpark.☆673Feb 21, 2023Updated 3 years ago
- Thinkscripts to pull call and option volume☆18Apr 28, 2020Updated 6 years ago
- PySpark Cheat Sheet - example code to help you learn PySpark and develop apps faster☆493Oct 15, 2024Updated last year
- Jupyter notebooks for pyspark tutorials given at University☆110Jan 7, 2026Updated 4 months ago
- Apache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks☆1,664Mar 16, 2024Updated 2 years ago
- A Binder-compatible repo with a Dockerfile☆11Aug 18, 2017Updated 8 years ago
- Data-Intensive Text Processing with MapReduce☆628Mar 3, 2021Updated 5 years ago
- Model to accurately forecast inventory demand based on historical sales data.☆66Jul 6, 2016Updated 9 years ago
- LearningApacheSpark☆250Jan 3, 2024Updated 2 years ago
- Local Development of AWS Glue with Docker and Visual Studio Code☆14Nov 29, 2021Updated 4 years ago
- Examples for learning spark☆332Nov 9, 2015Updated 10 years ago
- User behavior analysis system☆12Dec 10, 2015Updated 10 years ago
- Real-world Spark pipelines examples☆82Feb 27, 2018Updated 8 years ago
- Materials and Jekyll website for the Wednesday software working group.☆10Feb 17, 2017Updated 9 years ago
- Mastering Numerical Computing with NumPy, published by Packt☆28Jan 24, 2023Updated 3 years ago
- Source code for Spark MLlib machine learning in practice☆10Oct 28, 2016Updated 9 years ago
- Structured Streaming Machine Learning example with Spark 2.0☆94Apr 24, 2017Updated 9 years ago
- Rasa Chatbot using Django backend and Sockets for communication☆12Dec 8, 2022Updated 3 years ago
- Run Samza as a Spring Boot application☆18Mar 6, 2017Updated 9 years ago
- Serialization from the C API for R☆13Jan 6, 2026Updated 4 months ago
- Materials for the "Advanced Scikit-learn" class in the afternoon☆165Dec 12, 2018Updated 7 years ago
- [UNMAINTAINED] Link prediction in complex networks based on PySpark and MySQL.☆23Jan 22, 2018Updated 8 years ago
- ecommerce GCP Streaming pipeline ― Cloud Storage, Compute Engine, Pub/Sub, Dataflow, Apache Beam, BigQuery and Tableau; GCP Batch pipelin…☆11Mar 9, 2022Updated 4 years ago