Big Data Modeling, MapReduce, Spark, PySpark @ Santa Clara University
☆166Dec 4, 2025Updated 4 months ago
Alternatives and similar repositories for big-data-mapreduce-course
Users that are interested in big-data-mapreduce-course are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Examples for learning spark☆19Aug 19, 2015Updated 10 years ago
- MapReduce, Spark, Java, and Scala for Data Algorithms Book☆1,081Oct 14, 2024Updated last year
- PySpark-Tutorial provides basic algorithms using PySpark☆1,274May 26, 2025Updated 10 months ago
- A collection of data analysis projects done using PySpark via Jupyter notebooks.☆10Oct 8, 2022Updated 3 years ago
- Apache Spark (PySpark) Practice on Real Data☆271Jan 31, 2020Updated 6 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Analytics projects using Big Data eco-systems (Hadoop, Spark, Storm)☆17Dec 27, 2021Updated 4 years ago
- This repository focuses on providing interview scenario questions that I have encountered during interviews. The questions are designed t…☆50Feb 11, 2025Updated last year
- Jupyter Notebook showing how to process Telecom datasets using PySpark (SparkSQL and DataFrames) and plotting the results using Matplotli…☆17Dec 3, 2018Updated 7 years ago
- ETL (Extract, Transform and Load) with the Spark Python API (PySpark) and Hadoop Distributed File System (HDFS)☆17Dec 18, 2018Updated 7 years ago
- Examples for High Performance Spark☆16Oct 25, 2025Updated 5 months ago
- A Scalable Data Cleaning Library for PySpark.☆29Apr 4, 2019Updated 7 years ago
- ☆20Oct 15, 2021Updated 4 years ago
- This data project can be used as a take-home assignment to learn Pyspark and Data Engineering.☆18Feb 19, 2023Updated 3 years ago
- Basic TensorFlow mechanics, operations, class definitions, and neural networks building. Examples from deeplearning.ai Tensorflow course …☆35Apr 12, 2019Updated 7 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Fundamentals of Spark with Python (using PySpark), code examples☆363Oct 29, 2022Updated 3 years ago
- 🐍 Quick reference guide to common patterns & functions in PySpark.☆667Feb 21, 2023Updated 3 years ago
- The project implemented some machine learning algorithms on spark which is written in scala and it also included standalone implementatio…☆16Jan 3, 2022Updated 4 years ago
- PySpark Cheat Sheet - example code to help you learn PySpark and develop apps faster☆491Oct 15, 2024Updated last year
- Minimal local renderer for Helm template YAML, written in Python.☆12Oct 29, 2024Updated last year
- ☆10Jun 28, 2015Updated 10 years ago
- Apache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks☆1,663Mar 16, 2024Updated 2 years ago
- Web server written in c++☆18Jan 14, 2013Updated 13 years ago
- Data-Intensive Text Processing with MapReduce☆628Mar 3, 2021Updated 5 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Model to accurately forecast inventory demand based on historical sales data.☆66Jul 6, 2016Updated 9 years ago
- LearningApacheSpark☆250Jan 3, 2024Updated 2 years ago
- In this work, we compared the predictive capabilities of six different machine learning algorithms - linear regression, random forest, ex…☆16Sep 21, 2020Updated 5 years ago
- Local Development of AWS Glue with Docker and Visual Studio Code☆14Nov 29, 2021Updated 4 years ago
- Source code for http://allaboutscala.com/scala-cheatsheet/☆11Jun 12, 2018Updated 7 years ago
- Examples for learning spark☆332Nov 9, 2015Updated 10 years ago
- ☆17May 31, 2017Updated 8 years ago
- Rasa Chatbot using Django backend and Sockets for communication☆12Dec 8, 2022Updated 3 years ago
- A Trino connector to access git repository contents☆18Feb 9, 2026Updated 2 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- A Jenkins plugin that allows to deploy / stop Apache Spark applications in Spark standalone clusters.☆10Oct 25, 2015Updated 10 years ago
- Advanced Analytics with Spark (Spark高级数据分析) 书中的Scala代码,此项目中将其也转换为Java代码☆11Jun 20, 2017Updated 8 years ago
- ☆13Nov 2, 2015Updated 10 years ago
- Spatial Analysis and Data Extraction☆22May 31, 2018Updated 7 years ago
- Serialization from the C API for R☆13Jan 6, 2026Updated 3 months ago
- Distributed stock price forecasting system to predict S&P 500 stock prices.☆11Nov 12, 2021Updated 4 years ago
- Code examples supporting the "Introduction to Apache Spark" video published by O'Reilly Media☆37Jul 1, 2022Updated 3 years ago