Big Data Modeling, MapReduce, Spark, PySpark @ Santa Clara University
☆166Dec 4, 2025Updated 3 months ago
Alternatives and similar repositories for big-data-mapreduce-course
Users that are interested in big-data-mapreduce-course are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- O'Reilly Book: [Data Algorithms with Spark] by Mahmoud Parsian☆230Jun 26, 2023Updated 2 years ago
- PySpark-Tutorial provides basic algorithms using PySpark☆1,275May 26, 2025Updated 10 months ago
- ☆14Sep 14, 2021Updated 4 years ago
- A collection of data analysis projects done using PySpark via Jupyter notebooks.☆10Oct 8, 2022Updated 3 years ago
- Apache Spark (PySpark) Practice on Real Data☆271Jan 31, 2020Updated 6 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Hands-On-Big-Data-Modeling, Published by Packt☆33Jan 30, 2023Updated 3 years ago
- Analytics projects using Big Data eco-systems (Hadoop, Spark, Storm)☆17Dec 27, 2021Updated 4 years ago
- Predicting the sales of Rossmann drug stores through machine learning.☆11Mar 27, 2018Updated 8 years ago
- Jupyter Notebook showing how to process Telecom datasets using PySpark (SparkSQL and DataFrames) and plotting the results using Matplotli…☆17Dec 3, 2018Updated 7 years ago
- Examples for High Performance Spark☆16Oct 25, 2025Updated 5 months ago
- A Reader on Data Visualization☆19Jun 10, 2019Updated 6 years ago
- Programs with word vectors, RNN, NLP stuff, etc☆18Feb 28, 2017Updated 9 years ago
- A Scalable Data Cleaning Library for PySpark.☆29Apr 4, 2019Updated 6 years ago
- Kafka-Notes☆15Jun 20, 2021Updated 4 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Fundamentals of Spark with Python (using PySpark), code examples☆364Oct 29, 2022Updated 3 years ago
- 🐍 Quick reference guide to common patterns & functions in PySpark.☆666Feb 21, 2023Updated 3 years ago
- Unsupervised Anomaly Detection via Deep Metric Learning with End-to-End Optimization☆12Mar 23, 2023Updated 3 years ago
- The project implemented some machine learning algorithms on spark which is written in scala and it also included standalone implementatio…☆16Jan 3, 2022Updated 4 years ago
- Git/Github Intro☆13Jun 17, 2015Updated 10 years ago
- PySpark Cheat Sheet - example code to help you learn PySpark and develop apps faster☆490Oct 15, 2024Updated last year
- Jupyter notebooks for pyspark tutorials given at University☆110Jan 7, 2026Updated 2 months ago
- a testimonials app for Django☆27Jun 19, 2021Updated 4 years ago
- ☆10Jun 28, 2015Updated 10 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Apache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks☆1,662Mar 16, 2024Updated 2 years ago
- Deploy Your First Deep Learning Model On Kubernetes With Python, Keras, Flask, and Docker 使用 Kubernetes 轻松部署深度学习模型☆11Oct 15, 2018Updated 7 years ago
- FlaskRestful + Swagger UI + Docker Compose + Unit Test | How to organize Python Code for REST API☆14Jun 5, 2022Updated 3 years ago
- LearningApacheSpark☆250Jan 3, 2024Updated 2 years ago
- Local Development of AWS Glue with Docker and Visual Studio Code☆14Nov 29, 2021Updated 4 years ago
- Source code for http://allaboutscala.com/scala-cheatsheet/☆11Jun 12, 2018Updated 7 years ago
- 用户行为分析系统☆12Dec 10, 2015Updated 10 years ago
- Real-world Spark pipelines examples☆83Feb 27, 2018Updated 8 years ago
- spark MLlib机器学习实践源码☆10Oct 28, 2016Updated 9 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Structured Streaming Machine Learning example with Spark 2.0☆94Apr 24, 2017Updated 8 years ago
- Deploy of Airflow 2.0 using ECS Fargate and AWS CDK.☆14Nov 5, 2021Updated 4 years ago
- Rasa Chatbot using Django backend and Sockets for communication☆12Dec 8, 2022Updated 3 years ago
- A Trino connector to access git repository contents☆18Feb 9, 2026Updated last month
- A Jenkins plugin that allows to deploy / stop Apache Spark applications in Spark standalone clusters.☆10Oct 25, 2015Updated 10 years ago
- Run Samza as a Spring Boot application☆18Mar 6, 2017Updated 9 years ago
- Distributed stock price forecasting system to predict S&P 500 stock prices.☆11Nov 12, 2021Updated 4 years ago