mahmoudparsian / big-data-mapreduce-courseView external linksLinks
Big Data Modeling, MapReduce, Spark, PySpark @ Santa Clara University
☆166Dec 4, 2025Updated 2 months ago
Alternatives and similar repositories for big-data-mapreduce-course
Users that are interested in big-data-mapreduce-course are comparing it to the libraries listed below
Sorting:
- Examples for learning spark☆19Aug 19, 2015Updated 10 years ago
- Machine Learning Course @ Santa Clara University☆24Jun 10, 2020Updated 5 years ago
- O'Reilly Book: [Data Algorithms with Spark] by Mahmoud Parsian☆228Jun 26, 2023Updated 2 years ago
- PySpark Algorithms Book: https://www.amazon.com/dp/B07X4B2218/ref=sr_1_2☆88Jan 3, 2020Updated 6 years ago
- MapReduce, Spark, Java, and Scala for Data Algorithms Book☆1,084Oct 14, 2024Updated last year
- PySpark-Tutorial provides basic algorithms using PySpark☆1,273May 26, 2025Updated 8 months ago
- ☆14Sep 14, 2021Updated 4 years ago
- A collection of data analysis projects done using PySpark via Jupyter notebooks.☆10Oct 8, 2022Updated 3 years ago
- FlaskRestful + Swagger UI + Docker Compose + Unit Test | How to organize Python Code for REST API☆14Jun 5, 2022Updated 3 years ago
- Web server written in c++☆18Jan 14, 2013Updated 13 years ago
- Basic TensorFlow mechanics, operations, class definitions, and neural networks building. Examples from deeplearning.ai Tensorflow course …☆35Apr 12, 2019Updated 6 years ago
- Demonstrating the efficiency of pmdarima’s auto_arima() function compared to implementing a traditional ARIMA model.☆12Feb 16, 2021Updated 5 years ago
- Jupyter Notebook showing how to process Telecom datasets using PySpark (SparkSQL and DataFrames) and plotting the results using Matplotli…☆16Dec 3, 2018Updated 7 years ago
- Hands-On-Big-Data-Modeling, Published by Packt☆33Jan 30, 2023Updated 3 years ago
- BS/MS/PhD Thesis Template with Latex for Amirkabir University of Technology (Tehran Polytechnic) - قالب پایاننامه لاتک دانشگاه صنعتی امی…☆16Jan 31, 2019Updated 7 years ago
- Git/Github Intro☆13Jun 17, 2015Updated 10 years ago
- Analytics projects using Big Data eco-systems (Hadoop, Spark, Storm)☆17Dec 27, 2021Updated 4 years ago
- ETL (Extract, Transform and Load) with the Spark Python API (PySpark) and Hadoop Distributed File System (HDFS)☆17Dec 18, 2018Updated 7 years ago
- Various useful data structures in Python☆39Nov 14, 2019Updated 6 years ago
- Jupyter notebooks for pyspark tutorials given at University☆110Jan 7, 2026Updated last month
- Examples for High Performance Spark☆16Oct 25, 2025Updated 3 months ago
- Fundamentals of Spark with Python (using PySpark), code examples☆362Oct 29, 2022Updated 3 years ago
- 🐍 Quick reference guide to common patterns & functions in PySpark.☆652Feb 21, 2023Updated 2 years ago
- A spreadsheet engine implemented in Python.☆19Aug 24, 2024Updated last year
- General math scripts and important algorithms' implementation in Python 3☆22Feb 18, 2018Updated 7 years ago
- Apache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks☆1,667Mar 16, 2024Updated last year
- PySpark Cheat Sheet - example code to help you learn PySpark and develop apps faster☆488Oct 15, 2024Updated last year
- Contains relevant notebooks for the hands-on NLP workshop for the GIDS AIML Conference -2020 Edition☆23May 3, 2021Updated 4 years ago
- This repository focuses on providing interview scenario questions that I have encountered during interviews. The questions are designed t…☆46Feb 11, 2025Updated last year
- Airflow Tutorials☆25Feb 28, 2021Updated 4 years ago
- Spring MVC examples from my blog - see http://geowarin.wordpress.com/category/spring-mvc-examples☆50May 10, 2013Updated 12 years ago
- This RESTFul API consumes data from a Wellsite Information Transfer Standard Markup Language (WITSML) server and provides responses in fo…☆11Jul 31, 2022Updated 3 years ago
- Structured Streaming Machine Learning example with Spark 2.0☆94Apr 24, 2017Updated 8 years ago
- Labs and data files for a full-day Spark workshop☆24May 24, 2025Updated 8 months ago
- ☆59Dec 11, 2021Updated 4 years ago
- ☆20Mar 12, 2023Updated 2 years ago
- dbt (data build tool) projects targeting AWS analytics services (redshift, glue, emr, athena) and open table formats☆31Apr 13, 2023Updated 2 years ago
- ☆27Aug 8, 2024Updated last year
- LearningApacheSpark☆250Jan 3, 2024Updated 2 years ago