Big Data Modeling, MapReduce, Spark, PySpark @ Santa Clara University
☆166Dec 4, 2025Updated 3 months ago
Alternatives and similar repositories for big-data-mapreduce-course
Users that are interested in big-data-mapreduce-course are comparing it to the libraries listed below
Sorting:
- Examples for learning spark☆19Aug 19, 2015Updated 10 years ago
- O'Reilly Book: [Data Algorithms with Spark] by Mahmoud Parsian☆230Jun 26, 2023Updated 2 years ago
- PySpark Algorithms Book: https://www.amazon.com/dp/B07X4B2218/ref=sr_1_2☆88Jan 3, 2020Updated 6 years ago
- MapReduce, Spark, Java, and Scala for Data Algorithms Book☆1,083Oct 14, 2024Updated last year
- PySpark-Tutorial provides basic algorithms using PySpark☆1,272May 26, 2025Updated 9 months ago
- A collection of data analysis projects done using PySpark via Jupyter notebooks.☆10Oct 8, 2022Updated 3 years ago
- Source code for http://allaboutscala.com/scala-cheatsheet/☆11Jun 12, 2018Updated 7 years ago
- FlaskRestful + Swagger UI + Docker Compose + Unit Test | How to organize Python Code for REST API☆14Jun 5, 2022Updated 3 years ago
- Apache Spark (PySpark) Practice on Real Data☆273Jan 31, 2020Updated 6 years ago
- A Binder-compatibible repo with a Dockerfile☆11Aug 18, 2017Updated 8 years ago
- Basic TensorFlow mechanics, operations, class definitions, and neural networks building. Examples from deeplearning.ai Tensorflow course …☆35Apr 12, 2019Updated 6 years ago
- Demonstrating the efficiency of pmdarima’s auto_arima() function compared to implementing a traditional ARIMA model.☆12Feb 16, 2021Updated 5 years ago
- Hands-On-Big-Data-Modeling, Published by Packt☆33Jan 30, 2023Updated 3 years ago
- BS/MS/PhD Thesis Template with Latex for Amirkabir University of Technology (Tehran Polytechnic) - قالب پایاننامه لاتک دانشگاه صنعتی امی…☆16Jan 31, 2019Updated 7 years ago
- Git/Github Intro☆13Jun 17, 2015Updated 10 years ago
- Tool for computing continuous distributed representations of word. Modified to learn N-Grams☆15Jan 12, 2017Updated 9 years ago
- Jupyter Notebook showing how to process Telecom datasets using PySpark (SparkSQL and DataFrames) and plotting the results using Matplotli…☆17Dec 3, 2018Updated 7 years ago
- ETL (Extract, Transform and Load) with the Spark Python API (PySpark) and Hadoop Distributed File System (HDFS)☆17Dec 18, 2018Updated 7 years ago
- Analytics projects using Big Data eco-systems (Hadoop, Spark, Storm)☆17Dec 27, 2021Updated 4 years ago
- 本次新人赛是Datawhale与天池联合发起的0基础入门系列赛事第三场 —— 零基础入门NLP之新闻文本分类挑战赛。赛题以自然语言处理为背景,要求选手根据新闻文本字符对新闻的类别进行分类,这是一个经典文本分类问题。通过这道赛题可以引导大家走入自然语言处理的世界,带大家接触N…☆18Aug 22, 2020Updated 5 years ago
- Various useful data structures in Python☆39Nov 14, 2019Updated 6 years ago
- Jupyter notebooks for pyspark tutorials given at University☆110Jan 7, 2026Updated 2 months ago
- Examples for High Performance Spark☆16Oct 25, 2025Updated 4 months ago
- Fundamentals of Spark with Python (using PySpark), code examples☆362Oct 29, 2022Updated 3 years ago
- Programs with word vectors, RNN, NLP stuff, etc☆18Feb 28, 2017Updated 9 years ago
- ☆37Mar 31, 2017Updated 8 years ago
- Deep Learning Projects on TensorFlow and Keras☆20Jun 13, 2024Updated last year
- 🐍 Quick reference guide to common patterns & functions in PySpark.☆662Feb 21, 2023Updated 3 years ago
- General math scripts and important algorithms' implementation in Python 3☆22Feb 18, 2018Updated 8 years ago
- Apache Spark & Python (pySpark) tutorials for Big Data Analysis and Machine Learning as IPython / Jupyter notebooks☆1,663Mar 16, 2024Updated last year
- Data-Intensive Text Processing with MapReduce☆628Mar 3, 2021Updated 5 years ago
- NLP Utilities in Java☆43Dec 14, 2022Updated 3 years ago
- PySpark Cheat Sheet - example code to help you learn PySpark and develop apps faster☆490Oct 15, 2024Updated last year
- Section 3 of the Django + Angular + Ionic Course☆23Jul 22, 2018Updated 7 years ago
- Code repository for the "PySpark in Action" book☆214Jun 11, 2025Updated 8 months ago
- Machine Learning for Time-Series with Python.Published by Packt☆21Apr 24, 2024Updated last year
- Airflow Tutorials☆25Feb 28, 2021Updated 5 years ago
- Structured Streaming Machine Learning example with Spark 2.0☆94Apr 24, 2017Updated 8 years ago
- Eskimo is a state of the art Big Data Infrastructure and Management Web Console to build, manage and operate Big Data 2.0 Analytics clust…☆25Sep 14, 2023Updated 2 years ago