O'Reilly Book: [Data Algorithms with Spark] by Mahmoud Parsian
☆230Jun 26, 2023Updated 2 years ago
Alternatives and similar repositories for data-algorithms-with-spark
Users that are interested in data-algorithms-with-spark are comparing it to the libraries listed below
Sorting:
- Machine Learning Course @ Santa Clara University☆24Jun 10, 2020Updated 5 years ago
- Big Data Modeling, MapReduce, Spark, PySpark @ Santa Clara University☆166Dec 4, 2025Updated 3 months ago
- PySpark-Tutorial provides basic algorithms using PySpark☆1,272May 26, 2025Updated 9 months ago
- Haskell Cookbook, published by Packt☆25Jan 18, 2023Updated 3 years ago
- Code repository for the "PySpark in Action" book☆214Jun 11, 2025Updated 9 months ago
- Python extension pack for Anaconda☆22Oct 10, 2018Updated 7 years ago
- 🐍 Quick reference guide to common patterns & functions in PySpark.☆664Feb 21, 2023Updated 3 years ago
- Amazon EMR Notebook to show how to read from and write to Delta tables with Amazon EMR☆17Apr 27, 2025Updated 10 months ago
- PySpark Cheat Sheet - example code to help you learn PySpark and develop apps faster☆489Oct 15, 2024Updated last year
- Anaconda plugin for StarCluster☆21Aug 14, 2024Updated last year
- FlaskRestful + Swagger UI + Docker Compose + Unit Test | How to organize Python Code for REST API☆14Jun 5, 2022Updated 3 years ago
- Talks from the UW Python for Geosciences Seminar☆12Mar 1, 2016Updated 10 years ago
- ☆10Oct 3, 2022Updated 3 years ago
- ☆12Jun 23, 2016Updated 9 years ago
- PySpark functions and utilities with examples. Assists ETL process of data modeling☆104Dec 3, 2020Updated 5 years ago
- Pyspark RDD, DataFrame and Dataset Examples in Python language☆1,345Dec 7, 2025Updated 3 months ago
- Implementing best practices for PySpark ETL jobs and applications.☆2,081Jan 1, 2023Updated 3 years ago
- Mobile robot data were analyzed with Apache-Spark to extract five different statistical result such as travel time, waiting time, average…☆15Apr 5, 2022Updated 3 years ago
- Python library for deploying models built using Python to Alteryx Promote.☆15Dec 10, 2021Updated 4 years ago
- 极速VPN☆21Aug 6, 2020Updated 5 years ago
- DEEP BERLIN AI for Good Hackathon 2020☆14Apr 21, 2020Updated 5 years ago
- ☆12Jul 12, 2021Updated 4 years ago
- Experimental plugin for scikit-learn to be able to run (some estimators) on Intel GPUs via numba-dpex.☆16Feb 28, 2024Updated 2 years ago
- A wrapper around SageMaker ML Lineage Tracking extending ML Lineage to end-to-end ML lifecycles, including additional capabilities around…☆16Oct 14, 2021Updated 4 years ago
- Hands-On Chatbot Development with Alexa Skills and Amazon Lex, published by Packt☆15Jan 30, 2023Updated 3 years ago
- Source Code for 'Practical Haskell, 3rd Edition' by Alejandro Serrano Mena☆13Oct 11, 2022Updated 3 years ago
- Web scraped Data Science Interview Questions from Towards Data Science/ Medium.com asked by FAANG/Top Product based companies in last 4-5…☆16Jan 16, 2023Updated 3 years ago
- Git/Github Intro☆13Jun 17, 2015Updated 10 years ago
- repo with resources from Understanding Data with Alex Merced videos☆14Jan 20, 2024Updated 2 years ago
- Introduction to Dask for PyTorch Workflows☆13Mar 3, 2021Updated 5 years ago
- 👋 Project done for @hankified https://helloish.com☆16Oct 18, 2019Updated 6 years ago
- pytest plugin that checks URLs☆18May 16, 2024Updated last year
- MapReduce, Spark, Java, and Scala for Data Algorithms Book☆1,083Oct 14, 2024Updated last year
- The source code for the book Modern Data Engineering with Apache Spark☆39Jul 26, 2022Updated 3 years ago
- ☆16Jul 5, 2021Updated 4 years ago
- TriScale software☆14Apr 23, 2024Updated last year
- Various useful data structures in Python☆39Nov 14, 2019Updated 6 years ago
- Jupyter notebooks for pyspark tutorials given at University☆110Jan 7, 2026Updated 2 months ago
- Samples and documentation for various advertising and marketing use cases on AWS.☆36May 23, 2023Updated 2 years ago