mahmoudparsian / pyspark-algorithms
PySpark Algorithms Book: https://www.amazon.com/dp/B07X4B2218/ref=sr_1_2
☆88 · Jan 3, 2020 · Updated 6 years ago
Alternatives and similar repositories for pyspark-algorithms
Users who are interested in pyspark-algorithms are comparing it to the libraries listed below.
- Machine Learning Course @ Santa Clara University ☆24 · Jun 10, 2020 · Updated 5 years ago
- Big Data Modeling, MapReduce, Spark, PySpark @ Santa Clara University ☆166 · Dec 4, 2025 · Updated 2 months ago
- Library for conducting propensity matching at Spark scale ☆14 · Jun 27, 2023 · Updated 2 years ago
- My finite volume method project. Here I will implement the many pieces of a finite volume method to incorporate into a larger code. ☆11 · Aug 15, 2019 · Updated 6 years ago
- Generate PMML for various machine learning and statistical models. ☆19 · Mar 8, 2022 · Updated 3 years ago
- GlotEval: a unified evaluation toolkit designed to benchmark multilingual Large Language Models (LLMs) in a language-specific way ☆18 · Nov 4, 2025 · Updated 3 months ago
- XML to CSV converter for large files using Apache Spark ☆12 · Jul 11, 2020 · Updated 5 years ago
- PySpark for ETL jobs including lineage to Apache Atlas in one script via code inspection ☆17 · Jan 12, 2017 · Updated 9 years ago
- Detect memory leaks in minutes without a heap dump. ☆17 · Apr 7, 2017 · Updated 8 years ago
- PySpark-Tutorial provides basic algorithms using PySpark ☆1,273 · May 26, 2025 · Updated 8 months ago
- Consolidate all problem sets into one repo ☆16 · Mar 4, 2025 · Updated 11 months ago
- TOC is a decentralized solution based on e-sport + entertainment live video industry ☆11 · Jun 4, 2018 · Updated 7 years ago
- Git tutorial materials ☆23 · Jun 19, 2017 · Updated 8 years ago
- ☆18 · Nov 9, 2025 · Updated 3 months ago
- Highly interactive, thread-parallel Lattice Boltzmann CFD solver ☆21 · Apr 29, 2019 · Updated 6 years ago
- Live Demonstrations of Java Performance Problems. Instructions: https://github.com/eostermueller/javaPerformanceTroubleshooting/wiki/In… ☆20 · Jan 21, 2022 · Updated 4 years ago
- A tutorial that helps Big Data Engineers ramp up faster by getting familiar with PySpark dataframes and functions. It also covers topics … ☆20 · Nov 12, 2021 · Updated 4 years ago
- Contains public materials for students enrolled in MITx: 6.871x, Machine Learning for Healthcare ☆20 · Jun 12, 2021 · Updated 4 years ago
- Collection of Databricks and Jupyter Notebooks ☆22 · Feb 9, 2026 · Updated last week
- Python wrapper generator for Fortran ☆31 · Feb 6, 2026 · Updated last week
- MapReduce, Spark, Java, and Scala for Data Algorithms Book ☆1,084 · Oct 14, 2024 · Updated last year
- How to manage Slowly Changing Dimensions with Apache Hive ☆55 · Aug 27, 2019 · Updated 6 years ago
- Python Emacs Intellisense and Unit Testing Support for Fortran ☆21 · Jan 3, 2020 · Updated 6 years ago
- Updated repository ☆157 · Nov 25, 2021 · Updated 4 years ago
- AutoML Software designed to give users access to a whole plethora of ML models, some trainable on the GPU. ☆14 · Oct 23, 2021 · Updated 4 years ago
- A Scalable Data Cleaning Library for PySpark. ☆29 · Apr 4, 2019 · Updated 6 years ago
- This repository contains a Fortran implementation of a 2D flow using the projection method, with Finite Volume Method (FVM) approach. The… ☆24 · Nov 29, 2022 · Updated 3 years ago
- Code examples on Apache Spark using Python ☆108 · Aug 11, 2022 · Updated 3 years ago
- An (unofficial) command line interface for Google APIs ☆31 · May 22, 2023 · Updated 2 years ago
- 2D Laminar Flow Solver for Computational Fluid Dynamics Course ☆33 · Jan 23, 2016 · Updated 10 years ago
- As the field of Computational Fluid Dynamics (CFD) progresses, the fluid flows are more and more analysed by using simulations with the h… ☆32 · Jun 6, 2023 · Updated 2 years ago
- ☆10 · Jun 21, 2021 · Updated 4 years ago
- Data sets and ML models versioning example from DVC get started ☆10 · Jun 4, 2024 · Updated last year
- ⛅ Run OpenVSCode Server in Google Cloud Shell ☆11 · Dec 22, 2023 · Updated 2 years ago
- ☆10 · Jun 29, 2021 · Updated 4 years ago
- Record matching and entity resolution at scale in Spark ☆36 · Oct 31, 2023 · Updated 2 years ago
- Distributed adaptive octree construction, 2:1 balancing & partitioning based on space filling curves ☆33 · Aug 19, 2024 · Updated last year
- Instant search for and access to many datasets in PySpark. ☆34 · Oct 6, 2022 · Updated 3 years ago
- ☆14 · Sep 14, 2021 · Updated 4 years ago