An easy way to define a data preprocessing pipeline (similar to sklearn-pandas but for Spark ML)
☆17 · Feb 6, 2019 · Updated 7 years ago
Alternatives and similar repositories for pipeasy-spark
Users that are interested in pipeasy-spark are comparing it to the libraries listed below
- A list of repositories commonly used @ Quantmetry · ☆14 · Jul 3, 2019 · Updated 6 years ago
- 📧 Melusine: Use python to automatize your email processing workflow · ☆363 · Feb 26, 2026 · Updated 3 weeks ago
- Implementation of tree-structured neural networks in PyTorch. · ☆14 · Nov 15, 2021 · Updated 4 years ago
- A scikit-learn-compatible module for comparing imputation methods. · ☆140 · Jan 1, 2026 · Updated 2 months ago
- Article for Special Edition of Information: Machine Learning with Python · ☆14 · Jan 8, 2025 · Updated last year
- Prediction Explanations Clustering · ☆10 · Oct 19, 2023 · Updated 2 years ago
- Tools for Working with Multidimensional Data in R and C++ · ☆19 · Nov 17, 2022 · Updated 3 years ago
- .NET wrapper for Apache MXNet written in C# · ☆13 · Feb 16, 2020 · Updated 6 years ago
- A python tool for automatic image georeferencing · ☆15 · Apr 15, 2021 · Updated 4 years ago
- Techniques & resources for training interpretable ML models, explaining ML models, and debugging ML models. · ☆21 · Feb 2, 2026 · Updated last month
- PDF and notebooks of GUDHI presentation @ NIPS 2017 · ☆15 · Jan 22, 2018 · Updated 8 years ago
- A toolbox for fair and explainable machine learning · ☆55 · Jun 17, 2024 · Updated last year
- ☆19 · Dec 27, 2020 · Updated 5 years ago
- 'Keep Calm and Trust your Model' : On Explainability of Machine Learning Models · ☆15 · Jul 29, 2017 · Updated 8 years ago
- High-performances Brain Tractogram Visualization · ☆18 · May 6, 2022 · Updated 3 years ago
- COVID-19 statistics in Taiwan · ☆14 · Apr 13, 2023 · Updated 2 years ago
- Demonstrates how to submit a job to Spark on HDP directly via YARN's REST API from any workstation · ☆23 · Apr 18, 2016 · Updated 9 years ago
- ☆12 · Sep 4, 2017 · Updated 8 years ago
- Paper and talk from KDD 2019 XAI Workshop · ☆20 · May 31, 2020 · Updated 5 years ago
- Homomorphic Random Forest library · ☆17 · Apr 12, 2023 · Updated 2 years ago
- A catalog of Jupyter Notebooks presenting new techniques to interpret black box machine learning models. · ☆15 · Nov 14, 2018 · Updated 7 years ago
- ☆14 · May 30, 2019 · Updated 6 years ago
- A slider control using d3.js · ☆15 · Dec 27, 2017 · Updated 8 years ago
- Honest calibration assessment for binary outcome predictions · ☆11 · Aug 9, 2022 · Updated 3 years ago
- Simulate Evidence Accumulation Models in Python · ☆23 · Nov 16, 2021 · Updated 4 years ago
- Documentation de l'algorithme d'orientation COVID19 · ☆13 · Nov 15, 2020 · Updated 5 years ago
- A small wrapper to do Beta Boosting with XgBoost · ☆15 · Oct 26, 2021 · Updated 4 years ago
- Temporary repo to split the pseudo livrable · ☆17 · May 7, 2020 · Updated 5 years ago
- The simplest way to deploy a machine learning model · ☆24 · Nov 19, 2022 · Updated 3 years ago
- ☆13 · Feb 29, 2024 · Updated 2 years ago
- Download and load MIMIC-III into a PostgreSQL DB on an Ubuntu VM · ☆10 · Jul 3, 2016 · Updated 9 years ago
- A framework to simplify artificial intelligence prompt engineering · ☆31 · Sep 28, 2022 · Updated 3 years ago
- A simple, beautiful Jekyll theme that's mobile first. · ☆15 · Jan 5, 2023 · Updated 3 years ago
- control spark-shell from vim · ☆11 · Oct 27, 2016 · Updated 9 years ago
- Python library for converting Apache Spark ML pipelines to PMML · ☆99 · Updated this week
- Slides, videos and other potentially useful artifacts from various presentations on responsible machine learning. · ☆22 · Nov 19, 2019 · Updated 6 years ago
- Pipeline for building Machine Learning Classifiers for the diagnosis of EHR text-data. We used this pipeline for our study, published her… · ☆12 · Jul 6, 2023 · Updated 2 years ago
- mapping geodata · ☆13 · Apr 5, 2016 · Updated 9 years ago
- Optimal Spectral Transportation : audio musical unmixing using optimal transport · ☆25 · Apr 24, 2025 · Updated 10 months ago