Quantmetry / pipeasy-spark
an easy way to define preprocessing data pipeline (similar to sklean-pandas but for Spark ML)
☆17Updated 5 years ago
Alternatives and similar repositories for pipeasy-spark:
Users that are interested in pipeasy-spark are comparing it to the libraries listed below
- A list of repositories commonly used @ Quantmetry☆14Updated 5 years ago
- Repository for the research and implementation of categorical encoding into a Featuretools-compatible Python library☆51Updated 2 years ago
- Model Error Analysis for scikit-learn models.☆29Updated 3 years ago
- Initier la mise à disposition, pour tout citoyen, de techniques d’Intelligence Artificielle destinées à appréhender le nombre important d…☆12Updated 4 months ago
- A toolbox for fair and explainable machine learning☆54Updated 7 months ago
- Embed categorical variables via neural networks.☆59Updated last year
- Supervised forecasting of sequential data in Python.☆55Updated 6 years ago
- ⬛ Python Individual Conditional Expectation Plot Toolbox☆165Updated 4 years ago
- A simple, extensible library for developing AutoML systems☆173Updated last year
- Hierarchical Time Series Forecasting with a familiar API☆223Updated last year
- An extension of CatBoost to probabilistic modelling☆142Updated last year
- Hierarchical Time Series Forecasting using Prophet☆144Updated 3 years ago
- Better `keras` models for time series and beyond☆61Updated last year
- Visualization ideas for data science☆19Updated 6 years ago
- Surrogate Assisted Feature Extraction☆36Updated 3 years ago
- xverse (XuniVerse) is collection of transformers for feature engineering and feature selection☆117Updated last year
- CBM Encoding☆19Updated 3 years ago
- Python implementation of R package breakDown☆42Updated last year
- scikit-learn-inspired time series☆198Updated 9 months ago
- General Interpretability Package☆58Updated 2 years ago
- vtreat is a data frame processor/conditioner that prepares real-world data for predictive modeling in a statistically sound manner. Distr…☆122Updated last week
- Phi_K correlation analyzer library☆159Updated this week
- stratx is a library for A Stratification Approach to Partial Dependence for Codependent Variables☆65Updated 8 months ago
- Repo for the ML_Insights python package☆149Updated last year
- Implementation of tree-structured neural networks in PyTorch.☆14Updated 3 years ago
- A package for data science practitioners. This library implements a number of helpful, common data transformations with a scikit-learn fr…☆55Updated 3 years ago
- This repository contains a notebook demonstrating a practical implementation of the so-called Entity Embedding for Encoding Categorical F…☆74Updated 5 years ago
- ACV is a python library that provides explanations for any machine learning model or data. It gives local rule-based explanations for any…☆100Updated 2 years ago
- Spark implementation of computing Shapley Values using monte-carlo approximation☆74Updated last year