Quantmetry / pipeasy-sparkLinks
an easy way to define preprocessing data pipeline (similar to sklean-pandas but for Spark ML)
☆17Updated 6 years ago
Alternatives and similar repositories for pipeasy-spark
Users that are interested in pipeasy-spark are comparing it to the libraries listed below
Sorting:
- Repository for the research and implementation of categorical encoding into a Featuretools-compatible Python library☆51Updated 2 years ago
- Hierarchical Time Series Forecasting with a familiar API☆224Updated 2 years ago
- Better `keras` models for time series and beyond☆61Updated last year
- Initier la mise à disposition, pour tout citoyen, de techniques d’Intelligence Artificielle destinées à appréhender le nombre important d…☆12Updated 10 months ago
- Hierarchical Time Series Forecasting using Prophet☆144Updated 4 years ago
- Example usage of scikit-hts☆57Updated 2 years ago
- Supervised forecasting of sequential data in Python.☆55Updated 6 years ago
- Surrogate Assisted Feature Extraction☆37Updated 3 years ago
- Model Error Analysis for scikit-learn models.☆29Updated 3 years ago
- CBM Encoding☆19Updated 4 years ago
- General Interpretability Package☆58Updated 2 years ago
- BATS and TBATS forecasting methods☆182Updated 2 years ago
- A list of repositories commonly used @ Quantmetry☆14Updated 5 years ago
- mlmachine accelerates machine learning experimentation☆30Updated 3 years ago
- Optuna + LightGBM = OptGBM☆35Updated 2 years ago
- Phi_K correlation analyzer library☆164Updated 5 months ago
- scikit-learn-inspired time series☆200Updated last year
- NitroFE is a Python feature engineering engine which provides a variety of modules designed to internally save past dependent values for …☆106Updated 3 years ago
- ACV is a python library that provides explanations for any machine learning model or data. It gives local rule-based explanations for any…☆102Updated 2 years ago
- A tiny framework to perform adversarial validation of your training and test data.☆21Updated 5 months ago
- An extension of CatBoost to probabilistic modelling☆144Updated last year
- vtreat is a data frame processor/conditioner that prepares real-world data for predictive modeling in a statistically sound manner. Distr…☆121Updated 5 months ago
- A toolbox for fair and explainable machine learning☆55Updated last year
- xverse (XuniVerse) is collection of transformers for feature engineering and feature selection☆118Updated 2 years ago
- SHAP-based validation for linear and tree-based models. Applied to binary, multiclass and regression problems.☆150Updated 2 months ago
- Distance metrics which can handle mixed-type data and missing values☆59Updated 2 years ago
- A simple, extensible library for developing AutoML systems☆175Updated last year
- DataFrame support for scikit-learn.☆63Updated last year
- TSFresh primitives for featuretools☆36Updated 2 years ago
- Hello world univariate examples for a variety of time series packages.☆56Updated 9 months ago