Quantmetry / pipeasy-sparkLinks
an easy way to define preprocessing data pipeline (similar to sklean-pandas but for Spark ML)
☆17Updated 7 years ago
Alternatives and similar repositories for pipeasy-spark
Users that are interested in pipeasy-spark are comparing it to the libraries listed below
Sorting:
- Hierarchical Time Series Forecasting with a familiar API☆226Updated 2 years ago
- ⬛ Python Individual Conditional Expectation Plot Toolbox☆164Updated 5 years ago
- Example usage of scikit-hts☆57Updated 3 years ago
- Hierarchical Time Series Forecasting using Prophet☆144Updated 5 years ago
- scikit-learn-inspired time series☆201Updated last year
- Repository for the research and implementation of categorical encoding into a Featuretools-compatible Python library☆51Updated 3 years ago
- Spark implementation of computing Shapley Values using monte-carlo approximation☆80Updated 2 years ago
- A simple, extensible library for developing AutoML systems☆175Updated 2 years ago
- Multiple correspondence analysis☆181Updated 2 months ago
- General Interpretability Package☆58Updated 3 years ago
- Better `keras` models for time series and beyond☆61Updated 2 years ago
- Buy Till You Die and Customer Lifetime Value statistical models in Python.☆118Updated last year
- Repo for the ML_Insights python package☆153Updated 9 months ago
- A toolbox for fair and explainable machine learning☆55Updated last year
- Supervised forecasting of sequential data in Python.☆55Updated 7 years ago
- A library that unifies the API for most commonly used libraries and modeling techniques for time-series forecasting in the Python ecosyst…☆150Updated last year
- An extension of CatBoost to probabilistic modelling☆149Updated 2 years ago
- A package for data science practitioners. This library implements a number of helpful, common data transformations with a scikit-learn fr…☆58Updated 4 years ago
- Experimental Gradient Boosting Machines in Python with numba.☆189Updated 7 years ago
- BATS and TBATS forecasting methods☆183Updated 2 years ago
- xverse (XuniVerse) is collection of transformers for feature engineering and feature selection☆117Updated 2 years ago
- Confidence intervals for scikit-learn forest algorithms☆290Updated 9 months ago
- 🍦 Deployment tool for online machine learning models☆98Updated 3 years ago
- Python package for Imputation Methods☆251Updated 2 years ago
- A unified framework for tabular probabilistic regression, time-to-event prediction, and probability distributions in python☆296Updated 2 weeks ago
- Embed categorical variables via neural networks.☆59Updated 2 years ago
- Lightweight, Python library for fast and reproducible experimentation☆136Updated 7 years ago
- Automated Data Science and Machine Learning library to optimize workflow.☆105Updated 2 years ago
- NOTE: skutil is now deprecated. See its sister project: https://github.com/tgsmith61591/skoot. Original description: A set of scikit-lear…☆31Updated 7 years ago
- A catalog of Jupyter Notebooks presenting new techniques to interpret black box machine learning models.☆15Updated 7 years ago