Quantmetry / pipeasy-sparkLinks
an easy way to define preprocessing data pipeline (similar to sklean-pandas but for Spark ML)
☆17Updated 6 years ago
Alternatives and similar repositories for pipeasy-spark
Users that are interested in pipeasy-spark are comparing it to the libraries listed below
Sorting:
- scikit-learn-inspired time series☆200Updated last year
- Hierarchical Time Series Forecasting with a familiar API☆225Updated 2 years ago
- Hierarchical Time Series Forecasting using Prophet☆144Updated 4 years ago
- Better `keras` models for time series and beyond☆61Updated last year
- A simple, extensible library for developing AutoML systems☆175Updated 2 years ago
- Repo for the ML_Insights python package☆152Updated 4 months ago
- xverse (XuniVerse) is collection of transformers for feature engineering and feature selection☆118Updated 2 years ago
- Repository for the research and implementation of categorical encoding into a Featuretools-compatible Python library☆51Updated 3 years ago
- Embed categorical variables via neural networks.☆59Updated 2 years ago
- A tiny framework to perform adversarial validation of your training and test data.☆22Updated 7 months ago
- ⬛ Python Individual Conditional Expectation Plot Toolbox☆165Updated 5 years ago
- Visualization ideas for data science☆20Updated 7 years ago
- An extension of CatBoost to probabilistic modelling☆147Updated last year
- A toolbox for fair and explainable machine learning☆55Updated last year
- Supervised forecasting of sequential data in Python.☆55Updated 6 years ago
- A unified framework for tabular probabilistic regression, time-to-event prediction, and probability distributions in python☆279Updated this week
- Lightweight, Python library for fast and reproducible experimentation☆136Updated 6 years ago
- Python package for causal inference using Bayesian structural time-series models.☆241Updated 5 years ago
- DataGene - Identify How Similar TS Datasets Are to One Another (by @firmai)☆205Updated 3 years ago
- General Interpretability Package☆58Updated 2 years ago
- A library that unifies the API for most commonly used libraries and modeling techniques for time-series forecasting in the Python ecosyst…☆150Updated last year
- CinnaMon is a Python library which offers a number of tools to detect, explain, and correct data drift in a machine learning system☆77Updated 2 years ago
- Buy Till You Die and Customer Lifetime Value statistical models in Python.☆117Updated last year
- Phi_K correlation analyzer library☆166Updated 2 weeks ago
- Spark implementation of computing Shapley Values using monte-carlo approximation☆76Updated 2 years ago
- Experimental Gradient Boosting Machines in Python with numba.☆185Updated 6 years ago
- Confidence intervals for scikit-learn forest algorithms☆289Updated 4 months ago
- Survival analsyis and time-to-failure predictive modeling using Weibull distributions and Recurrent Neural Networks in Keras☆244Updated 6 years ago
- Demo Weibull Time-to-event Recurrent Neural Network in Keras☆222Updated 6 years ago
- Time should be taken seer-iously☆317Updated 2 years ago