Quantmetry / pipeasy-sparkLinks
an easy way to define preprocessing data pipeline (similar to sklean-pandas but for Spark ML)
☆17Updated 6 years ago
Alternatives and similar repositories for pipeasy-spark
Users that are interested in pipeasy-spark are comparing it to the libraries listed below
Sorting:
- scikit-learn-inspired time series☆201Updated last year
- Hierarchical Time Series Forecasting with a familiar API☆225Updated 2 years ago
- Hierarchical Time Series Forecasting using Prophet☆144Updated 4 years ago
- Better `keras` models for time series and beyond☆61Updated last year
- Spark implementation of computing Shapley Values using monte-carlo approximation☆78Updated 2 years ago
- Multiple correspondence analysis☆181Updated 2 months ago
- Supervised forecasting of sequential data in Python.☆55Updated 6 years ago
- A library that unifies the API for most commonly used libraries and modeling techniques for time-series forecasting in the Python ecosyst…☆150Updated last year
- ⬛ Python Individual Conditional Expectation Plot Toolbox☆165Updated 5 years ago
- Repository for the research and implementation of categorical encoding into a Featuretools-compatible Python library☆51Updated 3 years ago
- Repo for the ML_Insights python package☆153Updated 7 months ago
- A simple, extensible library for developing AutoML systems☆175Updated 2 years ago
- Lightweight, Python library for fast and reproducible experimentation☆136Updated 7 years ago
- Embed categorical variables via neural networks.☆59Updated 2 years ago
- A library for composing end-to-end tunable machine learning pipelines.☆120Updated 9 months ago
- Buy Till You Die and Customer Lifetime Value statistical models in Python.☆117Updated last year
- Example usage of scikit-hts☆57Updated 3 years ago
- Visualization ideas for data science☆20Updated 7 years ago
- Python package for Imputation Methods☆251Updated last year
- Experimental Gradient Boosting Machines in Python with numba.☆188Updated 6 years ago
- An extension of CatBoost to probabilistic modelling☆148Updated 2 years ago
- xverse (XuniVerse) is collection of transformers for feature engineering and feature selection☆117Updated 2 years ago
- a python grammar for evolutionary algorithms and heuristics☆192Updated 3 years ago
- Distance metrics which can handle mixed-type data and missing values☆59Updated 3 years ago
- Confidence intervals for scikit-learn forest algorithms☆290Updated 7 months ago
- Python package for causal inference using Bayesian structural time-series models.☆244Updated 5 years ago
- vtreat is a data frame processor/conditioner that prepares real-world data for predictive modeling in a statistically sound manner. Distr…☆120Updated 10 months ago
- Tabular feature encoding pipelines for machine learning with options for string parsing, missing data infill, and stochastic perturbation…☆164Updated 4 months ago
- General Interpretability Package☆58Updated 2 years ago
- Time should be taken seer-iously☆319Updated 2 years ago