Quantmetry / pipeasy-sparkLinks
an easy way to define preprocessing data pipeline (similar to sklean-pandas but for Spark ML)
☆17Updated 6 years ago
Alternatives and similar repositories for pipeasy-spark
Users that are interested in pipeasy-spark are comparing it to the libraries listed below
Sorting:
- scikit-learn-inspired time series☆201Updated last year
- Hierarchical Time Series Forecasting with a familiar API☆226Updated 2 years ago
- Hierarchical Time Series Forecasting using Prophet☆144Updated 4 years ago
- Spark implementation of computing Shapley Values using monte-carlo approximation☆79Updated 2 years ago
- Repo for the ML_Insights python package☆153Updated 8 months ago
- Repository for the research and implementation of categorical encoding into a Featuretools-compatible Python library☆51Updated 3 years ago
- A simple, extensible library for developing AutoML systems☆175Updated 2 years ago
- ⬛ Python Individual Conditional Expectation Plot Toolbox☆164Updated 5 years ago
- An extension of CatBoost to probabilistic modelling☆148Updated 2 years ago
- Better `keras` models for time series and beyond☆61Updated last year
- xverse (XuniVerse) is collection of transformers for feature engineering and feature selection☆117Updated 2 years ago
- Experimental Gradient Boosting Machines in Python with numba.☆189Updated 7 years ago
- A library that unifies the API for most commonly used libraries and modeling techniques for time-series forecasting in the Python ecosyst…☆150Updated last year
- Python package for Imputation Methods☆251Updated 2 years ago
- GAM timeseries modeling with auto-changepoint detection. Inspired by Facebook Prophet and implemented in PyMC3☆327Updated 5 years ago
- A unified framework for tabular probabilistic regression, time-to-event prediction, and probability distributions in python☆292Updated last week
- PyAF is an Open Source Python library for Automatic Time Series Forecasting built on top of popular pydata modules.☆459Updated last month
- A package for data science practitioners. This library implements a number of helpful, common data transformations with a scikit-learn fr…☆57Updated 4 years ago
- General Interpretability Package☆58Updated 3 years ago
- Home of the PipeGraph extension to Scikit-Learn☆24Updated 9 months ago
- Data Exploration in PySpark made easy - Pyspark_dist_explore provides methods to get fast insights in your Spark DataFrames.☆102Updated 6 years ago
- scikit-learn compatible implementation of stability selection.☆214Updated 2 years ago
- Example usage of scikit-hts☆57Updated 3 years ago
- vtreat is a data frame processor/conditioner that prepares real-world data for predictive modeling in a statistically sound manner. Distr…☆120Updated 11 months ago
- Confidence intervals for scikit-learn forest algorithms☆290Updated 8 months ago
- machine learning with logical rules in Python☆653Updated last year
- Embed categorical variables via neural networks.☆59Updated 2 years ago
- An automated machine learning tool aimed to facilitate AutoML research.☆102Updated last year
- 🍦 Deployment tool for online machine learning models☆98Updated 3 years ago
- Python package for performing the Alternating Conditional Expectation (ACE) regression☆72Updated 2 years ago