Quantmetry / pipeasy-sparkLinks
an easy way to define preprocessing data pipeline (similar to sklean-pandas but for Spark ML)
☆17Updated 6 years ago
Alternatives and similar repositories for pipeasy-spark
Users that are interested in pipeasy-spark are comparing it to the libraries listed below
Sorting:
- Hierarchical Time Series Forecasting with a familiar API☆226Updated 2 years ago
- Better `keras` models for time series and beyond☆61Updated 2 years ago
- Hierarchical Time Series Forecasting using Prophet☆144Updated 4 years ago
- scikit-learn-inspired time series☆201Updated last year
- Supervised forecasting of sequential data in Python.☆55Updated 7 years ago
- Repository for the research and implementation of categorical encoding into a Featuretools-compatible Python library☆51Updated 3 years ago
- An extension of CatBoost to probabilistic modelling☆148Updated 2 years ago
- Embed categorical variables via neural networks.☆59Updated 2 years ago
- Repo for the ML_Insights python package☆153Updated 9 months ago
- Python package for Imputation Methods☆251Updated 2 years ago
- A simple, extensible library for developing AutoML systems☆175Updated 2 years ago
- A library that unifies the API for most commonly used libraries and modeling techniques for time-series forecasting in the Python ecosyst…☆150Updated last year
- A package for data science practitioners. This library implements a number of helpful, common data transformations with a scikit-learn fr…☆58Updated 4 years ago
- Python package for causal inference using Bayesian structural time-series models.☆244Updated 5 years ago
- Lightweight, Python library for fast and reproducible experimentation☆136Updated 7 years ago
- Experimental Gradient Boosting Machines in Python with numba.☆189Updated 7 years ago
- BATS and TBATS forecasting methods☆183Updated 2 years ago
- xverse (XuniVerse) is collection of transformers for feature engineering and feature selection☆117Updated 2 years ago
- ⬛ Python Individual Conditional Expectation Plot Toolbox☆164Updated 5 years ago
- PyAF is an Open Source Python library for Automatic Time Series Forecasting built on top of popular pydata modules.☆459Updated 2 months ago
- Tutorial for a new versioning Machine Learning pipeline☆80Updated 4 years ago
- GAM timeseries modeling with auto-changepoint detection. Inspired by Facebook Prophet and implemented in PyMC3☆326Updated 5 years ago
- Tabular feature encoding pipelines for machine learning with options for string parsing, missing data infill, and stochastic perturbation…☆164Updated 6 months ago
- Confidence intervals for scikit-learn forest algorithms☆290Updated 9 months ago
- Sky Cast: A Comparison of Modern Techniques for Forecasting Time Series☆68Updated 8 years ago
- Phi_K correlation analyzer library☆172Updated 2 weeks ago
- Example usage of scikit-hts☆57Updated 3 years ago
- ACV is a python library that provides explanations for any machine learning model or data. It gives local rule-based explanations for any…☆102Updated 3 years ago
- Home of the PipeGraph extension to Scikit-Learn☆24Updated 10 months ago
- vtreat is a data frame processor/conditioner that prepares real-world data for predictive modeling in a statistically sound manner. Distr…☆120Updated last year