Ibotta / sk-dist
Distributed scikit-learn meta-estimators in PySpark
☆286 Updated 6 months ago
Alternatives and similar repositories for sk-dist
Users who are interested in sk-dist are comparing it to the libraries listed below.
- Automated vs Manual Feature Engineering Comparison. Implemented using Featuretools. ☆327 Updated 5 years ago
- Easy hyperparameter optimization and automatic result saving across machine learning algorithms and libraries ☆707 Updated 4 years ago
- Nyoka is a Python library that helps to export ML models into PMML (PMML 4.4.1 Standard). ☆189 Updated last year
- Easy to use library to bring Tensorflow on Apache Spark ☆296 Updated 2 years ago
- Joblib Apache Spark Backend ☆249 Updated 7 months ago
- Python library for converting Scikit-Learn pipelines to PMML ☆698 Updated 3 weeks ago
- Feature exploration for supervised learning ☆761 Updated 4 years ago
- Train and run Pytorch models on Apache Spark. ☆340 Updated 2 years ago
- Tools for WoE Transformation mostly used in ScoreCard Model for credit rating ☆256 Updated 6 years ago
- Python library for converting Apache Spark ML pipelines to PMML ☆98 Updated this week
- Automated feature engineering in Python with Featuretools ☆520 Updated 6 years ago
- Python PMML scoring library ☆79 Updated 2 months ago
- A simplified version of featuretools for Spark ☆31 Updated 6 years ago
- edaviz - Python library for Exploratory Data Analysis and Visualization in Jupyter Notebook or Jupyter Lab ☆225 Updated 5 years ago
- Python package that optimizes information value, weight-of-evidence monotonicity and representativeness of features for credit scorecard … ☆117 Updated 3 years ago
- ☆296 Updated 3 years ago
- An automatic machine learning toolkit, including hyper-parameter tuning and feature engineering. ☆60 Updated 6 years ago
- Uplift modeling package. ☆376 Updated 3 years ago
- AutoGBT is used for AutoML in a lifelong machine learning setting to classify large volume high cardinality data streams under concept-dr… ☆114 Updated 5 years ago
- Uplift modeling and evaluation library. Actively maintained pypi version. ☆78 Updated last year
- Isolation Forest on Spark ☆231 Updated last year
- (AAAI '20) A Python Toolbox for Machine Learning Model Combination ☆659 Updated 2 years ago
- Python implementation of the population stability index (PSI) ☆142 Updated 2 years ago
- ML-Ensemble – high performance ensemble learning ☆860 Updated 2 years ago
- A Python wrapper for XGBoost4J-Spark classes. ☆47 Updated last year
- Home repository for the Regularized Greedy Forest (RGF) library. It includes original implementation from the paper and multithreaded one… ☆382 Updated 3 years ago
- Kaggling Home Credit Default Risk in a pipeline fashion. ☆12 Updated 7 years ago
- ☆249 Updated 4 years ago
- ThunderGBM: Fast GBDTs and Random Forests on GPUs ☆708 Updated 7 months ago
- Deploy AutoML as a service using Flask ☆226 Updated 8 years ago