Ibotta / sk-dist
Distributed scikit-learn meta-estimators in PySpark
☆286 · Updated 4 months ago
Alternatives and similar repositories for sk-dist
Users who are interested in sk-dist are comparing it to the libraries listed below.
- Automated vs Manual Feature Engineering Comparison. Implemented using Featuretools. ☆327 · Updated 5 years ago
- Nyoka is a Python library that helps to export ML models into PMML (PMML 4.4.1 Standard). ☆187 · Updated last year
- Joblib Apache Spark Backend ☆249 · Updated 5 months ago
- Easy-to-use library to bring TensorFlow on Apache Spark ☆296 · Updated last year
- Easy hyperparameter optimization and automatic result saving across machine learning algorithms and libraries ☆708 · Updated 4 years ago
- Feature exploration for supervised learning ☆762 · Updated 4 years ago
- Automated feature engineering in Python with Featuretools ☆521 · Updated 6 years ago
- Python library for converting Apache Spark ML pipelines to PMML ☆98 · Updated last month
- Python PMML scoring library ☆78 · Updated 3 weeks ago
- Python library for converting Scikit-Learn pipelines to PMML ☆699 · Updated last week
- Isolation Forest on Spark ☆229 · Updated 10 months ago
- Tools for WoE Transformation mostly used in ScoreCard Model for credit rating ☆256 · Updated 5 years ago
- A Python wrapper for XGBoost4J-Spark classes. ☆47 · Updated last year
- AutoGBT is used for AutoML in a lifelong machine learning setting to classify large volume high cardinality data streams under concept-dr… ☆114 · Updated 5 years ago
- Train and run PyTorch models on Apache Spark. ☆340 · Updated 2 years ago
- A simplified version of featuretools for Spark ☆31 · Updated 6 years ago
- (AAAI '20) A Python Toolbox for Machine Learning Model Combination ☆660 · Updated 2 years ago
- Uplift modeling package. ☆374 · Updated 2 years ago
- ☆74 · Updated 7 years ago
- ☆249 · Updated 4 years ago
- Java library and command-line application for converting XGBoost models to PMML ☆131 · Updated last month
- Python implementation of the population stability index (PSI) ☆142 · Updated last year
- Deploy AutoML as a service using Flask ☆226 · Updated 7 years ago
- ☆34 · Updated 6 years ago
- An automatic machine learning toolkit, including hyper-parameter tuning and feature engineering. ☆60 · Updated 5 years ago
- Benchmarking Gradient Boosting in TensorFlow and XGBoost ☆137 · Updated 7 years ago
- Python package that optimizes information value, weight-of-evidence monotonicity and representativeness of features for credit scorecard … ☆117 · Updated 2 years ago
- The Synthetic Minority Oversampling Technique (SMOTE) implemented in Spark. ☆48 · Updated 7 years ago
- edaviz - Python library for Exploratory Data Analysis and Visualization in Jupyter Notebook or Jupyter Lab ☆226 · Updated 5 years ago
- Home repository for the Regularized Greedy Forest (RGF) library. It includes original implementation from the paper and multithreaded one… ☆382 · Updated 3 years ago