sllynn / spark-xgboost
A Python wrapper for XGBoost4J-Spark classes.
☆47Updated 9 months ago
Alternatives and similar repositories for spark-xgboost:
Users who are interested in spark-xgboost are comparing it to the libraries listed below.
- A simplified version of featuretools for Spark☆31Updated 5 years ago
- The Synthetic Minority Oversampling Technique (SMOTE) implemented in Spark.☆49Updated 6 years ago
- ☆34Updated 6 years ago
- Python library for converting Apache Spark ML pipelines to PMML☆95Updated last year
- A parallel implementation of factorization machines based on Spark☆73Updated 4 years ago
- Positive-Unlabeled Learning for Apache Spark☆42Updated 6 years ago
- A prototype implementation of ClusChurn based on PyTorch.☆52Updated 6 years ago
- An attention-based Recurrent Neural Net multi-touch attribution model in a supervised learning fashion of predicting if a series of event…☆29Updated 3 years ago
- Machine learning enhancements to Spark MLlib☆20Updated 9 years ago
- Java library and command-line application for converting XGBoost models to PMML☆129Updated last week
- Sample application running fbprophet using Spark☆49Updated 5 years ago
- Spark-based GBM☆56Updated 4 years ago
- An implementation of our CIKM 2018 paper "Deep Conversion Attribution with Dual-attention Recurrent Neural Network"☆61Updated 5 years ago
- Convert DataFrame to libffm data format in parallel☆30Updated 6 years ago
- Data Exploration in PySpark made easy - Pyspark_dist_explore provides methods to get fast insights in your Spark DataFrames.☆103Updated 5 years ago
- Some notes/codes on hyperparameters tuning techniques with some hacking around...☆24Updated 6 years ago
- Spark On Angel, arming Spark with a powerful Parameter Server, which enables Spark to train very big models☆84Updated 2 years ago
- LightCTR is a TensorFlow 2.0-based, extensible toolbox for building CTR/CVR prediction models.☆103Updated last year
- A distributed Spark/Scala implementation of the isolation forest algorithm for unsupervised outlier detection, featuring support for scal…☆234Updated last month
- Spark implementation of computing Shapley values using Monte Carlo approximation☆74Updated last year
- Joblib Apache Spark Backend☆244Updated 5 months ago
- Isolation Forest on Spark☆227Updated 3 months ago
- Python implementation of the population stability index (PSI)☆134Updated last year
- Uplift modeling and evaluation library. Actively maintained pypi version.☆74Updated last year
- Hybrid model of Gradient Boosting Trees and Logistic Regression (GBDT+LR) on Spark☆88Updated 6 years ago
- Some popular algorithms (DBSCAN, KNN, FM, etc.) on Spark☆32Updated 6 years ago
- Spark Implementation of Google Facets Overview https://github.com/PAIR-code/facets☆54Updated last year
- A feedback controller for stabilizing RTB performance to a target value.☆65Updated 9 years ago
- An automatic machine learning toolkit, including hyper-parameter tuning and feature engineering.☆58Updated 5 years ago