sllynn / spark-xgboost
A Python wrapper for XGBoost4J-Spark classes.
☆47 · Updated last year
Alternatives and similar repositories for spark-xgboost
Users who are interested in spark-xgboost are comparing it to the libraries listed below.
- A simplified version of featuretools for Spark ☆31 · Updated 6 years ago
- An implementation of our CIKM 2018 paper "Deep Conversion Attribution with Dual-attention Recurrent Neural Network" ☆61 · Updated 6 years ago
- The Synthetic Minority Oversampling Technique (SMOTE) implemented in Spark. ☆48 · Updated 7 years ago
- Easy converter pandas -> tfrecords & tfrecords -> pandas ☆38 · Updated 2 years ago
- Nyoka is a Python library that helps to export ML models into PMML (PMML 4.4.1 Standard). ☆185 · Updated last year
- Uplift modeling and evaluation library. Actively maintained pypi version. ☆75 · Updated last year
- Isolation Forest on Spark ☆229 · Updated 9 months ago
- Convert DataFrame to libffm data format in parallel ☆30 · Updated 7 years ago
- Machine learning enhancements to Spark MLlib ☆20 · Updated 10 years ago
- A prototype implementation of ClusChurn based on PyTorch. ☆52 · Updated 7 years ago
- PAKDD AutoML challenge 2nd Feature Engineering Part ☆70 · Updated 4 years ago
- Finding customer lookalikes using Machine Learning in PySpark ☆33 · Updated 6 years ago
- Data Exploration in PySpark made easy - Pyspark_dist_explore provides methods to get fast insights in your Spark DataFrames. ☆103 · Updated 5 years ago
- Java library and command-line application for converting XGBoost models to PMML ☆129 · Updated 3 months ago
- Compute and plot NDCG for a recommender system ☆95 · Updated 7 years ago
- Joblib Apache Spark Backend ☆249 · Updated 3 months ago
- Multiple Response Uplift (or heterogeneous treatment effects) package that builds and evaluates tradeoffs with multiple treatments and mu… ☆69 · Updated 2 months ago
- AugBoost: Gradient Boosting Enhanced with Step-Wise Feature Augmentation (2019 IJCAI paper) ☆23 · Updated 5 years ago
- xgboost Extension for Easy Ranking & TreeFeature ☆125 · Updated 5 years ago
- Spark implementation of computing Shapley Values using Monte Carlo approximation ☆74 · Updated 2 years ago
- Train TensorFlow models on YARN in just a few lines of code! ☆89 · Updated last year
- Finding similar, high-valued users based on seed users. The model includes 1805 features using Hive HQL and AWS Redshift. ☆35 · Updated 6 years ago
- An implementation of the minimum description length principle expert binning algorithm by Usama Fayyad ☆105 · Updated 2 years ago
- A Python package for feature selection ☆51 · Updated 4 years ago
- A Python implementation of "Shapley Value Methods for Attribution Modeling in Online Advertising" by Zhao, et al. ☆40 · Updated 5 years ago
- An attention-based Recurrent Neural Net multi-touch attribution model in a supervised learning fashion of predicting if a series of event… ☆32 · Updated 3 years ago
- Repository for the research and implementation of categorical encoding into a Featuretools-compatible Python library ☆51 · Updated 2 years ago