sllynn / spark-xgboost
A Python wrapper for XGBoost4J-Spark classes.
☆47Updated 11 months ago
Alternatives and similar repositories for spark-xgboost:
Users that are interested in spark-xgboost are comparing it to the libraries listed below
- A simplified version of featuretools for Spark☆31Updated 5 years ago
- ☆34Updated 6 years ago
- The Synthetic Minority Oversampling Technique (SMOTE) implemented in Spark.☆49Updated 6 years ago
- ☆74Updated 6 years ago
- Uplift modeling and evaluation library. Actively maintained pypi version.☆75Updated last year
- A POC of Google's Wide & Deep Learning models deployed on Google Cloud ML Engine for Kaggle's Outbrain Click Competition☆36Updated 6 years ago
- A prototype implementation of ClusChurn based on PyTorch.☆52Updated 7 years ago
- Joblib Apache Spark Backend☆245Updated 7 months ago
- Java library and command-line application for converting XGBoost models to PMML☆129Updated last month
- Some notes/codes on hyperparameters tuning techniques with some hacking around...☆24Updated 6 years ago
- Nyoka is a Python library that helps to export ML models into PMML (PMML 4.4.1 Standard).☆185Updated last year
- Spark On Angel, arming Spark with a powerful Parameter Server, which enable Spark to train very big models☆84Updated 2 years ago
- Python implementation of the population stability index (PSI)☆139Updated last year
- Isolation Forest on Spark☆227Updated 5 months ago
- A Python implementation of "Shapley Value Methods for Attribution Modeling in Online Advertising" by Zhao, et al.☆39Updated 4 years ago
- Machine learning enhancements to Spark MlLib☆20Updated 10 years ago
- LightCTR is a tensorflow 2.0 based, extensible toolbox for building CTR/CVR predicting models.☆102Updated last year
- Sample application running fbprophet using spark☆49Updated 6 years ago
- A parallel implementation of factorization machines based on Spark☆73Updated 4 years ago
- Multiple Response Uplift (or heterogeneous treatment effects) package that builds and evaluates tradeoffs with multiple treatments and mu…☆66Updated 8 months ago
- Spark implementation of computing Shapley Values using monte-carlo approximation☆74Updated 2 years ago
- Finding similar, high-valued users based on seed users. The model includes 1805 features using Hive HQL and AWS Redshift.☆34Updated 5 years ago
- convert DataFrame to libffm data format in parallel☆30Updated 6 years ago
- Data Exploration in PySpark made easy - Pyspark_dist_explore provides methods to get fast insights in your Spark DataFrames.☆103Updated 5 years ago
- Jupyter Notebook used for writing the article "Black-Box models are actually more explainable than a Logistic Regression" published in To…☆73Updated 2 years ago
- Positive-Unlabeled Learning for Apache Spark☆42Updated 7 years ago
- PySpark Algorithms Book: https://www.amazon.com/dp/B07X4B2218/ref=sr_1_2☆84Updated 5 years ago
- JPMML-SparkML plugin for converting XGBoost4J-Spark models to PMML☆36Updated 5 years ago
- Public solution for AutoSeries competition☆72Updated 5 years ago
- About uplift modeling☆30Updated 8 years ago