mjuez / approx-smote
Approx-SMOTE: fast SMOTE for Big Data on Apache Spark
☆18Updated 3 years ago
Alternatives and similar repositories for approx-smote
Users who are interested in approx-smote are comparing it to the libraries listed below.
- The Distributed Node2Vec Algorithm for Very Large Graphs☆18Updated 4 years ago
- A simple version of LightGBM (LightGBM source code analysis): a lightweight GBDT implementation.☆12Updated 5 years ago
- Implement node2vec algorithm using Spark 2 from: http://snap.stanford.edu/node2vec/☆10Updated 6 years ago
- ☆10Updated 5 years ago
- A flexible package for multimodal-deep-learning to combine tabular data with text and images using Wide and Deep models in Pytorch☆1,395Updated 4 months ago
- Practice hands-on programming through a few simple machine learning projects and quickly master machine learning algorithms☆14Updated 6 years ago
- XGBoost for label-imbalanced data: XGBoost with weighted and focal loss functions☆337Updated last year
- mtgbmcode☆177Updated 3 years ago
- esmm model by tensorflow keras☆72Updated 4 years ago
- GBST is an optimized distributed gradient boosting survival trees library that is implemented based on the XGBoost☆37Updated 5 years ago
- ☆369Updated last year
- The Synthetic Minority Oversampling Technique (SMOTE) implemented in Spark.☆48Updated 7 years ago
- ☆15Updated 5 years ago
- Gradient boosted decision trees for multiple outputs. Better generalization ability, faster training and inference.☆47Updated last year
- DSSM deep retrieval (recall) experiments based on the movieLen1M (MovieLens 1M) dataset☆24Updated 4 years ago
- An implementation of the focal loss to be used with LightGBM for binary and multi-class classification problems☆256Updated 6 years ago
- Uplift modeling package.☆377Updated 3 years ago
- Uplift modeling and evaluation library. Actively maintained pypi version.☆78Updated 2 years ago
- Implementation of paper DESCN, which is accepted in SIGKDD 2022.☆105Updated 2 years ago
- TensorFlow implementation of multi-task learning architectures, incl. MMoE & PLE, on wechat dataset☆215Updated 4 years ago
- ☆72Updated 4 years ago
- Isolation Forest on Spark☆232Updated last year
- Finding similar, high-valued users based on seed users. The model includes 1805 features using Hive HQL and AWS Redshift.☆36Updated 6 years ago
- Factorization Machine models in PyTorch☆1,088Updated last year
- Python library for converting Scikit-Learn pipelines to PMML☆698Updated last week
- Official code for "DaisyRec 2.0: Benchmarking Recommendation for Rigorous Evaluation" (TPAMI2022) and "Are We Evaluating Rigorously? Benc…☆549Updated last year
- ☆411Updated 10 months ago
- Factorization Machines for Recommendation and Ranking Problems with Implicit Feedback Data☆174Updated last year
- Competition Review☆26Updated 5 years ago
- ☆18Updated 5 years ago