LIDIAgroup / SparkFeatureSelection
Generic implementation of Information Theory-based Feature Selection methods. It also contains an Entropy Minimization Discretization implementation, as well as two artificial dataset generators.
☆19Updated 10 years ago
Alternatives and similar repositories for SparkFeatureSelection:
Users that are interested in SparkFeatureSelection are comparing it to the libraries listed below
- Code for the 3rd place finish for Avazu Click-Through Rate Prediction☆86Updated 9 years ago
- Reactive Factorization Engine☆104Updated 10 years ago
- Zen aims to provide the largest scale and the most efficient machine learning platform on top of Spark, including but not limited to logi…☆170Updated 6 years ago
- A simple implementation of Microsoft's AdPredictor (http://bit.ly/SFgcq8) in Python☆91Updated 11 years ago
- A SimRank algorithm implementation using Spark☆49Updated 11 years ago
- Spark implementation of Fayyad's discretizer based on Minimum Description Length Principle (MDLP)☆44Updated 2 years ago
- follow-the-regularized-leader implemented by java, with an example using criteo dataset.☆37Updated 9 years ago
- libffm with ftrl updater☆93Updated 7 years ago
- DistML provide a supplement to mllib to support model-parallel on Spark☆166Updated 8 years ago
- Multithreaded Asynchronous FTRL Proximal Implementation☆128Updated 7 years ago
- Multi-thread implementation of Piece-wise Linear Model(PLM) or Mixture of LR(MLR) with FTRL for binary-class classification problem.☆127Updated 3 years ago
- Software for the kaggle criteo challenge☆53Updated 10 years ago
- Implementation of Factorization Machines on Spark using parallel stochastic gradient descent (python and scala)☆229Updated 8 years ago
- Glint: High performance scala parameter server☆168Updated 6 years ago
- 7th in a competition organised by ICT☆24Updated 9 years ago
- field-aware factorization machine implemented by java with an experiment using criteo data set.☆39Updated 9 years ago
- An implementation of the multi-class/multi-label classifier, of which the training is carried out using AdaBoost.MH on Apache Spark.☆107Updated 10 years ago
- Bayesian Personalized Ranking for Spark☆40Updated 7 years ago
- Factorization Machines on Spark and Glint☆25Updated 8 years ago
- Kaggle Avazu beat-the-benchmark model☆36Updated 10 years ago
- Spark-based GBM☆56Updated 5 years ago
- a simple gradient boost classification tree implemented with python without any other lib dependence.☆47Updated 11 years ago
- fast_tffm: Tensorflow-based Distributed Factorization Machine☆143Updated 7 years ago
- LR and FM (with sgd or ftrl) model☆25Updated 8 years ago
- ☆108Updated 7 years ago
- Vector-free L-BFGS implementation for Spark MLlib☆47Updated 7 years ago
- Criteo/Kaggle Competition of CTR prediction☆130Updated 10 years ago
- A recommend system forked from APEX☆82Updated 10 years ago
- Hashed Factorization Machine with Follow The Regularized Leader for Kaggle Avazu Click-Through Rate Competition☆260Updated 8 years ago
- Machine learning applied at large scale☆10Updated 8 years ago