LIDIAgroup / SparkFeatureSelection
Generic implementation of Information Theory-based Feature Selection methods. It also contains an Entropy Minimization Discretization implementation, as well as two artificial dataset generators.
☆19Updated 10 years ago
Alternatives and similar repositories for SparkFeatureSelection:
Users that are interested in SparkFeatureSelection are comparing it to the libraries listed below
- Reactive Factorization Engine☆104Updated 10 years ago
- A SimRank algorithm implementation using Spark☆48Updated 11 years ago
- Zen aims to provide the largest scale and the most efficient machine learning platform on top of Spark, including but not limited to logi…☆170Updated 6 years ago
- Spark implementation of Fayyad's discretizer based on Minimum Description Length Principle (MDLP)☆43Updated 2 years ago
- Code for the 3rd place finish for Avazu Click-Through Rate Prediction☆87Updated 10 years ago
- follow-the-regularized-leader implemented by java, with an example using criteo dataset.☆37Updated 9 years ago
- Kaggle Avazu beat-the-benchmark model☆36Updated 10 years ago
- A simple implementation of Microsoft's AdPredictor (http://bit.ly/SFgcq8) in Python☆91Updated 11 years ago
- libffm with ftrl updater☆94Updated 7 years ago
- a simple gradient boost classification tree implemented with python without any other lib dependence.☆47Updated 11 years ago
- Factorization Machines on Spark and Glint☆25Updated 8 years ago
- field-aware factorization machine implemented by java with an experiment using criteo data set.☆39Updated 9 years ago
- Criteo/Kaggle Competition of CTR prediction☆130Updated 10 years ago
- Locality Sensitive Hashing for Apache Spark☆87Updated 3 years ago
- 7th in a competition organised by ICT☆24Updated 9 years ago
- Multithreaded Asynchronous FTRL Proximal Implementation☆128Updated 7 years ago
- Multi-thread implementation of Piece-wise Linear Model(PLM) or Mixture of LR(MLR) with FTRL for binary-class classification problem.☆128Updated 3 years ago
- Spark-based GBM☆56Updated 5 years ago
- DistML provide a supplement to mllib to support model-parallel on Spark☆166Updated 8 years ago
- Implementation of Factorization Machines on Spark using parallel stochastic gradient descent (python and scala)☆229Updated 8 years ago
- An implementation of the multi-class/multi-label classifier, of which the training is carried out using AdaBoost.MH on Apache Spark.☆107Updated 10 years ago
- An implement of Factorization Machines (LibFM)☆250Updated 6 years ago
- Notes on Logistic Regression and OWLQN☆26Updated 7 years ago
- ☆10Updated 9 years ago
- Vector-free L-BFGS implementation for Spark MLlib☆47Updated 7 years ago
- sparse word2vec☆108Updated 2 years ago
- Item and User-based KNN recommendation algorithms using PySpark☆126Updated 7 years ago
- Predictive analatics using deepLearning4j and Spark☆26Updated 8 years ago
- Machine learning applied at large scale☆10Updated 8 years ago
- LR and FM (with sgd or ftrl) model☆25Updated 8 years ago