LIDIAgroup / SparkFeatureSelection
Generic implementation of Information Theory-based Feature Selection methods. It also contains an Entropy Minimization Discretization implementation, as well as two artificial dataset generators.
☆19 Updated 11 years ago
Alternatives and similar repositories for SparkFeatureSelection
Users that are interested in SparkFeatureSelection are comparing it to the libraries listed below
- A SimRank algorithm implementation using Spark ☆49 Updated 11 years ago
- Reactive Factorization Engine ☆104 Updated 10 years ago
- DistML provides a supplement to MLlib to support model parallelism on Spark ☆167 Updated 8 years ago
- Code for the 3rd place finish for Avazu Click-Through Rate Prediction ☆87 Updated 10 years ago
- Backup of papers read ☆17 Updated 9 years ago
- Zen aims to provide the largest scale and the most efficient machine learning platform on top of Spark, including but not limited to logi… ☆170 Updated 6 years ago
- Criteo/Kaggle Competition of CTR prediction ☆129 Updated 10 years ago
- Multithreaded Asynchronous FTRL Proximal Implementation ☆128 Updated 8 years ago
- Follow-the-regularized-leader implemented in Java, with an example using the Criteo dataset. ☆37 Updated 9 years ago
- An implementation of the multi-class/multi-label classifier, whose training is carried out using AdaBoost.MH on Apache Spark. ☆108 Updated 10 years ago
- Kaggle Avazu beat-the-benchmark model ☆36 Updated 10 years ago
- Field-aware factorization machine implemented in Java, with an experiment using the Criteo dataset. ☆39 Updated 10 years ago
- libffm with ftrl updater ☆94 Updated 8 years ago
- Notes on Logistic Regression and OWLQN ☆26 Updated 8 years ago
- An implementation of Factorization Machines (LibFM) ☆252 Updated 7 years ago
- ☆154 Updated 6 years ago
- A simple implementation of Microsoft's AdPredictor (http://bit.ly/SFgcq8) in Python ☆91 Updated 11 years ago
- A simple gradient boosting classification tree implemented in Python without any other library dependencies. ☆48 Updated 12 years ago
- Hashed Factorization Machine with Follow The Regularized Leader for Kaggle Avazu Click-Through Rate Competition ☆260 Updated 8 years ago
- Multi-threaded implementation of Piece-wise Linear Model (PLM) or Mixture of LR (MLR) with FTRL for binary classification problems. ☆129 Updated 4 years ago
- Item and User-based KNN recommendation algorithms using PySpark ☆125 Updated 7 years ago
- Implementation of Factorization Machines on Spark using parallel stochastic gradient descent (Python and Scala) ☆230 Updated 9 years ago
- 7th in a competition organised by ICT ☆24 Updated 9 years ago
- ☆10 Updated 10 years ago
- Spark implementation of Fayyad's discretizer based on Minimum Description Length Principle (MDLP) ☆43 Updated 2 years ago
- sparse word2vec ☆108 Updated 3 years ago
- ADMM based large scale logistic regression ☆337 Updated last year
- Spark-based approximate nearest neighbor search using locality-sensitive hashing ☆104 Updated 9 years ago
- Scalable recommendation system written in Scala using the Apache Spark framework ☆105 Updated 10 years ago
- Distributed FM and LR based on Parameter Server with Ftrl ☆129 Updated 7 years ago