sramirez / spark-infotheoretic-feature-selectionLinks
This package contains a generic implementation of greedy Information Theoretic Feature Selection (FS) methods. The implementation is based on the common theoretic framework presented by Gavin Brown. Implementations of mRMR, InfoGain, JMI and other commonly used FS filters are provided.
☆134Updated 3 years ago
Alternatives and similar repositories for spark-infotheoretic-feature-selection
Users that are interested in spark-infotheoretic-feature-selection are comparing it to the libraries listed below
Sorting:
- An implementation of DBSCAN runing on top of Apache Spark☆182Updated 7 years ago
- k-Nearest Neighbors algorithm on Spark☆239Updated last year
- Spark implementation of Fayyad's discretizer based on Minimum Description Length Principle (MDLP)☆43Updated 2 years ago
- Java library and command-line application for converting Apache Spark ML pipelines to PMML☆267Updated 5 months ago
- PMML evaluator library for the Apache Spark cluster computing system (http://spark.apache.org/)☆94Updated 3 years ago
- Spark 2.0 Scala Machine Learning examples☆77Updated 5 years ago
- Machine learning enhancements to Spark MlLib☆20Updated 10 years ago
- Featureselection methods as Spark MLlib Pipelines☆30Updated 7 years ago
- A curated inventory of machine learning methods available on the Apache Spark platform, both in official and third party libraries.☆65Updated 8 years ago
- SparklingGraph provides easy to use set of features that will give you ability to proces large scala graphs using Spark and GraphX.☆152Updated 5 years ago
- Spark-based GBM☆56Updated 5 years ago
- DBSCAN clustering algorithm on top of Apache Spark☆261Updated 7 years ago
- Isolation Forest on Spark☆229Updated 9 months ago
- Locality Sensitive Hashing for Apache Spark☆87Updated 3 years ago
- Spark-based approximate nearest neighbor search using locality-sensitive hashing☆104Updated 9 years ago
- Distributed Streaming Matrix Factorization implemented on Spark for Recommendation Systems☆106Updated 9 years ago
- Implementation of Factorization Machines on Spark using parallel stochastic gradient descent (python and scala)☆230Updated 9 years ago
- Glint: High performance scala parameter server☆168Updated 7 years ago
- The Synthetic Minority Oversampling Technique (SMOTE) implemented in Spark.☆48Updated 7 years ago
- Vector-free L-BFGS implementation for Spark MLlib☆47Updated 8 years ago
- Locality Sensitive Hashing for Apache Spark☆196Updated 8 years ago
- Zen aims to provide the largest scale and the most efficient machine learning platform on top of Spark, including but not limited to logi…☆170Updated 6 years ago
- Spark Extension : ML transformers, SQL aggregations, etc that are missing in Apache Spark☆147Updated 9 years ago
- Building Annoy Index on Apache Spark☆72Updated 4 years ago
- The Nak Machine Learning Library☆343Updated 8 years ago
- A library for exporting Spark ML models and pipelines to PFA☆54Updated 6 years ago
- Train TensorFlow models on YARN in just a few lines of code!☆89Updated last year
- Scalable recommendation system written in Scala using the Apache Spark framework☆105Updated 10 years ago
- Java library and command-line application for converting XGBoost models to PMML☆129Updated 3 months ago
- Easy to use library to bring Tensorflow on Apache Spark☆295Updated last year