Spark implementation of Fayyad's discretizer based on Minimum Description Length Principle (MDLP)
☆43Jan 12, 2023Updated 3 years ago
Alternatives and similar repositories for spark-MDLP-discretization
Users that are interested in spark-MDLP-discretization are comparing it to the libraries listed below
Sorting:
- This package contains a generic implementation of greedy Information Theoretic Feature Selection (FS) methods. The implementation is base…☆135May 5, 2022Updated 3 years ago
- Generic implementation of Information Theory-based Feature Selection methods. It also contains an Entropy Minimization Discretization imp…☆19Jul 21, 2014Updated 11 years ago
- Discretization with Fayyad and Irani's minimum description length principle criterion (MDLPC)☆60Aug 7, 2018Updated 7 years ago
- Machine learning enhancements to Spark MlLib☆20Mar 19, 2015Updated 10 years ago
- The machine learning component of Open Network Insight: scalable analytics combining spark for big data and C / MPI for high performance …☆13Nov 9, 2016Updated 9 years ago
- Scalable PCA (sPCA) is a scalable implementation of Principal component analysis algorithm on top of Spark☆12May 12, 2015Updated 10 years ago
- Code for my own blog☆10Nov 7, 2013Updated 12 years ago
- An implementation of Maximum Entropy model☆14Apr 28, 2012Updated 13 years ago
- Another, hopefully better, implementation of ALS on Spark☆14May 20, 2015Updated 10 years ago
- A framework for PSL inference.☆21Nov 9, 2015Updated 10 years ago
- Breezedeus's Blog☆17Jul 4, 2023Updated 2 years ago
- Glint: High performance scala parameter server☆170Jul 20, 2018Updated 7 years ago
- Affinity Propagation on Spark☆20May 31, 2021Updated 4 years ago
- Vector-free L-BFGS implementation for Spark MLlib☆46Jun 23, 2017Updated 8 years ago
- analytics tool kit☆42Jan 23, 2017Updated 9 years ago
- ☆25Mar 12, 2018Updated 7 years ago
- A library for exporting Spark ML models and pipelines to PFA☆55Nov 21, 2018Updated 7 years ago
- Using JPMML Evaluator to validate the PMML models exported from Spark☆19May 1, 2017Updated 8 years ago
- TeraSort for Spark and Flink which uses a range partitioner based on sampling☆22Feb 5, 2016Updated 10 years ago
- PMML evaluator library for Apache Spark☆98Feb 8, 2026Updated 3 weeks ago
- Gaussian Mixture Model Implementation in Pyspark☆31Dec 2, 2014Updated 11 years ago
- Joins for skewed datasets in Spark☆57Aug 18, 2017Updated 8 years ago
- Parameter Server implementation in Apache Flink☆56Oct 15, 2018Updated 7 years ago
- Zeppelin notebook examples☆25Feb 18, 2016Updated 10 years ago
- CRF is a Java implementation of Conditional Random Fields, an algorithm for learning from labeled sequences of examples. It also includes…☆28Sep 4, 2014Updated 11 years ago
- Deeplearning framework running on Spark☆61Dec 16, 2023Updated 2 years ago
- ☆32Mar 20, 2020Updated 5 years ago
- ADMM on Apache Spark☆31Jul 21, 2015Updated 10 years ago
- This package is essentially a ros-wrapper of neural_cam. More features would be added in the future, geared towards mobile robot platform…☆11Jul 12, 2019Updated 6 years ago
- ☆11Jul 7, 2020Updated 5 years ago
- Text Classification Engine☆36Jun 10, 2019Updated 6 years ago
- Featureselection methods as Spark MLlib Pipelines☆31Apr 29, 2018Updated 7 years ago
- ChiMerge: Discretization of Numeric Attributes☆41Mar 15, 2016Updated 9 years ago
- R as a backend for web apps.☆10Mar 7, 2018Updated 7 years ago
- CWTS OpenAlex ETL data pipeline.☆16Oct 29, 2025Updated 4 months ago
- Benchmarks of artificial neural network library for Spark MLlib☆11Dec 3, 2015Updated 10 years ago
- FTRL-Proximal Online Learning Algorithm☆15May 22, 2017Updated 8 years ago
- 中文语料:大量人工标注样本,非常有价值 !!!☆11Aug 15, 2019Updated 6 years ago
- A collection of OCR'd and machine-corrected Greek texts. This base repository contains Git submodules for the different works and an inve…☆11Nov 18, 2014Updated 11 years ago