Spark implementation of Fayyad's discretizer based on Minimum Description Length Principle (MDLP)
☆43Jan 12, 2023Updated 3 years ago
Alternatives and similar repositories for spark-MDLP-discretization
Users that are interested in spark-MDLP-discretization are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This package contains a generic implementation of greedy Information Theoretic Feature Selection (FS) methods. The implementation is base…☆135May 5, 2022Updated 4 years ago
- Generic implementation of Information Theory-based Feature Selection methods. It also contains an Entropy Minimization Discretization imp…☆19Jul 21, 2014Updated 11 years ago
- Machine learning enhancements to Spark MlLib☆20Mar 19, 2015Updated 11 years ago
- Apache Spark 2x Machine Learning Cookbook, published by Packt☆33Jul 23, 2025Updated 10 months ago
- Hide udf☆18May 23, 2013Updated 13 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Code for my own blog☆10Nov 7, 2013Updated 12 years ago
- Model-based clustering package for mixed data☆13Mar 23, 2026Updated 2 months ago
- An implementation of Maximum Entropy model☆14Apr 28, 2012Updated 14 years ago
- dllib is a distributed deep learning library running on Apache Spark☆32Oct 26, 2017Updated 8 years ago
- Practice and Workshop on BigData and Cloud Computing using Docker Containers and OpenNebula. HDFS, hadoop and spark+R☆11Mar 16, 2017Updated 9 years ago
- ☆25Mar 12, 2018Updated 8 years ago
- Factorization Machines on Spark and Glint☆25Nov 7, 2016Updated 9 years ago
- Glint: High performance scala parameter server☆170Jul 20, 2018Updated 7 years ago
- This package contains the code for executing clustering validity indices in Spark. The package includes BD-Silhouette, BD-Dunn, Davies-Bo…☆10Oct 29, 2018Updated 7 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- An improved implementation of the classical feature selection method: minimum Redundancy and Maximum Relevance (mRMR).☆83Apr 1, 2022Updated 4 years ago
- The machine learning component of Open Network Insight: scalable analytics combining spark for big data and C / MPI for high performance …☆13Nov 9, 2016Updated 9 years ago
- Breezedeus's Blog☆17Jul 4, 2023Updated 2 years ago
- Zeppelin notebook examples☆25Feb 18, 2016Updated 10 years ago
- Using JPMML Evaluator to validate the PMML models exported from Spark☆19May 1, 2017Updated 9 years ago
- Scalable PCA (sPCA) is a scalable implementation of Principal component analysis algorithm on top of Spark☆12May 12, 2015Updated 11 years ago
- Spark Streaming jobs.☆11Mar 10, 2015Updated 11 years ago
- Vector-free L-BFGS implementation for Spark MLlib☆46Jun 23, 2017Updated 8 years ago
- TeraSort for Spark and Flink which uses a range partitioner based on sampling☆22Feb 5, 2016Updated 10 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Another, hopefully better, implementation of ALS on Spark☆14May 20, 2015Updated 11 years ago
- A curated inventory of machine learning methods available on the Apache Spark platform, both in official and third party libraries.☆66Apr 16, 2017Updated 9 years ago
- Parameter Server implementation in Apache Flink☆56Oct 15, 2018Updated 7 years ago
- PMML evaluator library for Apache Spark☆99Feb 8, 2026Updated 3 months ago
- A library for exporting Spark ML models and pipelines to PFA☆55Nov 21, 2018Updated 7 years ago
- Featureselection methods as Spark MLlib Pipelines☆31Apr 29, 2018Updated 8 years ago
- Deeplearning framework running on Spark☆62Dec 16, 2023Updated 2 years ago
- A package for generating synthetic clusters with control over "difficulty"☆24Apr 24, 2026Updated last month
- ExtJS component for drawing trees (actually Directed Acyclic Graphs)☆21Aug 16, 2012Updated 13 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- SOUL: Scala Oversampling and Undersampling Library.☆13Apr 11, 2019Updated 7 years ago
- A data generator for Apache Druid☆12Mar 26, 2025Updated last year
- ChiMerge: Discretization of Numeric Attributes☆41Mar 15, 2016Updated 10 years ago
- Named Entity Recognition (NER) models (neural and sparse) implemented based on package LibN3L☆20Jan 2, 2017Updated 9 years ago
- analytics tool kit☆41Jan 23, 2017Updated 9 years ago
- Live video capture sample application on Windows platform☆10Jun 27, 2016Updated 9 years ago
- A collection of demonstration languages in Lua/Terra suitable for learning or for forking when creating a new language☆11Aug 27, 2015Updated 10 years ago