Machine learning enhancements to Spark MLlib
☆20Mar 19, 2015Updated 11 years ago
Alternatives and similar repositories for spark-mrmr-feature-selection
Users that are interested in spark-mrmr-feature-selection are comparing it to the libraries listed below.
- This package contains a generic implementation of greedy Information Theoretic Feature Selection (FS) methods. The implementation is base…☆135May 5, 2022Updated 4 years ago
- An improved implementation of the classical feature selection method: minimum Redundancy and Maximum Relevance (mRMR).☆83Apr 1, 2022Updated 4 years ago
- This repository contains code samples for Vertex AI, including pipelines, metadata and more. Mainly with finance datasets.☆15Feb 7, 2026Updated 3 months ago
- Spark implementation of Fayyad's discretizer based on Minimum Description Length Principle (MDLP)☆43Jan 12, 2023Updated 3 years ago
- How do we measure the degradation of a machine learning process? Why does the performance of our predictive models decrease? Maybe it is …☆33Sep 27, 2021Updated 4 years ago
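- Mainly addresses three problems in CTR prediction engineering: feature selection, feature indexing (feature discretization), and single-feature AUC and logloss.☆20Mar 30, 2017Updated 9 years ago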
- ☆22Dec 7, 2015Updated 10 years ago
- Thin wrapper on top of the AppNexus API.☆12Oct 31, 2018Updated 7 years ago
- Materials for the Applied Machine Learning Workshop in New York☆14Sep 12, 2018Updated 7 years ago
- Salt state definitions and Vagrant configs for either creating a salt master or for local Vagrant VM builds.☆14Aug 13, 2014Updated 11 years ago
- ☆13Sep 6, 2016Updated 9 years ago
- A curated inventory of machine learning methods available on the Apache Spark platform, both in official and third party libraries.☆66Apr 16, 2017Updated 9 years ago
- The Synthetic Minority Oversampling Technique (SMOTE) implemented in Spark.☆48Jul 4, 2018Updated 7 years ago
- Implementation of VirtualTaobao☆13Jan 17, 2020Updated 6 years ago
- Thesis project about Visual Anomaly Detection based on Self Supervised Learning. The model identifies anomalies from information acquired…☆10Apr 14, 2023Updated 3 years ago
- Filipino multi-modal NLP dataset. Consists of 350k+ Filipino news articles and associated images☆14Mar 11, 2025Updated last year
- A small project that predicts whether an email will go to the spam or primary folder.☆11Jul 5, 2016Updated 9 years ago
- Source code for 'Pro Spark Streaming' by Zubair Nabi☆11Mar 27, 2017Updated 9 years ago
- ☆14Aug 26, 2016Updated 9 years ago
- Ordinal output layers and loss functions (Rennie & Srebro, 2005) for PyTorch and TF/Keras.☆13Mar 24, 2026Updated 2 months ago
- ☆10Dec 8, 2022Updated 3 years ago
- A collection of real-time detection methods built with Tinybird. Methods include rate-of-change, out-of-range, timeout, Z-score, and Inte…☆17Jun 4, 2024Updated last year
- Remove Tomek Links from your data.☆30Nov 4, 2017Updated 8 years ago
- ☆11Oct 8, 2015Updated 10 years ago
- Explore the potential of recommendation system using reinforcement learning☆15Apr 23, 2020Updated 6 years ago
- "Python packaging: you're doing it wrong"☆14Mar 4, 2021Updated 5 years ago
- Memorable Unique Identifier☆13Sep 29, 2022Updated 3 years ago
- TF-IDF with Spark for the Kaggle popcorn competition☆10Jul 1, 2015Updated 10 years ago
- A Claude Code plugin for managing requirement documents, tracking changes, and outputting specs — enabling downstream tools to generate c…☆27Jan 26, 2026Updated 4 months ago
- Rundeck Salt Plugin☆31Updated this week
- Kafka, Spark Streaming, Kudu integration examples☆17Dec 22, 2017Updated 8 years ago
- Openscoring application for the Docker distributed applications platform☆11Nov 8, 2020Updated 5 years ago
- Generic implementation of Information Theory-based Feature Selection methods. It also contains an Entropy Minimization Discretization imp…☆19Jul 21, 2014Updated 11 years ago
- USP Game Development Kit or USPGameDev Kit =D☆18Jun 15, 2018Updated 7 years ago
- ☆22Dec 5, 2016Updated 9 years ago
- Spark MLlib code optimized to efficiently support sparse data☆51Dec 22, 2016Updated 9 years ago
- CLI Based Browser for S3 Buckets☆14Aug 12, 2016Updated 9 years ago
- Software for the experiments reported in the RecSys 2019 paper "A Simple Multi-Armed Nearest-Neighbor Bandit for Interactive Recommendati…☆21Apr 4, 2024Updated 2 years ago
- An asynchronous behavior-driven development framework.☆13May 3, 2024Updated 2 years ago