Machine learning enhancements to Spark MlLib
☆20Mar 19, 2015Updated 11 years ago
Alternatives and similar repositories for spark-mrmr-feature-selection
Users that are interested in spark-mrmr-feature-selection are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This package contains a generic implementation of greedy Information Theoretic Feature Selection (FS) methods. The implementation is base…☆135May 5, 2022Updated 3 years ago
- Featureselection methods as Spark MLlib Pipelines☆31Apr 29, 2018Updated 7 years ago
- An improved implementation of the classical feature selection method: minimum Redundancy and Maximum Relevance (mRMR).☆83Apr 1, 2022Updated 3 years ago
- My answers to the exercises from the book "Scala for the impatient" (2nd edition) -- 2017.☆19May 24, 2017Updated 8 years ago
- How do we measure the degradation of a machine learning process? Why does the performance of our predictive models decrease? Maybe it is …☆33Sep 27, 2021Updated 4 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Feature engineering toolkit for Spark MLlib.☆12Apr 1, 2017Updated 8 years ago
- R package providing basic command line optional argument parsing☆12Mar 22, 2026Updated last week
- 主要解决ctr预估工程中的特征选择,特征编号(特征离散),单特征auc和logloss这3个问题.☆20Mar 30, 2017Updated 9 years ago
- ☆21May 5, 2016Updated 9 years ago
- ☆15Mar 15, 2018Updated 8 years ago
- Dr.Riptide - DOS game reverse engineered, tools☆13Nov 29, 2019Updated 6 years ago
- Hive User-Defined Functions (UDFs) for Text Mining☆14Feb 24, 2014Updated 12 years ago
- Materials for the Applied Machine Learning Workshop in New York☆14Sep 12, 2018Updated 7 years ago
- A curated inventory of machine learning methods available on the Apache Spark platform, both in official and third party libraries.☆66Apr 16, 2017Updated 8 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Math24o: 高中奥林匹克数学竞赛测评集 High School Olympiad Mathematics Chinese Benchmark☆11Mar 27, 2025Updated last year
- An implementation of GloVe model for learning word representations for big text corpuses distributed with Apache Spark.☆15Feb 25, 2018Updated 8 years ago
- The Synthetic Minority Oversampling Technique (SMOTE) implemented in Spark.☆48Jul 4, 2018Updated 7 years ago
- ☆26Nov 22, 2022Updated 3 years ago
- Implementation to VirtualTaobao☆13Jan 17, 2020Updated 6 years ago
- Describes how to perform remote-node load balancing of work with Akka☆30Aug 25, 2012Updated 13 years ago
- Implementation of Robust PCA and Robust Deep Autoencoder over Time Series☆14May 17, 2020Updated 5 years ago
- Interactive visualization of non-linear logistic regression decision boundaries☆28Jul 24, 2014Updated 11 years ago
- Filipino multi-modal NLP dataset. Consists of 350k+ Filipino news articles and associated images☆13Mar 11, 2025Updated last year
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- ☆22Feb 14, 2020Updated 6 years ago
- In this small project we will predict the email that in which folder it will go in spam or primary.☆11Jul 5, 2016Updated 9 years ago
- Source code for 'Pro Spark Streaming' by Zubair Nabi☆11Mar 27, 2017Updated 9 years ago
- Org-Mode Elixir language support☆25Jan 25, 2018Updated 8 years ago
- A package for Go that can be used for range queries on large number of intervals☆43Jan 6, 2017Updated 9 years ago
- TARNet Model with tensorflow 2 API.☆11Jun 7, 2025Updated 9 months ago
- A simple in-memory graph database (wrapper for python-igraph)☆11Jul 6, 2019Updated 6 years ago
- Ordinal output layers and loss functions (Rennie & Srebro, 2005) for PyTorch and TF/Keras.☆13Mar 24, 2026Updated last week
- Materials of the PyData Madrid monthly meetups☆16Oct 26, 2024Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆10Dec 8, 2022Updated 3 years ago
- a CLI tool to easily deploy your current working branch to GitHub Pages☆20Jul 25, 2018Updated 7 years ago
- ☆11Oct 8, 2015Updated 10 years ago
- notebooks for nlp-on-spark☆13Jan 27, 2017Updated 9 years ago
- Scala for the Impatient (2nd edition) - My Solutions☆10Dec 22, 2017Updated 8 years ago
- Explore the potential of recommendation system using reinforcement learning☆15Apr 23, 2020Updated 5 years ago
- "Python packaging: lo estás haciendo mal"☆14Mar 4, 2021Updated 5 years ago