WinVector / Logistic
Experimental logistic regression code supporting multiple result categories, many levels of categorical modeling variables, good optimization, L2 regularization and more.
☆35Updated 3 years ago
Related projects:
- Assembly of fundamental statistics implemented based on Apache Spark☆31Updated 8 years ago
- A large-scale L-BFGS implementation using the method from the NIPS 2014 paper "Large-scale L-BFGS using MapReduce".☆13Updated 9 years ago
- The experiment software underlying two papers published at ECIR-2015 and SEMEVAL-2015.☆36Updated 9 years ago
- Feature Engineering ToolBox☆8Updated 9 years ago
- LASER: A Scalable Response Prediction Platform For Online Advertising☆48Updated 9 years ago
- A Latent Dirichlet Allocation topic modeling package based on the SparseLDA Gibbs sampling inference algorithm☆8Updated 11 years ago
- Code for KDD 2014 paper "Mining Topics in Documents: Standing on the Shoulders of Big Data"☆21Updated 8 years ago
- FTRL-Proximal algorithm, following McMahan et al. (2013)☆24Updated 5 years ago
- Distributed implementation of Robust PLSA using Spark☆12Updated 3 years ago
- Source code for exploring MLlib blog post☆11Updated 9 years ago
- Vector-free L-BFGS implementation on Spark☆9Updated 8 years ago
- Machine learning applied at large scale☆10Updated 8 years ago
- Predictive analytics using Deeplearning4j and Spark☆26Updated 7 years ago
- Script to perform dictionary-based n-gram text tagging efficiently in Apache Spark☆11Updated 7 years ago
- Distributed optimization framework with parameter server☆23Updated 9 years ago
- ☆24Updated this week
- ☆14Updated this week
- In-database parallel grid-search for XGBoost on Greenplum☆15Updated 6 years ago
- Implementation of ADMM algorithm on Apache Spark☆25Updated 8 years ago
- Predicting job salaries from ads - a Kaggle competition☆55Updated 10 years ago
- Implementation of the Apriori algorithm using Spark.☆38Updated 9 years ago
- ADMM Logistic Regression implemented in Spark☆32Updated 10 years ago
- 4th Place Solution for The Hunt for Prohibited Content Competition on Kaggle (http://www.kaggle.com/c/avito-prohibited-content)☆29Updated 10 years ago
- Different approaches to computing document similarity☆28Updated 7 years ago
- Parallel Iterative Algorithm (SGD) on Hadoop's YARN framework☆42Updated 11 years ago
- Machine Intelligence Toolkits- based on Parameter Server that Efficient Distributed Communication Framework and Alternating Direction Mu…☆11Updated 6 years ago
- Document or binary file vectorization with Normalized Compression Distance in Python.☆16Updated 8 years ago
- ☆13Updated 8 years ago
- A Scala implementation of glmnet for Spark MLlib from "Regularization Paths for Generalized Linear Models via Coordinate Descent" (http:/…☆11Updated 7 years ago
- Tag documents using the top-N words from LDA☆10Updated 9 years ago