perdisci / jbirch
Automatically exported from code.google.com/p/jbirch
☆12Updated 2 years ago
Alternatives and similar repositories for jbirch:
Users that are interested in jbirch are comparing it to the libraries listed below
- Using latent Dirichlet allocation (LDA) in Apache Lucene☆58Updated 12 years ago
- Code for KDD 2014 paper "Mining Topics in Documents: Standing on the Shoulders of Big Data"☆21Updated 9 years ago
- ADMM based large scale logistic regression☆337Updated last year
- Scalable Topic Modeling using Variational Inference in MapReduce☆150Updated 9 years ago
- Java implementation of the Microsoft's AdPredictor algorithm☆17Updated 6 years ago
- Scalable Distributed LDA implementation for Spark & Glint☆28Updated 8 years ago
- Topic Modeling on Apache Spark☆94Updated 5 years ago
- Predicting job salaries from ads - a Kaggle competition☆55Updated 10 years ago
- Get Data Reused☆20Updated 7 years ago
- Online LDA based on Spark☆16Updated 10 years ago
- The experiment software underlying two papers published at ECIR-2015 and SEMEVAL-2015.☆37Updated 9 years ago
- Assembly of fundamental statistics implemented based on Apache Spark☆31Updated 9 years ago
- Classifying text with bag-of-words☆113Updated 9 years ago
- Spark MLlib code optimized to efficiently support sparse data☆50Updated 8 years ago
- Machine learning applied at large scale☆10Updated 8 years ago
- Vowpal Wabbit Webservice. A web service that accepts VW formatted text and runs it through a VW daemon instance.☆40Updated 8 years ago
- Java 8 Factorization Machines Library☆27Updated 8 years ago
- Mercury:Recommendation Engine Sandbox Using Movielens Dataset☆8Updated 10 years ago
- Online Machine Learning Algorithms☆30Updated last year
- Quickly start YARN cluster on EC2☆30Updated 7 years ago
- Spark implementation of the Google Correlate algorithm to quickly find highly correlated vectors in huge datasets☆92Updated 9 years ago
- POC IDS anomaly detection engine built with iPython notebook, matplotlib, pandas, numpy, scikit-learn, d3.js, hyperloglog implementation,…☆79Updated 10 years ago
- Kaggle Criteo https://www.kaggle.com/c/criteo-display-ad-challenge☆98Updated 10 years ago
- An implementation of the multi-class/multi-label classifier, of which the training is carried out using AdaBoost.MH on Apache Spark.☆107Updated 10 years ago
- Splash Project for parallel stochastic learning☆94Updated 7 years ago
- FTRL proximal algorithm according to McMahan et al. 2013☆24Updated 5 years ago
- Predictive analatics using deepLearning4j and Spark☆26Updated 8 years ago
- Online Latent Dirichlet Allocation with Infinite Vocabulary using Variational Inference☆74Updated 9 years ago
- TF-IDF with Spark for the Kaggle popcorn competition☆10Updated 9 years ago
- word2vec variations☆7Updated 7 years ago