Scalable Distributed LDA implementation for Spark & Glint
☆29 · Sep 27, 2016 · Updated 9 years ago
Alternatives and similar repositories for glintlda
Users interested in glintlda compare it to the libraries listed below.
- Glint: High performance Scala parameter server ☆170 · Jul 20, 2018 · Updated 7 years ago
- Cache efficient implementation for Latent Dirichlet Allocation ☆165 · Jan 4, 2019 · Updated 7 years ago
- DistML provides a supplement to MLlib to support model-parallel training on Spark ☆169 · Feb 6, 2017 · Updated 9 years ago
- SIMD-enabled column imprints ☆11 · Feb 12, 2018 · Updated 8 years ago
- Code for KDD 2014 paper "Mining Topics in Documents: Standing on the Shoulders of Big Data" ☆21 · Oct 6, 2015 · Updated 10 years ago
- ☆10 · Apr 20, 2016 · Updated 9 years ago
- BinDex: A Two-Layered Index for Fast and Robust Scans (SIGMOD2020) ☆10 · Jun 5, 2020 · Updated 5 years ago
- Factorization Machines on Spark and Glint ☆25 · Nov 7, 2016 · Updated 9 years ago
- Scalable, fast, and lightweight system for large-scale topic modeling ☆846 · Dec 28, 2020 · Updated 5 years ago
- ☆14 · Aug 26, 2016 · Updated 9 years ago
- Experiments on English Wikipedia. GloVe and word2vec. ☆13 · Dec 1, 2015 · Updated 10 years ago
- fast_tffm: Tensorflow-based Distributed Factorization Machine ☆141 · Mar 15, 2017 · Updated 8 years ago
- Field-aware Factorization Machines on CUDA ☆30 · Jan 15, 2026 · Updated last month
- Dictionary-based compression for inverted indexes. ☆24 · Mar 22, 2019 · Updated 6 years ago
- Distributed implementation of Robust PLSA using Spark ☆12 · Apr 29, 2021 · Updated 4 years ago
- Yahoo!'s topic modelling framework using Latent Dirichlet Allocation ☆98 · Sep 21, 2011 · Updated 14 years ago
- Simplified implementations of deep learning related works ☆13 · Oct 6, 2016 · Updated 9 years ago
- Improving the effectiveness of Lucene's BM25 (and testing it using Yahoo! Answers and Stack Overflow collections) ☆16 · Feb 26, 2022 · Updated 4 years ago
- Splash Project for parallel stochastic learning ☆93 · Jun 16, 2017 · Updated 8 years ago
- ☆108 · May 17, 2017 · Updated 8 years ago
- Space efficient (graph) algorithms ☆18 · Sep 10, 2020 · Updated 5 years ago
- A primal-dual framework for distributed L1-regularized optimization ☆37 · Apr 18, 2016 · Updated 9 years ago
- UT Austin Machine Learning Group Latent Variable Modeling Toolkit ☆26 · Feb 2, 2012 · Updated 14 years ago
- LIBBLE by Parameter Server ☆17 · Sep 17, 2018 · Updated 7 years ago
- Distributed LDA, takes raw text as input and outputs topic word table. ☆16 · Apr 16, 2016 · Updated 9 years ago
- Optimizing database queries with array programming ☆20 · Sep 21, 2020 · Updated 5 years ago
- A fully adaptive, zero-tuning parameter manager that enables efficient distributed machine learning training ☆21 · Feb 23, 2023 · Updated 3 years ago
- Online Max-Margin Topic Models for Accurate and Fast Text Classification [release v0.1] ☆53 · Mar 26, 2016 · Updated 9 years ago
- based on the work of Harald Lang when at CWI ☆23 · Mar 2, 2020 · Updated 6 years ago
- implementation of dynamic wavelet matrix (tree) and static wavelet matrix ☆24 · Jun 4, 2019 · Updated 6 years ago
- Succinct C++ ☆24 · Sep 13, 2020 · Updated 5 years ago
- R functions for fitting latent factor models with internal computation in C/C++ ☆122 · Oct 28, 2017 · Updated 8 years ago
- An implementation of the TKDE paper "Bidding Machine: Learning to Bid for Directly Optimizing Profits in Display Advertising" ☆60 · Apr 3, 2018 · Updated 7 years ago
- ☆18 · Feb 22, 2018 · Updated 8 years ago
- ☆49 · Apr 17, 2018 · Updated 7 years ago
- Scalable Structural Index Constructor for JSON Analytics ☆27 · Oct 10, 2024 · Updated last year
- lecture notes for probabilistic topic models using IPython notebook ☆22 · Dec 16, 2014 · Updated 11 years ago
- Perseus is a set of scripts (docker+javascript) to investigate a distributed database's responsiveness when one of its three nodes is iso… ☆48 · Mar 29, 2019 · Updated 6 years ago
- Cross-lingual Dependency Parsing Based on Distributed Representations ☆20 · Mar 2, 2018 · Updated 8 years ago