cuMF / culda_cgs
Efficient LDA solution on GPUs.
☆24 · Updated 6 years ago
Alternatives and similar repositories for culda_cgs
Users who are interested in culda_cgs are comparing it to the libraries listed below.
- CUDA Matrix Factorization Library with Stochastic Gradient Descent (SGD) ☆72 · Updated 7 years ago
- To Index or Not to Index: Optimizing Exact Maximum Inner Product Search ☆26 · Updated 6 years ago
- HogWild++: A New Mechanism for Decentralized Asynchronous Stochastic Gradient Descent ☆33 · Updated 8 years ago
- Distributed NMF/NTF Library ☆47 · Updated 8 months ago
- Light-weight GPU kernel interface for graph operations ☆15 · Updated 5 years ago
- Nonnegative matrix factorizations in MapReduce ☆24 · Updated 10 years ago
- ☆14 · Updated 8 years ago
- Word embedding via tensor decomposition. ☆23 · Updated 7 years ago
- Bayesian Poisson Tucker decomposition ☆17 · Updated 8 years ago
- High Dimensional Approximate Near(est) Neighbor ☆33 · Updated 7 years ago
- Metric Learning to Rank ☆46 · Updated 12 years ago
- Sketching linear classifiers over data streams with the Weight-Median Sketch (SIGMOD 2018). ☆39 · Updated 7 years ago
- SRS - Fast Approximate Nearest Neighbor Search in High Dimensional Euclidean Space With a Tiny Index ☆55 · Updated 10 years ago
- High-performance Non-negative Matrix Factorizations (NMF) - Python/C++ ☆49 · Updated 7 years ago
- Official repository of "Efficient and Effective Query Expansion for Web Search", Short Paper @ CIKM 2018 ☆15 · Updated 5 years ago
- Machine Learning Toolkit for Extreme Scale (MaTEx) ☆110 · Updated 6 years ago
- Implementation of PCA algorithm using Gram-Schmidt modification on NIPALS ☆10 · Updated 10 years ago
- Distributed learning with mpi4py ☆48 · Updated 6 years ago
- Run large scale tensor and coupled matrix-tensor factorization on top of stock Hadoop. ☆18 · Updated 7 years ago
- A Tool for Automatic Parallelization of Deep Learning Training in Distributed Multi-GPU Environments. ☆132 · Updated 3 years ago
- Code of the paper "Enhancing Network Embedding with Auxiliary Information: An Explicit Matrix Factorization Perspective" ☆21 · Updated 7 years ago
- Large Scale Graphical Model ☆24 · Updated 6 years ago
- MXNet - nGraph integration ☆34 · Updated 3 years ago
- Implementation of Monte Carlo Word Movers Distance in Python with TensorFlow ☆13 · Updated 8 years ago
- GPU/CPU (CUDA) Implementation of "Recurrent Memory Array Structures", Simple RNN, LSTM, Array LSTM.. ☆25 · Updated 5 years ago
- Code repo for "Transformer on a Diet" paper ☆31 · Updated 5 years ago
- Generating Families of Practical Fast Matrix Multiplication Algorithms ☆12 · Updated 8 years ago
- PyTorch Flexible Hash Embeddings ☆28 · Updated 5 years ago
- Scalable Distributed LDA implementation for Spark & Glint ☆29 · Updated 8 years ago
- CUDA Matrix Factorization Library with Alternating Least Square (ALS) ☆180 · Updated 6 years ago