cuMF / culda_cgsLinks
Efficient LDA solution on GPUs.
☆24Updated 7 years ago
Alternatives and similar repositories for culda_cgs
Users that are interested in culda_cgs are comparing it to the libraries listed below
Sorting:
- CUDA Matrix Factorization Library with Stochastic Gradient Descent (SGD)☆71Updated 8 years ago
- Distributed NMF/NTF Library☆51Updated last year
- To Index or Not to Index: Optimizing Exact Maximum Inner Product Search☆27Updated 6 years ago
- HogWild++: A New Mechanism for Decentralized Asynchronous Stochastic Gradient Descent☆33Updated 9 years ago
- GPU/CPU (CUDA) Implementation of "Recurrent Memory Array Structures", Simple RNN, LSTM, Array LSTM..☆26Updated 5 years ago
- Machine Learning Toolkit for Extreme Scale (MaTEx)☆111Updated 7 years ago
- A study of the downstream instability of word embeddings☆12Updated 3 years ago
- MXNet - nGraph integration☆34Updated 4 years ago
- Slides/code for the Lisbon machine learning school 2017☆28Updated 8 years ago
- Bayesian Poisson Tucker decomposition☆17Updated 8 years ago
- Word embedding via tensor decomposition.☆23Updated 7 years ago
- C++ code for "A Faster Drop-in Implementation for Leaf-wise Exact Greedy Induction of Decision Tree Using Pre-sorted Deque"☆36Updated 2 years ago
- High-performance Non-negative Matrix Factorizations (NMF) - Python/C++☆49Updated 7 years ago
- Nonnegative matrix factorizations in MapReduce☆24Updated 11 years ago
- Rectified Factor Networks☆37Updated 6 years ago
- Code repo for "Transformer on a Diet" paper☆31Updated 5 years ago
- ☆15Updated 3 years ago
- Code of the paper "Enhancing Network Embedding with Auxiliary Information: An Explicit Matrix Factorization Perspective"☆21Updated 7 years ago
- Manuscript and code for the paper "Gradient Energy Matching for Distributed Asynchronous Gradient Descent".☆19Updated 7 years ago
- MPI Parallel framework for training deep learning models built in Theano☆54Updated 8 years ago
- Hash Embedding code for the paper "Hash Embeddings for Efficient Word Representations"☆42Updated 8 years ago
- PyTorch Flexible Hash Embeddings☆28Updated 6 years ago
- maximum inner product tree☆26Updated 13 years ago
- Theano implementation of GloVe for graphs☆47Updated 10 years ago
- A Tool for Automatic Parallelization of Deep Learning Training in Distributed Multi-GPU Environments.☆132Updated 3 years ago
- Randomized SVD of large sparse matrices on Spark☆77Updated 3 years ago
- ☆14Updated 9 years ago
- a very fast parser for sparse matrix at libsvm format☆10Updated 8 years ago
- Run large scale tensor and coupled matrix-tensor factorization on top of stock Hadoop.☆18Updated 8 years ago
- Scalable Distributed LDA implementation for Spark & Glint☆29Updated 9 years ago