cuMF / cumf_sgd
CUDA Matrix Factorization Library with Stochastic Gradient Descent (SGD)
☆71 Updated 7 years ago
Alternatives and similar repositories for cumf_sgd:
Users who are interested in cumf_sgd compare it to the libraries listed below.
- CUDA Matrix Factorization Library with Alternating Least Square (ALS) ☆176 Updated 6 years ago
- HogWild++: A New Mechanism for Decentralized Asynchronous Stochastic Gradient Descent ☆33 Updated 8 years ago
- Efficient LDA solution on GPUs. ☆24 Updated 6 years ago
- cache-friendly multithread matrix factorization ☆88 Updated 8 years ago
- GPU-specialized parameter server for GPU machine learning. ☆100 Updated 6 years ago
- CUDA implementation of k-means ☆23 Updated 11 years ago
- FRED simulator and associated paper ☆26 Updated 9 years ago
- Random Walk (Personalized PageRank) Algorithms for Large Graphs ☆73 Updated 8 years ago
- A platform for distributed optimization experiments using OpenMPI ☆20 Updated 7 years ago
- a heterogeneous multiGPU level-3 BLAS library ☆45 Updated 5 years ago
- A light-weight matrix factorization tool ☆39 Updated 7 years ago
- Machine Learning Toolkit for Extreme Scale (MaTEx) ☆111 Updated 6 years ago
- Training deep neural networks with low precision multiplications ☆63 Updated 9 years ago
- Cyclades ☆28 Updated 6 years ago
- LR and FM models solved by FTRL and SGD, parallelized on MPI ☆112 Updated 7 years ago
- SRS - Fast Approximate Nearest Neighbor Search in High Dimensional Euclidean Space With a Tiny Index ☆55 Updated 9 years ago
- An implementation of CF-NADE. Yin Zheng et al., "A Neural Autoregressive Approach to Collaborative Filtering", accepted by ICML 2016. ☆79 Updated 6 years ago
- Different implementation of sparse matrix multiplication. All matrices are in CSR format. The code contains different CUDA kernels for mu… ☆16 Updated 14 years ago
- MPI for Torch ☆60 Updated 7 years ago
- (Spring 2017) Assignment 2: GPU Executor ☆63 Updated 7 years ago
- Scalable Distributed LDA implementation for Spark & Glint ☆28 Updated 8 years ago
- ☆30 Updated 7 years ago
- Personalized PageRank (PPR) on GraphLab PowerGraph ☆15 Updated 8 years ago
- This repository contains the cuStinger data structure used for dynamic graph representation. ☆19 Updated 6 years ago
- A simple memory manager for CUDA designed to help Deep Learning frameworks manage memory ☆296 Updated 6 years ago
- ☆127 Updated 8 years ago
- cuda implementation of CBOW model (word2vec) ☆116 Updated 11 years ago
- High Efficiency Convolution Kernel for Maxwell GPU Architecture ☆134 Updated 7 years ago
- GPU/CPU (CUDA) Implementation of "Recurrent Memory Array Structures", Simple RNN, LSTM, Array LSTM.. ☆25 Updated 4 years ago
- Light-weight GPU kernel interface for graph operations ☆15 Updated 4 years ago