cuMF / cumf_sgd
CUDA Matrix Factorization Library with Stochastic Gradient Descent (SGD)
☆71Updated 6 years ago
Related projects: ⓘ
- CUDA Matrix Factorization Library with Alternating Least Square (ALS)☆173Updated 6 years ago
- HogWild++: A New Mechanism for Decentralized Asynchronous Stochastic Gradient Descent☆33Updated 7 years ago
- Efficient LDA solution on GPUs.☆24Updated 6 years ago
- Different implementation of sparse matrix multiplication. All matrices are in CSR format. The code contains different CUDA kernels for mu…☆16Updated 13 years ago
- FRED simulator and associated paper☆26Updated 8 years ago
- GPU-specialized parameter server for GPU machine learning.☆100Updated 6 years ago
- CUDA implementation of k-means☆22Updated 10 years ago
- cache-friendly multithread matrix factorization☆87Updated 8 years ago
- Machine Learning Toolkit for Extreme Scale (MaTEx)☆111Updated 6 years ago
- Light-weight GPU kernel interface for graph operations☆15Updated 4 years ago
- A platform for distributed optimization expriments using OpenMPI☆20Updated 6 years ago
- image to column☆31Updated 10 years ago
- a heterogeneous multiGPU level-3 BLAS library☆45Updated 4 years ago
- Cyclades☆28Updated 6 years ago
- Kernel Fusion and Runtime Compilation Based on NNVM☆69Updated 7 years ago
- ☆30Updated 7 years ago
- Training deep neural networks with low precision multiplications☆63Updated 9 years ago
- MPI for Torch☆61Updated 7 years ago
- Random Walk (Personalized PageRank) Algorithms for Large Graphs☆73Updated 8 years ago
- Some tensorflow examples☆19Updated 6 years ago
- The Surprisingly ParalleL spArse Tensor Toolkit.☆68Updated 2 years ago
- Implementation of fast exact k-means algorithms☆47Updated 4 years ago
- SRS - Fast Approximate Nearest Neighbor Search in High Dimensional Euclidean Space With a Tiny Index☆54Updated 9 years ago
- (Spring 2017) Assignment 2: GPU Executor☆63Updated 7 years ago
- Implementations of various parallel algorithms for matrix factorization (including DSGD++)☆14Updated 7 years ago
- Scalable Distributed LDA implementation for Spark & Glint☆28Updated 7 years ago
- PMLS-Caffe: Distributed Deep Learning Framework for Parallel ML System☆194Updated 6 years ago
- LSH-GPU ANN package☆91Updated 5 years ago
- ☆62Updated this week
- DeepWalk implementation in C++☆100Updated 3 months ago