mvandermerwe / BP-GPU-Message-Scheduling
Code for "Message Scheduling for Performant, Many-Core Belief Propagation"
☆10Updated 5 years ago
Related projects
Alternatives and complementary repositories for BP-GPU-Message-Scheduling
- Some CUDA design patterns and a bit of template magic for CUDA☆146Updated last year
- EGGS, a method to speed up sparse matrix operations when the same sparsity pattern is reused multiple times. This repo contains examples that s…☆25Updated 4 years ago
- Sparse-dense matrix-matrix multiplication on GPUs☆14Updated 6 years ago
- Escoin: Efficient Sparse Convolutional Neural Network Inference on GPUs☆15Updated 5 years ago
- Repository holding the code base to AC-SpGEMM : "Adaptive Sparse Matrix-Matrix Multiplication on the GPU"☆28Updated 4 years ago
- Implementation of the Conjugate Gradient method using C and NVIDIA CUDA☆47Updated 2 years ago
- CNNs in Halide☆23Updated 9 years ago
- This example builds on the parallel-forall repo separate compilation example by adding CMake to it.☆17Updated 7 years ago
- CUDA-accelerated minimum spanning tree algorithm -- data parallel Boruvka's algorithm☆19Updated 8 years ago
- A tuned sparse matrix-dense vector multiplication (SpMV) library☆21Updated 8 years ago
- Implementation of the maximum network flow problem in CUDA.☆28Updated 3 years ago
- GPU Accelerated Subsampled Newton Method for Convex Optimization☆8Updated 6 years ago
- Code examples for the CUDA workshop☆36Updated 2 years ago
- Introduction to CUDA programming☆113Updated 7 years ago
- CUDA C implementation of Principal Component Analysis (PCA) through Singular Value Decomposition (SVD) using a highly parallelisable vers…☆26Updated 5 years ago
- Efficient CUDA Stream Compaction Library☆33Updated last year
- CUDA Sparse-Matrix Vector Multiplication, using Sliced Coordinate format☆20Updated 6 years ago
- A sample code for sparse cholesky solver with cuSPARSE and cuSOLVER library☆18Updated 4 years ago
- The Surprisingly ParalleL spArse Tensor Toolkit.☆69Updated 2 years ago
- BGHT: High-performance static GPU hash tables.☆55Updated 2 months ago
- Test winograd convolution written in TVM for CUDA and AMDGPU☆40Updated 6 years ago
- CUDA implementation of exclusive prefix sum via Blelloch's algorithm☆25Updated 7 years ago
- flexible-gemm conv of deepcore☆17Updated 4 years ago
- Experimental Python CFFI interface to NVIDIA's cuSOLVER and cuSPARSE libraries.☆13Updated 4 years ago
- Conjugate Gradient for Least Squares in CUDA☆51Updated 9 years ago
- CUSP : A C++ Templated Sparse Matrix Library☆404Updated 2 weeks ago
- Efficient graph clustering software for normalized cut and ratio association on undirected graphs. Copyright(c) 2008 Brian Kulis, Yuqiang…☆22Updated 12 years ago
- CUDA templates for tile-sparse matrix multiplication based on CUTLASS.☆49Updated 6 years ago