mvandermerwe / BP-GPU-Message-SchedulingLinks
Code for "Message Scheduling for Performant, Many-Core Belief Propagation"
☆11Updated 5 years ago
Alternatives and similar repositories for BP-GPU-Message-Scheduling
Users that are interested in BP-GPU-Message-Scheduling are comparing it to the libraries listed below
Sorting:
- ☆44Updated 7 years ago
- Fast K-Nearest Neighbor search with GPU☆141Updated 7 years ago
- Some CUDA design patterns and a bit of template magic for CUDA☆154Updated 2 years ago
- CUDA Sparse-Matrix Vector Multiplication, using Sliced Coordinate format☆22Updated 6 years ago
- EGGS, a method to speed up sparse matrix operations when the same sparsity is used for multiple times. This repo contains examples that s…☆25Updated 4 years ago
- BGHT: High-performance static GPU hash tables.☆65Updated last month
- Efficient CUDA Stream Compaction Library☆33Updated last year
- gSLIC is an GPU implementation of Simple Iterative Linear Clustering (SLIC) superpixel segmentation algorithm.☆19Updated 11 years ago
- Sparse-dense matrix-matrix multiplication on GPUs☆14Updated 6 years ago
- Introduction to CUDA programming☆118Updated 8 years ago
- CNNs in Halide☆23Updated 9 years ago
- A GPU algorithm for sparse matrix-matrix multiplication☆70Updated 4 years ago
- ☆20Updated 6 years ago
- Repository holding the code base to AC-SpGEMM : "Adaptive Sparse Matrix-Matrix Multiplication on the GPU"☆28Updated 4 years ago
- TMAC: A Toolbox of Modern Async-Parallel, Coordinate, Splitting, and Stochastic Methods☆48Updated 8 years ago
- This example builds on the parallel-forall repo separate compilation example by adding CMake to it.☆17Updated 7 years ago
- A warp-oriented dynamic hash table for GPUs☆73Updated last year
- Example code used in the CVPR 2015 tutorial☆40Updated 9 years ago
- ☆91Updated 8 years ago
- This is a tuned sparse matrix dense vector multiplication(SpMV) library☆21Updated 9 years ago
- ☆22Updated 7 years ago
- This is a PyTorch implementation of the Scalpel. Node pruning for five benchmark networks and SIMD-aware weight pruning for LeNet-300-100…☆41Updated 6 years ago
- Code for paper "Design Principles for Sparse Matrix Multiplication on the GPU" accepted to Euro-Par 2018☆71Updated 4 years ago
- ☆66Updated 2 years ago
- Implementation of ConjugateGradients method using C and Nvidia CUDA☆51Updated 2 years ago
- A sample code for sparse cholesky solver with cuSPARSE and cuSOLVER library☆19Updated 5 years ago
- CUDA templates for tile-sparse matrix multiplication based on CUTLASS.☆51Updated 7 years ago
- GPU Accelerated Subsampled Newton Method for Convex Optimization☆8Updated 7 years ago
- Communication-Minimizing 2D Convolution in GPU Registers☆30Updated 11 years ago
- ☆22Updated 6 years ago