mvandermerwe / BP-GPU-Message-Scheduling
Code for "Message Scheduling for Performant, Many-Core Belief Propagation"
☆10 Updated 5 years ago
Related projects
Alternatives and complementary repositories for BP-GPU-Message-Scheduling
- Efficient graph clustering software for normalized cut and ratio association on undirected graphs. Copyright(c) 2008 Brian Kulis, Yuqiang… ☆22 Updated 12 years ago
- ☆42 Updated 6 years ago
- CUDA Sparse-Matrix Vector Multiplication, using Sliced Coordinate format ☆20 Updated 6 years ago
- ☆21 Updated 7 years ago
- Implementation of the maximum network flow problem in CUDA. ☆27 Updated 3 years ago
- GPU accelerated first order primal-dual algorithm for solving convex optimization problems, and its application in maximum flow minimum c… ☆16 Updated 3 years ago
- Fast K-Nearest Neighbor search with GPU ☆141 Updated 7 years ago
- CUDA implementation of exclusive prefix sum via Blelloch's algorithm ☆25 Updated 7 years ago
- Repository holding the code base to AC-SpGEMM : "Adaptive Sparse Matrix-Matrix Multiplication on the GPU" ☆28 Updated 4 years ago
- A sample code for sparse cholesky solver with cuSPARSE and cuSOLVER library ☆18 Updated 4 years ago
- This example builds on the parallel-forall repo separate compilation example by adding CMake to it. ☆17 Updated 6 years ago
- Some CUDA design patterns and a bit of template magic for CUDA ☆146 Updated last year
- SuiteSparse: a suite of sparse matrix packages by @DrTimothyAldenDavis et al. with native CMake support ☆51 Updated 3 months ago
- Communication-Minimizing 2D Convolution in GPU Registers ☆30 Updated 11 years ago
- Conjugate Gradient for Least Squares in CUDA ☆51 Updated 9 years ago
- Sparse-dense matrix-matrix multiplication on GPUs ☆14 Updated 6 years ago
- GPU Accelerated Subsampled Newton Method for Convex Optimization ☆8 Updated 6 years ago
- A GPU/CUDA implementation of the Hungarian algorithm ☆107 Updated 5 years ago
- EGGS, a method to speed up sparse matrix operations when the same sparsity is used multiple times. This repo contains examples that s… ☆25 Updated 4 years ago
- Skeletonide is a parallel implementation of Zhang-Suen morphological thinning algorithm written in Halide-lang. Use it for fast skeletoni… ☆12 Updated 4 years ago
- A shallow fork of SuiteSparse adding build files for Visual Studio and support for ACML ☆100 Updated 9 years ago
- CNNs in Halide ☆23 Updated 9 years ago
- A fast and flexible convex optimization framework based on proximal splitting ☆40 Updated 3 years ago
- This is a cross-platform, CUDA-based C++ library for general-purpose, unconstrained nonlinear optimization on the GPU. It implements the … ☆133 Updated 4 years ago
- Computation using data flow graphs for scalable machine learning ☆24 Updated 6 years ago
- Progressive 3D Modeling All the Way ☆24 Updated 8 years ago
- Implementation of ConjugateGradients method using C and Nvidia CUDA ☆47 Updated 2 years ago
- Parallel Bundle Adjustment ☆65 Updated 5 years ago
- GPU-based large scale Approx. Nearest Neighbor Search, accepted at CVPR 2016 ☆91 Updated 6 years ago
- Fork of magma to include more BLAS ☆28 Updated 7 years ago