MaxKotlan / Cuda-Find-Max-Using-Parallel-ReductionLinks
Benchmarks different cuda implementations of maximum-value algorithms. Uses Parallel Reduction design pattern.
☆18Updated 5 years ago
Alternatives and similar repositories for Cuda-Find-Max-Using-Parallel-Reduction
Users that are interested in Cuda-Find-Max-Using-Parallel-Reduction are comparing it to the libraries listed below
Sorting:
- GPU-accelerated KD-tree implementation☆46Updated 4 years ago
- Source code examples from the Parallel Forall Blog☆96Updated 6 years ago
- Code samples for the CUDA tutorial "CUDA and Applications to Task-based Programming"☆95Updated 2 years ago
- ☆280Updated 3 months ago
- A gpu based implementation of a K-D Tree Builder☆118Updated 6 years ago
- A Connected Component Labelling algorithm implemented in CUDA☆49Updated 4 years ago
- GPU acceleration of smallpt with CUDA. Obtain an acceleration of >35x comparing to the original CPU-parallelized code with OpenMP☆48Updated 5 years ago
- MWE for using the Eigen library in CUDA kernels☆121Updated 3 years ago
- The CMake version of cuda_by_example☆148Updated 5 years ago
- C++ library for reading and writing of numpy's .npy files☆424Updated last year
- An expression template based linear algebra library running completely on the GPU using CUDA☆25Updated 4 years ago
- Conjugate Gradient solver written in CUDA☆28Updated 6 years ago
- Conjugate Gradient for Least Squares in CUDA☆53Updated 10 years ago
- μ-Cuda, COVER THE LAST MILE OF CUDA. With features: intellisense-friendly, structured launch, automatic cuda graph generation and updatin…☆205Updated last month
- Source code repository for the projects from CUDA for Engineers☆134Updated 4 years ago
- CUDA based Iterative Closest Point Algorithm Implementation☆70Updated 6 years ago
- InfiniTAM on FPGA☆26Updated 6 years ago
- BGHT: High-performance static GPU hash tables.☆71Updated 7 months ago
- C++ implementation of sparse matrix using CRS (Compressed Row Storage) format☆125Updated 5 years ago
- Source Code for `Today’s TBB: C++ Parallel Programming with Threading Building Blocks, Second Edition' by Michael Voss and James Reinder…☆199Updated 8 months ago
- 离散几何处理中文版翻译☆62Updated 3 years ago
- Compute the exact Euclidean Distance Transform and Voronoi Diagram for 2D and 3D binary images using the GPU.☆80Updated 5 years ago
- CUDA C implementation of Principal Component Analysis (PCA) through Singular Value Decomposition (SVD) using a highly parallelisable vers…☆28Updated 6 years ago
- A* implementation for NVIDIA GPU☆75Updated 5 years ago
- A sample code for sparse cholesky solver with cuSPARSE and cuSOLVER library☆21Updated 6 years ago
- A C++/CUDA library to efficiently compute neighborhood information on the GPU for 3D point clouds within a fixed radius.☆109Updated last year
- a header-only c++ numeric library☆42Updated last year
- Eigen3.3.7 Doc中文版☆132Updated 6 years ago
- a simple C++ library to import/export Stanford PLY data.☆22Updated last year
- the GPU implementation of bucket based farthest point sampling, achieves 3-4x speedup than the conventional implementation☆21Updated 2 years ago