drufat / cuda-examples
A few CUDA examples built with CMake
☆23Updated 5 years ago
Alternatives and similar repositories for cuda-examples:
Users who are interested in cuda-examples are comparing it to the repositories listed below
- Launching collective tasks in bulk☆37Updated 5 years ago
- Multi-dimensional array programming framework for C++ and multi-GPU CUDA applications☆28Updated 8 years ago
- C++ code for computing 1D and 2D convolution products using the FFT, implemented with GSL or FFTW☆58Updated 11 years ago
- ☆44Updated 7 years ago
- a heterogeneous multiGPU level-3 BLAS library☆45Updated 5 years ago
- C++ convenience classes to be used with CUDA code, for both the host and the kernel parts.☆55Updated 6 years ago
- Example of how to use CUDA with CMake >= 3.8☆69Updated last year
- Set of guidelines for porting OpenCL™ C to OpenCL C++☆41Updated 7 years ago
- CUDA FFT convolution☆15Updated 10 years ago
- Full-speed Array of Structures access☆169Updated 2 years ago
- a software library containing Sparse functions written in OpenCL☆174Updated 5 years ago
- Source code repository for the projects from CUDA for Engineers☆130Updated 3 years ago
- A portable high-level API with CUDA or OpenCL back-end☆54Updated 7 years ago
- Bitonic Sort for C and CUDA☆16Updated 6 years ago
- CUSP: A C++ Templated Sparse Matrix Library☆411Updated 5 months ago
- A demonstration of speeding up a 1D convolution using SSE☆51Updated 8 years ago
- CMake Examples (CMake, CMake+CUDA, CMake+CUDA+PandaRoot)☆41Updated 11 years ago
- K-d tree implementation in C++☆59Updated 12 years ago
- Conjugate Gradient for Least Squares in CUDA☆52Updated 9 years ago
- Automatically exported from code.google.com/p/opencl-book-samples☆165Updated 5 years ago
- Library for fast image convolution in neural networks on Intel Architecture☆29Updated 7 years ago
- Source code from NVIDIA CUDACasts☆49Updated 10 years ago
- CMake module collection☆30Updated 10 years ago
- A class for performing principal component analysis using Eigen library☆30Updated 8 years ago
- A machine vision library written in SYCL and C++ that shows performance-portable implementation of graph algorithms☆161Updated last year
- an OpenCL based software library containing random number generation functions☆136Updated 3 years ago
- Utilities for CUDA programming☆40Updated 5 years ago
- Non-Negative Least Squares implementation for Eigen3☆37Updated 2 years ago
- ☆124Updated 12 years ago
- This example builds on the parallel-forall repo separate compilation example by adding CMake to it.☆17Updated 7 years ago