drufat / cuda-examples
A few cuda examples built with cmake
☆23Updated 5 years ago
Related projects: ⓘ
- Launching collective tasks in bulk☆36Updated 4 years ago
- Full-speed Array of Structures access☆155Updated last year
- Example of how to use CUDA with CMake >= 3.8☆69Updated last year
- Multi-dimensional array programming framework for C++ and multi-GPU CUDA applications☆28Updated 7 years ago
- ☆42Updated 6 years ago
- Corrected source for the OpenCL in Action book (work in progress)☆60Updated 11 years ago
- ☆117Updated 11 years ago
- a heterogeneous multiGPU level-3 BLAS library☆45Updated 4 years ago
- a software library containing Sparse functions written in OpenCL☆173Updated 4 years ago
- A portable high-level API with CUDA or OpenCL back-end☆53Updated 6 years ago
- Conjugate Gradient for Least Squares in CUDA☆51Updated 9 years ago
- Utilities for CUDA programming☆39Updated 5 years ago
- A machine vision library written in SYCL and C++ that shows performance-portable implementation of graph algorithms☆160Updated 5 months ago
- CMake Examples (CMake, CMake+CUDA, CMake+CUDA+PandaRoot)☆41Updated 11 years ago
- CUSP : A C++ Templated Sparse Matrix Library☆400Updated 8 months ago
- Bitonic Sort for C and CUDA☆14Updated 5 years ago
- Efficient CUDA Stream Compaction Library☆33Updated last year
- A matrix and array operation library on GPU with Eigen compatible interface☆97Updated 6 years ago
- C++ convenience classes to be used with CUDA code, for both the host and the kerlel parts.☆55Updated 5 years ago
- CLTune: An automatic OpenCL & CUDA kernel tuner☆167Updated last year
- Some C++ codes for computing a 1D and 2D convolution product using the FFT implemented with the GSL or FFTW☆57Updated 11 years ago
- Automatically exported from code.google.com/p/opencl-book-samples☆159Updated 5 years ago
- CUDA FFT convolution☆14Updated 9 years ago
- Code appendix to an OpenCL matrix-multiplication tutorial☆160Updated 7 years ago
- GPU implementation of classical molecular dynamics proxy application.☆29Updated 7 years ago
- Set of guidelines for porting OpenCL™ C to OpenCL C++☆40Updated 7 years ago
- GPU Matrix Library - A CUDA-based C++ wrapper and syntax sugars for NVIDIA CUBLAS☆27Updated 8 years ago
- CMake module collection☆29Updated 9 years ago
- Library for fast image convolution in neural networks on Intel Architecture☆29Updated 7 years ago
- Fork of magma to include more BLAS☆28Updated 7 years ago