sol-prog / cuda_cublas_curand_thrust
☆22 Updated 12 years ago
Related projects
Alternatives and complementary repositories for cuda_cublas_curand_thrust
- CUDA Tensor Transpose (cuTT) library ☆50 Updated 7 years ago
- A CUDNN minimal deep learning training code sample using LeNet. ☆263 Updated last year
- CUDA C++ package for Sublime Text 2 & 3 ☆67 Updated 6 years ago
- kmeans ☆53 Updated 8 years ago
- CLTune: An automatic OpenCL & CUDA kernel tuner ☆170 Updated last year
- ulmBLAS ☆104 Updated 2 years ago
- Full-speed Array of Structures access ☆162 Updated last year
- a software library containing Sparse functions written in OpenCL ☆173 Updated 4 years ago
- sparse matrix pre-processing library ☆81 Updated 6 months ago
- a heterogeneous multiGPU level-3 BLAS library ☆45 Updated 4 years ago
- High-Performance Tensor Transpose library ☆185 Updated last year
- A simple memory manager for CUDA designed to help Deep Learning frameworks manage memory ☆291 Updated 5 years ago
- Code appendix to an OpenCL matrix-multiplication tutorial ☆164 Updated 7 years ago
- CUSP : A C++ Templated Sparse Matrix Library ☆404 Updated 2 weeks ago
- The SHOC Benchmark Suite ☆247 Updated 2 years ago
- Fork of magma to include more BLAS ☆28 Updated 8 years ago
- Multi-GPU Computing Benchmark Suite (CUDA) ☆42 Updated 7 years ago
- Range-based for loops to iterate over a range of numbers or values ☆35 Updated 8 years ago
- Online CUDA Occupancy Calculator ☆66 Updated 3 years ago
- Kernel Tuning Toolkit ☆55 Updated 3 weeks ago
- A GPU benchmark suite for assessing on-chip GPU memory bandwidth ☆99 Updated 7 years ago
- Source code from NVIDIA CUDACasts ☆48 Updated 10 years ago
- Experimental Linear Algebra Performance Studies ☆12 Updated 7 years ago
- Developer repository for ViennaCL. Visit http://viennacl.sourceforge.net/ for the latest releases. ☆282 Updated 3 years ago
- CUDA and OpenMP implementations of C2R/R2C inplace transposition ☆45 Updated 9 years ago
- GPU Eigensolver for symmetric/hermitian matrices. ☆64 Updated 3 years ago
- Multi-dimensional array programming framework for C++ and multi-GPU CUDA applications ☆28 Updated 7 years ago
- Greentea LibDNN - a universal convolution implementation supporting CUDA and OpenCL ☆135 Updated 7 years ago
- Simple utilities to enable code reuse and portability between CUDA C/C++ and standard C/C++. ☆343 Updated 2 years ago
- Source code that accompanies The CUDA Handbook. ☆497 Updated last week