sol-prog / cuda_cublas_curand_thrust
☆22Updated 12 years ago
Alternatives and similar repositories for cuda_cublas_curand_thrust:
Users that are interested in cuda_cublas_curand_thrust are comparing it to the libraries listed below
- sparse matrix pre-processing library☆81Updated 10 months ago
- CUDA Tensor Transpose (cuTT) library☆51Updated 7 years ago
- ulmBLAS☆105Updated 2 years ago
- A few cuda examples built with cmake☆23Updated 5 years ago
- A simple memory manager for CUDA designed to help Deep Learning frameworks manage memory☆297Updated 6 years ago
- A CUDNN minimal deep learning training code sample using LeNet.☆264Updated last year
- Multi-dimensional array programming framework for C++ and multi-GPU CUDA applications☆28Updated 8 years ago
- a heterogeneous multiGPU level-3 BLAS library☆45Updated 5 years ago
- Full-speed Array of Structures access☆167Updated last year
- CLTune: An automatic OpenCL & CUDA kernel tuner☆177Updated 2 years ago
- a software library containing Sparse functions written in OpenCL☆174Updated 5 years ago
- Contains sources related to the lectures and labs for the NVIDIA OpenACC course.☆51Updated 5 years ago
- Example python (numpy) -- CUDA installable package with a C-extension library☆142Updated 5 years ago
- CUDA and OpenMP implementations of C2R/R2C inplace transposition☆46Updated 10 years ago
- Experimental Linear Algebra Performance Studies☆12Updated 8 years ago
- Developer repository for ViennaCL. Visit http://viennacl.sourceforge.net/ for the latest releases.☆286Updated 3 years ago
- MWE for using the Eigen library in CUDA kernels☆119Updated 2 years ago
- kmeans☆54Updated 8 years ago
- CUDA C++ package for Sublime Text 2 & 3☆67Updated 7 years ago
- The SparseX sparse kernel optimization library☆40Updated 6 years ago
- A fork of Eigen 3.2 to use MAGMA (GPU & CPU) as backend in the same way it does with Intel MKL.☆48Updated 11 years ago
- Sparse matrix computation library for GPU☆54Updated 4 years ago
- CUSP : A C++ Templated Sparse Matrix Library☆411Updated 4 months ago
- Multi-GPU Computing Benchmark Suite (CUDA)☆42Updated 7 years ago
- Code appendix to an OpenCL matrix-multiplication tutorial☆165Updated 8 years ago
- High-Performance Tensor Transpose library☆191Updated last year
- Fork of magma to include more BLAS☆28Updated 8 years ago
- Introductory Thrust workshop materials☆43Updated 12 years ago
- A Light-weight and Fast Template Matrix Library☆132Updated 12 years ago
- Python wrappers for the NVIDIA cuDNN libraries☆140Updated 7 years ago