aramadia / udacity-cs344
Parallel Programming
☆28Updated 11 years ago
Related projects ⓘ
Alternatives and complementary repositories for udacity-cs344
- Kernel Fusion and Runtime Compilation Based on NNVM☆69Updated 7 years ago
- CUDA Tensor Transpose (cuTT) library☆50Updated 7 years ago
- Full-speed Array of Structures access☆160Updated last year
- Online CUDA Occupancy Calculator☆66Updated 3 years ago
- kmeans clustering with multi-GPU capabilities☆116Updated last year
- A simple memory manager for CUDA designed to help Deep Learning frameworks manage memory☆291Updated 5 years ago
- Some CUDA design patterns and a bit of template magic for CUDA☆146Updated last year
- CNN accelerated by cuda. Test on mnist and finilly get 99.76%☆184Updated 7 years ago
- A GPU benchmark suite for assessing on-chip GPU memory bandwidth☆99Updated 7 years ago
- CUDA implementation of the fundamental sum reduce operation. Aims to be as optimized as reasonable.☆35Updated 7 years ago
- Subpart source code of of deepcore v0.7☆27Updated 4 years ago
- Multi-GPU Computing Benchmark Suite (CUDA)☆42Updated 7 years ago
- This is a tuned sparse matrix dense vector multiplication(SpMV) library☆21Updated 8 years ago
- A library to benchmark CUDA code, similar to google benchmark.☆28Updated 3 years ago
- ☆90Updated 7 years ago
- Third party assembler and GEMM library for NVIDIA Kepler GPU☆77Updated 5 years ago
- Python bindings for NVTX☆66Updated last year
- gossip: Efficient Communication Primitives for Multi-GPU Systems☆58Updated 2 years ago
- Efficient Top-K implementation on the GPU☆148Updated 5 years ago
- a heterogeneous multiGPU level-3 BLAS library☆45Updated 4 years ago
- Implementation of TSM2L and TSM2R -- High-Performance Tall-and-Skinny Matrix-Matrix Multiplication Algorithms for CUDA☆31Updated 4 years ago
- Sparse matrix computation library for GPU☆54Updated 4 years ago
- ☆14Updated 2 years ago
- CUDA FFT convolution☆14Updated 9 years ago
- ☆21Updated 7 years ago
- Source code that accompanies The CUDA Handbook.☆497Updated last week
- ulmBLAS