hurdad / fftw-cufftw-benchmarkLinks
Benchmark for popular fft libaries - fftw | cufftw | cufft
☆18Updated 7 years ago
Alternatives and similar repositories for fftw-cufftw-benchmark
Users that are interested in fftw-cufftw-benchmark are comparing it to the libraries listed below
Sorting:
- ☆98Updated 9 years ago
- ☆43Updated 4 years ago
- A GPU benchmark suite for assessing on-chip GPU memory bandwidth☆109Updated 8 years ago
- ☆12Updated 3 years ago
- A Benchmark Suite for Heterogeneous System Computation☆55Updated 11 months ago
- ☆49Updated 5 years ago
- 🎃 GPU load-balancing library for regular and irregular computations.☆66Updated 5 months ago
- ☆66Updated last year
- Chai☆47Updated 2 months ago
- Tartan: Evaluating Modern GPU Interconnect via a Multi-GPU Benchmark Suite☆68Updated 7 years ago
- Implement asm gemm on vega64 for 4096x4096 fp32 matrix☆22Updated 6 years ago
- GPTPU for SC 2021☆52Updated 2 years ago
- ❤️ CUDA/C++ GPU graph analytics simplified.☆32Updated 3 years ago
- Matrix-Vector Multiplication Using Shared and Coalesced Memory Access☆16Updated 12 years ago
- Performance Prediction Toolkit☆56Updated 4 months ago
- Benchmark for measuring the performance of sparse and irregular memory access.☆82Updated 5 months ago
- CSR-based SpGEMM on nVidia and AMD GPUs☆46Updated 9 years ago
- Simple message passing library☆30Updated 7 years ago
- tools to create performance and roofline plots from measured data☆60Updated 11 years ago
- Short examples illustrating AVX2 intrinsics for simple tasks.☆98Updated last year
- A Deep Learning Framework customized for Sunway TaihuLight☆41Updated 7 years ago
- Multi-GPU Computing Benchmark Suite (CUDA)☆43Updated 8 years ago
- Multiple 1-stencil implementations using nvidia cuda.☆13Updated 8 years ago
- development repository for the open earth compiler☆82Updated 4 years ago
- Code for paper "Design Principles for Sparse Matrix Multiplication on the GPU" accepted to Euro-Par 2018☆73Updated 5 years ago
- Dissecting NVIDIA GPU Architecture☆116Updated 3 years ago
- HeteroSync is a benchmark suite for performing fine-grained synchronization on tightly coupled GPUs☆31Updated last year
- GPU Performance Advisor☆65Updated 3 years ago
- Performance Prediction Toolkit for GPUs☆39Updated 3 years ago
- GPU Code optimizer for stencil computations. Refer to our IPDPS'19 paper for more details☆25Updated 6 years ago