KAdamek / GPU_Overlap-and-save_convolutionLinks
Shared memory overlap-and-save method for NVIDIA GPUs using CUDA
☆17Updated 5 months ago
Alternatives and similar repositories for GPU_Overlap-and-save_convolution
Users that are interested in GPU_Overlap-and-save_convolution are comparing it to the libraries listed below
Sorting:
- fast Fourier transform on GPU in shared memory for AstroAccelerate project☆27Updated 5 years ago
- CUDA-based implementation for linear 1D, 2D and 3D FFT-Shift functions.☆22Updated 10 years ago
- choosing FFT library...☆164Updated 3 years ago
- ☆43Updated 4 years ago
- CUDA implementation of the fundamental sum reduce operation. Aims to be as optimized as reasonable.☆39Updated 8 years ago
- Example code for Intel AVX / AVX2 intrinsics.☆144Updated 2 years ago
- CUDA tool set for non-C++ languages that provides similar functionality like Thrust, with NVRTC at its core.☆59Updated 3 years ago
- Parallel selection on GPUs☆15Updated 4 years ago
- A GPU based FX correlator for radio astronomy☆40Updated 7 years ago
- Some C++ codes for computing a 1D and 2D convolution product using the FFT implemented with the GSL or FFTW☆60Updated 12 years ago
- Simple OpenCL Samples that Build with Khronos Headers and Libs☆120Updated this week
- Simple OpenCL examples for exploiting GPU computing☆227Updated last year
- A GPU benchmark suite for assessing on-chip GPU memory bandwidth☆109Updated 8 years ago
- Source code that accompanies The CUDA Handbook.☆566Updated 4 months ago
- Online CUDA Occupancy Calculator☆83Updated 4 years ago
- Short examples illustrating AVX2 intrinsics for simple tasks.☆98Updated last year
- ☆70Updated 11 years ago
- 🎃 GPU load-balancing library for regular and irregular computations.☆66Updated 5 months ago
- An implementation of parallel exclusive scan in CUDA☆65Updated 7 years ago
- THIS REPOSITORY HAS MOVED TO github.com/nvidia/cub, WHICH IS AUTOMATICALLY MIRRORED HERE.☆85Updated last year
- Learn OpenCL step by step.☆138Updated 3 years ago
- Subset of BLAS routines optimized for NVIDIA GPUs☆76Updated 2 years ago
- Agenium Scale vectorization library for CPUs and GPUs☆337Updated 4 years ago
- A GPU implementation of the Wavelet Transform☆83Updated 5 years ago
- BLISlab: A Sandbox for Optimizing GEMM☆555Updated 4 years ago
- gpuprec: Extended-Precision Libraries on GPUs☆39Updated 10 years ago
- CUDA official sample codes☆371Updated 10 years ago
- A GPU benchmark tool for evaluating GPUs and CPUs on mixed operational intensity kernels (CUDA, OpenCL, HIP, SYCL, OpenMP)☆448Updated last week
- ☆276Updated this week
- Examples for using SYCL on CUDA☆63Updated 5 months ago