KAdamek / SMFFTLinks
fast Fourier transform on GPU in shared memory for AstroAccelerate project
☆27Updated 5 years ago
Alternatives and similar repositories for SMFFT
Users that are interested in SMFFT are comparing it to the libraries listed below
Sorting:
- Subset of BLAS routines optimized for NVIDIA GPUs☆76Updated 2 years ago
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆198Updated this week
- bhSPARSE: A Sparse BLAS Library☆17Updated 10 years ago
- Kernel Tuning Toolkit☆65Updated last month
- Archived implementation of BLAS using the SYCL open standard. See oneMath for a replacement.☆260Updated 11 months ago
- TAU Performance System Public Mirror (Updated every night at midnight, USA Pacific Time)☆47Updated this week
- BLAS implementation for Intel FPGA☆78Updated 5 years ago
- Next generation library for iterative sparse solvers for ROCm platform☆93Updated last month
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆130Updated 2 weeks ago
- The Surprisingly ParalleL spArse Tensor Toolkit.☆73Updated 3 years ago
- Intel Data Parallel C++ (and SYCL 2020) Tutorial.☆95Updated 4 years ago
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆124Updated 2 weeks ago
- Multiple 1-stencil implementations using nvidia cuda.☆13Updated 8 years ago
- A 128 bit unsigned integer class for CUDA☆46Updated last year
- YASK--Yet Another Stencil Kit: a domain-specific language and framework to create high-performance stencil code for implementing finite-d…☆111Updated 5 months ago
- Autonomic Performance Environment for eXascale (APEX)☆50Updated 5 months ago
- ☆48Updated 5 years ago
- A C++ allocator based on cudaMallocManaged☆23Updated 7 years ago
- CSR-based SpGEMM on nVidia and AMD GPUs☆46Updated 9 years ago
- RAJA Performance Suite☆128Updated this week
- The Combinatorial BLAS (CombBLAS) is an extensible distributed-memory parallel graph library offering a small but powerful set of linear …☆80Updated 5 months ago
- The SparseX sparse kernel optimization library☆43Updated 6 years ago
- portDNN is a library implementing neural network algorithms written using SYCL☆113Updated last year
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆179Updated this week
- Full-speed Array of Structures access☆176Updated 2 years ago
- Fork of magma to include more BLAS☆28Updated 9 years ago
- Library to plot integer sets and maps☆53Updated 9 years ago
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆115Updated this week
- Loop Kernel Analysis and Performance Modeling Toolkit☆96Updated 9 months ago
- a heterogeneous multiGPU level-3 BLAS library☆46Updated 6 years ago