KAdamek / SMFFT
Fast Fourier transform on the GPU in shared memory, for the AstroAccelerate project
☆27 Updated 5 years ago
Alternatives and similar repositories for SMFFT
Users who are interested in SMFFT are comparing it to the libraries listed below.
- BLAS implementation for Intel FPGA ☆77 Updated 5 years ago
- C++ Header-Only Library for High-Performance Tensor-Vector Multiplication ☆23 Updated last month
- [DEPRECATED] Moved to ROCm/rocm-libraries repo ☆198 Updated last week
- Intel Data Parallel C++ (and SYCL 2020) Tutorial. ☆95 Updated 4 years ago
- A 128 bit unsigned integer class for CUDA ☆46 Updated 11 months ago
- Autonomic Performance Environment for eXascale (APEX) ☆50 Updated 5 months ago
- A unified framework across multiple programming platforms ☆42 Updated 6 months ago
- YASK--Yet Another Stencil Kit: a domain-specific language and framework to create high-performance stencil code for implementing finite-d… ☆110 Updated 5 months ago
- CUDA accelerated(X) Multi-Precision library ☆92 Updated 9 years ago
- Loop Kernel Analysis and Performance Modeling Toolkit ☆96 Updated 9 months ago
- Subset of BLAS routines optimized for NVIDIA GPUs ☆74 Updated 2 years ago
- bhSPARSE: A Sparse BLAS Library ☆16 Updated 10 years ago
- A GPU-based LZSS compression algorithm, highly tuned for NVIDIA GPGPUs and for streaming data, leveraging the respective strengths of CPU… ☆37 Updated 10 years ago
- Full-speed Array of Structures access ☆176 Updated 2 years ago
- The SparseX sparse kernel optimization library ☆43 Updated 6 years ago
- A domain-specific language and compiler for image processing ☆77 Updated 4 years ago
- [DEPRECATED] Moved to ROCm/rocm-libraries repo ☆124 Updated last week
- Concurrent CPU-GPU Programming using Task Models ☆105 Updated 6 years ago
- portDNN is a library implementing neural network algorithms written using SYCL ☆113 Updated last year
- Archived implementation of BLAS using the SYCL open standard. See oneMath for a replacement. ☆260 Updated 11 months ago
- A C++ allocator based on cudaMallocManaged ☆23 Updated 7 years ago
- Next generation library for iterative sparse solvers for ROCm platform