KAdamek / SMFFTLinks
fast Fourier transform on GPU in shared memory for AstroAccelerate project
☆26Updated 4 years ago
Alternatives and similar repositories for SMFFT
Users that are interested in SMFFT are comparing it to the libraries listed below
Sorting:
- Shared memory overlap-and-save method for NVIDIA GPUs using CUDA☆16Updated 2 years ago
- A Massively Parallel FFT Library for CPU/GPU☆56Updated 4 years ago
- Sparse 3D FFT library with MPI, OpenMP, CUDA and ROCm support☆53Updated 3 months ago
- QCD for Intel Xeon Phi and Xeon processors☆14Updated last year
- Next generation library for iterative sparse solvers for ROCm platform☆81Updated last week
- A C++ allocator based on cudaMallocManaged☆23Updated 6 years ago
- FFTX Project☆24Updated 2 weeks ago
- HiCMA: Hierarchical Computations on Manycore Architectures☆30Updated 2 years ago
- Kernel Tuning Toolkit☆59Updated 3 weeks ago
- A unified framework across multiple programming platforms☆38Updated last week
- A proxy app for the Monte Carlo Transport Code, Mercury. LLNL-CODE-684037☆41Updated last year
- ☆40Updated 4 years ago
- Subset of BLAS routines optimized for NVIDIA GPUs☆68Updated 2 years ago
- ☆15Updated 4 years ago
- Multi-dimensional array programming framework for C++ and multi-GPU CUDA applications☆28Updated 8 years ago
- Autonomic Performance Environment for eXascale (APEX)☆48Updated 2 weeks ago
- ☆39Updated last month
- PLASMA is a software package for solving problems in dense linear algebra using OpenMP☆30Updated last week
- SLATE is a distributed, GPU-accelerated, dense linear algebra library targetting current and upcoming high-performance computing (HPC) sy…☆120Updated last week
- Generate simple index ranges in C++ and CUDA C++☆39Updated last year
- Julia ports of the Rodinia benchmark suite for heterogeneous computing infrastructures☆50Updated last year
- sparse matrix pre-processing library☆82Updated last year
- MagmaDNN: a simple deep learning framework in c++☆49Updated 4 years ago
- CUDA implementation of the fundamental sum reduce operation. Aims to be as optimized as reasonable.☆37Updated 7 years ago
- DLA-Future☆74Updated 2 weeks ago
- The SparseX sparse kernel optimization library☆39Updated 6 years ago
- resources pour le cours d'introduction à la programmation des GPUs du mastère spécialisé HPC-AI☆22Updated last year
- ☆29Updated 2 weeks ago
- Distributed Communication-Optimal LU-factorization Algorithm☆12Updated 3 years ago
- Tensor Contraction Code Generator☆37Updated 7 years ago