amd / amd-fftw
FFTW code optimized for AMD based processors
☆54Updated last week
Alternatives and similar repositories for amd-fftw
Users that are interested in amd-fftw are comparing it to the libraries listed below
Sorting:
- High-performance object-based library for DLA computations☆42Updated last week
- DLA-Future☆73Updated last week
- Compiler agnostic metaprogramming library providing concepts, type operations and tuples for C++ and cuda☆87Updated last week
- nvptx-tools: a collection of tools for use with nvptx-none GCC toolchains.☆50Updated 8 months ago
- Codeplay project for contributions to the LLVM SYCL implementation☆30Updated 4 years ago
- Partitioned Global Address Space (PGAS) library for distributed arrays☆103Updated last week
- List all available information about all SYCL devices and platforms☆15Updated 4 years ago
- ☆13Updated last month
- hipFFT is a FFT marshalling library.☆63Updated last week
- SYCL Conformance Tests☆70Updated last week
- Kokkos C++ Performance Portability Programming Ecosystem: Profiling and Debugging Tools☆123Updated 2 weeks ago
- ROCm Thrust - run Thrust dependent software on AMD GPUs☆108Updated last week
- Next generation FFT implementation for ROCm☆192Updated this week
- Sparse 3D FFT library with MPI, OpenMP, CUDA and ROCm support☆53Updated 2 months ago
- DEPRECATED. This Scalapck repository is deprecated. The last version in this repository is 3.0. Refer to "aocl-scalapack" repository unde…☆9Updated 4 years ago
- Copy-hiding array abstraction to automatically migrate data between memory spaces☆107Updated last week
- Department of Energy Standard Utility Library☆31Updated 3 weeks ago
- BLAS++ is a C++ wrapper around CPU and GPU BLAS (basic linear algebra subroutines), developed as part of the SLATE project.☆78Updated this week
- Counter-based random number generators for C, C++ and CUDA.☆98Updated last year
- SLATE is a distributed, GPU-accelerated, dense linear algebra library targetting current and upcoming high-performance computing (HPC) sy…☆117Updated this week
- Distributed View Extension for Kokkos☆45Updated 5 months ago
- AMD optimized Sparse Linear Algebra library☆29Updated last week
- FFT software benchmarks☆22Updated last month
- A shared-memory FFT for the Kokkos ecosystem☆35Updated last week
- ☆39Updated 3 weeks ago
- QCD for Intel Xeon Phi and Xeon processors☆14Updated last year
- Library for length agnostic SIMD intrinsic support and the corresponding math operations☆20Updated 3 years ago
- Basic Tensor Algebra Subroutines☆48Updated 3 weeks ago
- RAJA Performance Suite☆117Updated 2 weeks ago
- AOCL-LibM☆113Updated last week