amd / amd-fftw
FFTW code optimized for AMD based processors
☆52Updated 6 months ago
Alternatives and similar repositories for amd-fftw:
Users that are interested in amd-fftw are comparing it to the libraries listed below
- ☆13Updated last week
- Compiler agnostic metaprogramming library providing concepts, type operations and tuples for C++ and cuda☆86Updated this week
- DEPRECATED. This Scalapck repository is deprecated. The last version in this repository is 3.0. Refer to "aocl-scalapack" repository unde…☆9Updated 4 years ago
- Partitioned Global Address Space (PGAS) library for distributed arrays☆102Updated last week
- DLA-Future☆71Updated last week
- An implementation of HIP that works on CPUs, across OSes.☆116Updated last year
- nvptx-tools: a collection of tools for use with nvptx-none GCC toolchains.☆50Updated 7 months ago
- BLAS-like Library Instantiation Software Framework☆138Updated 3 weeks ago
- Next generation FFT implementation for ROCm☆191Updated this week
- Kokkos C++ Performance Portability Programming Ecosystem: Profiling and Debugging Tools☆121Updated 3 months ago
- Counter-based random number generators for C, C++ and CUDA.☆96Updated last year
- Sparse 3D FFT library with MPI, OpenMP, CUDA and ROCm support☆53Updated last month
- Copy-hiding array abstraction to automatically migrate data between memory spaces☆107Updated 2 weeks ago
- AOCL-LibM☆112Updated 2 weeks ago
- Asynchronous Task and Memory Interface, or ATMI, is a runtime framework and programming model for heterogeneous CPU-GPU systems. It provi…☆66Updated last year
- Autonomic Performance Environment for eXascale (APEX)☆46Updated last week
- AMD optimized Sparse Linear Algebra library☆29Updated 2 weeks ago
- SYCL Reference Manual☆27Updated 11 months ago
- HIPCC: HIP compiler driver☆41Updated 11 months ago
- Sample configuration files for using oneAPI in CI systems☆99Updated last week
- Next generation LAPACK implementation for ROCm platform☆100Updated this week
- Livermore Unstructured Lagrangian Explicit Shock Hydrodynamics (LULESH)☆107Updated last year
- BLAS++ is a C++ wrapper around CPU and GPU BLAS (basic linear algebra subroutines), developed as part of the SLATE project.☆78Updated 2 weeks ago
- Reusable software components for ROCm developers☆83Updated last week
- ROCm Thrust - run Thrust dependent software on AMD GPUs☆107Updated this week
- Next generation library for iterative sparse solvers for ROCm platform☆79Updated last week
- Codeplay project for contributions to the LLVM SYCL implementation☆30Updated 4 years ago
- FFT software benchmarks☆22Updated last month
- hipFFT is a FFT marshalling library.☆63Updated this week
- OpenSHMEM Application Programming Interface☆54Updated 5 months ago