amd / amd-fftw
FFTW code optimized for AMD based processors
☆49Updated 3 months ago
Alternatives and similar repositories for amd-fftw:
Users that are interested in amd-fftw are comparing it to the libraries listed below
- DEPRECATED. This Scalapck repository is deprecated. The last version in this repository is 3.0. Refer to "aocl-scalapack" repository unde…☆9Updated 3 years ago
- nvptx-tools: a collection of tools for use with nvptx-none GCC toolchains.☆49Updated 4 months ago
- hipFFT is a FFT marshalling library.☆57Updated this week
- High-performance object-based library for DLA computations☆39Updated this week
- Compiler agnostic metaprogramming library providing concepts, type operations and tuples for C++ and cuda☆82Updated this week
- DLA-Future☆69Updated this week
- Kokkos C++ Performance Portability Programming Ecosystem: Profiling and Debugging Tools☆116Updated last month
- ☆13Updated 7 months ago
- Codeplay project for contributions to the LLVM SYCL implementation☆30Updated 3 years ago
- List all available information about all SYCL devices and platforms☆15Updated 4 years ago
- TAU Performance System Public Mirror (Updated every night at midnight, USA Pacific Time)☆39Updated last week
- Sample configuration files for using oneAPI in CI systems☆97Updated this week
- Partitioned Global Address Space (PGAS) library for distributed arrays☆102Updated last week
- ROCm Thrust - run Thrust dependent software on AMD GPUs☆104Updated this week
- Next generation LAPACK implementation for ROCm platform☆97Updated this week
- Reusable software components for ROCm developers☆81Updated this week
- Next generation FFT implementation for ROCm☆184Updated this week
- SLATE is a distributed, GPU-accelerated, dense linear algebra library targetting current and upcoming high-performance computing (HPC) sy…☆103Updated last week
- YASK--Yet Another Stencil Kit: a domain-specific language and framework to create high-performance stencil code for implementing finite-d…☆106Updated 5 months ago
- Autonomic Performance Environment for eXascale (APEX)☆42Updated this week
- floating-point errors checker☆55Updated last month
- BLAS-like Library Instantiation Software Framework☆131Updated this week
- BLAS++ is a C++ wrapper around CPU and GPU BLAS (basic linear algebra subroutines), developed as part of the SLATE project.☆73Updated last week
- Department of Energy Standard Utility Library☆30Updated 4 months ago
- Copy-hiding array abstraction to automatically migrate data between memory spaces☆106Updated this week
- This is ROCgdb, the ROCm source-level debugger for Linux, based on GDB, the GNU source-level debugger.☆53Updated this week
- GTensor is a multi-dimensional array C++14 header-only library for hybrid GPU development.☆35Updated 3 months ago
- RAJA Performance Suite☆117Updated this week
- AMD’s C++ library for accelerating tensor primitives☆38Updated this week
- oneAPI Level Zero Conformance & Performance test content☆48Updated this week