amd / amd-fftw
FFTW code optimized for AMD based processors
☆50Updated 5 months ago
Alternatives and similar repositories for amd-fftw:
Users that are interested in amd-fftw are comparing it to the libraries listed below
- DEPRECATED. This Scalapck repository is deprecated. The last version in this repository is 3.0. Refer to "aocl-scalapack" repository unde…☆9Updated 4 years ago
- Kokkos C++ Performance Portability Programming Ecosystem: Profiling and Debugging Tools☆121Updated 2 months ago
- nvptx-tools: a collection of tools for use with nvptx-none GCC toolchains.☆49Updated 6 months ago
- An implementation of HIP that works on CPUs, across OSes.☆115Updated last year
- Next generation FFT implementation for ROCm☆188Updated this week
- Compiler agnostic metaprogramming library providing concepts, type operations and tuples for C++ and cuda☆84Updated last week
- Next generation LAPACK implementation for ROCm platform☆99Updated this week
- Partitioned Global Address Space (PGAS) library for distributed arrays☆101Updated this week
- DLA-Future☆70Updated this week
- Codeplay project for contributions to the LLVM SYCL implementation☆30Updated 4 years ago
- Sparse 3D FFT library with MPI, OpenMP, CUDA and ROCm support☆51Updated 2 weeks ago
- ☆13Updated 2 weeks ago
- ROCm Thrust - run Thrust dependent software on AMD GPUs☆106Updated this week
- High-performance object-based library for DLA computations☆40Updated last month
- Copy-hiding array abstraction to automatically migrate data between memory spaces☆106Updated this week
- Molecular dynamics proxy application based on Kokkos☆32Updated 8 months ago
- SYCL Conformance Tests☆68Updated this week
- List all available information about all SYCL devices and platforms☆15Updated 4 years ago
- AOMP is an open source Clang/LLVM based compiler with added support for the OpenMP® API on Radeon™ GPUs. Use this repository for releas…☆211Updated this week
- Counter-based random number generators for C, C++ and CUDA.☆93Updated last year
- An MPI ABI compatibility layer☆32Updated 2 weeks ago
- Reusable software components for ROCm developers☆83Updated this week
- Sample configuration files for using oneAPI in CI systems☆99Updated last month
- AMD’s C++ library for accelerating tensor primitives☆39Updated this week
- SLATE is a distributed, GPU-accelerated, dense linear algebra library targetting current and upcoming high-performance computing (HPC) sy…☆110Updated 2 months ago
- ☆15Updated 2 years ago
- Generate simple index ranges in C++ and CUDA C++☆39Updated last year
- TAU Performance System Public Mirror (Updated every night at midnight, USA Pacific Time)☆42Updated this week
- Autonomic Performance Environment for eXascale (APEX)☆44Updated this week
- Distributed View Extension for Kokkos☆45Updated 3 months ago