amd / amd-fftw
FFTW code optimized for AMD based processors
☆49Updated 3 weeks ago
Related projects ⓘ
Alternatives and complementary repositories for amd-fftw
- Compiler agnostic metaprogramming library providing concepts, type operations and tuples for C++ and cuda☆78Updated this week
- Codeplay project for contributions to the LLVM SYCL implementation☆30Updated 3 years ago
- An implementation of HIP that works on CPUs, across OSes.☆112Updated 7 months ago
- nvptx-tools: a collection of tools for use with nvptx-none GCC toolchains.☆46Updated 2 months ago
- Partitioned Global Address Space (PGAS) library for distributed arrays☆100Updated this week
- DLA-Future☆65Updated this week
- hipFFT is a FFT marshalling library.☆53Updated this week
- DEPRECATED. This Scalapck repository is deprecated. The last version in this repository is 3.0. Refer to "aocl-scalapack" repository unde…☆9Updated 3 years ago
- Reusable software components for ROCm developers☆78Updated this week
- Portable HPC Containers (C++)☆48Updated this week
- BLAS-like Library Instantiation Software Framework☆128Updated 2 weeks ago
- List all available information about all SYCL devices and platforms☆15Updated 4 years ago
- ☆13Updated 4 months ago
- Distributed View Extension for Kokkos☆43Updated 2 months ago
- AMD’s C++ library for accelerating tensor primitives☆34Updated this week
- SLATE is a distributed, GPU-accelerated, dense linear algebra library targetting current and upcoming high-performance computing (HPC) sy…☆92Updated last week
- Department of Energy Standard Utility Library☆30Updated 2 months ago
- SYCL Conformance Tests☆62Updated last week
- ROCm Thrust - run Thrust dependent software on AMD GPUs☆99Updated this week
- Molecular dynamics proxy application based on Kokkos☆31Updated 3 months ago
- Next generation LAPACK implementation for ROCm platform☆93Updated this week
- High-performance object-based library for DLA computations☆38Updated 2 weeks ago
- Kokkos C++ Performance Portability Programming Ecosystem: Profiling and Debugging Tools☆112Updated 2 months ago
- Copy-hiding array abstraction to automatically migrate data between memory spaces☆106Updated last week
- Implementation of AMD HIP for CPUs☆22Updated 4 years ago
- ROCm SPARSE marshalling library☆69Updated this week
- Counter-based random number generators for C, C++ and CUDA.☆88Updated 8 months ago
- Autonomic Performance Environment for eXascale (APEX)☆38Updated last week
- Asynchronous Task and Memory Interface, or ATMI, is a runtime framework and programming model for heterogeneous CPU-GPU systems. It provi…☆66Updated 8 months ago
- PMIx Reference RunTime Environment (PRRTE)☆35Updated this week