amd / amd-fftwLinks
FFTW code optimized for AMD based processors
☆54Updated 3 months ago
Alternatives and similar repositories for amd-fftw
Users that are interested in amd-fftw are comparing it to the libraries listed below
Sorting:
- Compiler agnostic metaprogramming library providing concepts, type operations and tuples for C++ and cuda☆88Updated last month
- nvptx-tools: a collection of tools for use with nvptx-none GCC toolchains.☆52Updated 11 months ago
- DLA-Future☆77Updated this week
- An implementation of HIP that works on CPUs, across OSes.☆122Updated last year
- Kokkos C++ Performance Portability Programming Ecosystem: Profiling and Debugging Tools☆134Updated last month
- List all available information about all SYCL devices and platforms☆15Updated 4 years ago
- BLAS++ is a C++ wrapper around CPU and GPU BLAS (basic linear algebra subroutines), developed as part of the SLATE project.☆82Updated last month
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆195Updated this week
- Copy-hiding array abstraction to automatically migrate data between memory spaces☆108Updated this week
- ☆14Updated 3 months ago
- AMD optimized Sparse Linear Algebra library☆32Updated 3 weeks ago
- SYCL Conformance Tests☆70Updated 2 weeks ago
- Counter-based random number generators for C, C++ and CUDA.☆105Updated last year
- Portable HPC Containers (C++)☆48Updated last month
- Implementation of AMD HIP for CPUs☆22Updated 5 years ago
- Partitioned Global Address Space (PGAS) library for distributed arrays☆104Updated last week
- YASK--Yet Another Stencil Kit: a domain-specific language and framework to create high-performance stencil code for implementing finite-d…☆108Updated 3 weeks ago
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆63Updated this week
- Autonomic Performance Environment for eXascale (APEX)☆49Updated 3 weeks ago
- Codeplay project for contributions to the LLVM SYCL implementation☆30Updated 4 years ago
- Sparse 3D FFT library with MPI, OpenMP, CUDA and ROCm support☆55Updated 2 weeks ago
- RAJA Performance Suite☆120Updated this week
- High-performance object-based library for DLA computations☆42Updated 3 months ago
- Next generation LAPACK implementation for ROCm platform☆108Updated this week
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆119Updated this week
- Examples for using SYCL on CUDA☆62Updated last month
- Parallel fast Fourier transforms☆56Updated 6 years ago
- An OpenMP runtime implemented using HPX☆24Updated 3 years ago
- A streamlined CMake build system foundation for developing HPC software☆274Updated this week
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆84Updated this week