intel / double-batched-fft-library
☆13Updated 3 weeks ago
Alternatives and similar repositories for double-batched-fft-library:
Users who are interested in double-batched-fft-library are comparing it to the libraries listed below.
- Compiler-agnostic metaprogramming library providing concepts, type operations and tuples for C++ and CUDA☆84Updated last week
- Copy-hiding array abstraction to automatically migrate data between memory spaces☆107Updated this week
- Department of Energy Standard Utility Library☆31Updated last month
- ☆38Updated last month
- SLATE is a distributed, GPU-accelerated, dense linear algebra library targeting current and upcoming high-performance computing (HPC) sy…☆110Updated 2 months ago
- C++ template library for floating point operations☆24Updated this week
- ROCm SOLVER marshalling library☆25Updated this week
- DLA-Future☆70Updated this week
- A shared-memory FFT for the Kokkos ecosystem☆31Updated this week
- Portable HPC Containers (C++)☆48Updated 2 weeks ago
- Kokkos C++ Performance Portability Programming Ecosystem: Profiling and Debugging Tools☆121Updated 2 months ago
- An MPI ABI compatibility layer☆32Updated 3 weeks ago
- Distributed ranges is a generalization of C++ ranges for distributed data structures.☆49Updated this week
- FFTW code optimized for AMD-based processors☆50Updated 5 months ago
- YASK--Yet Another Stencil Kit: a domain-specific language and framework to create high-performance stencil code for implementing finite-d…☆106Updated 8 months ago
- hipFFT is an FFT marshalling library.☆60Updated this week
- Next generation LAPACK implementation for ROCm platform☆99Updated this week
- Autonomic Performance Environment for eXascale (APEX)☆44Updated last week
- YAKL is A Kokkos Layer: A simple C++ framework for performance portability and Fortran code porting☆62Updated last week
- Random number library that generates pseudo-random and quasi-random numbers.☆26Updated this week
- ROCm Thrust - run Thrust dependent software on AMD GPUs☆106Updated last week
- BLAS++ is a C++ wrapper around CPU and GPU BLAS (basic linear algebra subroutines), developed as part of the SLATE project.☆77Updated 3 weeks ago
- SYCL Conformance Tests☆68Updated last week
- Codeplay project for contributions to the LLVM SYCL implementation☆30Updated 4 years ago
- Implementation of AMD HIP for CPUs☆22Updated 4 years ago
- CHAI and RAJA provide an excellent base on which to build portable codes. CARE expands that functionality, adding new features such as lo…☆30Updated this week
- OpenSHMEM Application Programming Interface☆54Updated 4 months ago
- Collaborating on papers for the ISO C++ committee - public repo☆26Updated 7 months ago
- DARMA/vt => Virtual Transport☆36Updated this week
- GTensor is a multi-dimensional array C++14 header-only library for hybrid GPU development.☆36Updated 6 months ago