intel / double-batched-fft-library
☆14Updated 3 months ago
Alternatives and similar repositories for double-batched-fft-library
Users who are interested in double-batched-fft-library are comparing it to the libraries listed below
- DLA-Future☆77Updated this week
- Compiler agnostic metaprogramming library providing concepts, type operations and tuples for C++ and cuda☆88Updated last month
- Kokkos C++ Performance Portability Programming Ecosystem: Profiling and Debugging Tools☆134Updated last month
- Copy-hiding array abstraction to automatically migrate data between memory spaces☆108Updated this week
- Partitioned Global Address Space (PGAS) library for distributed arrays☆104Updated last week
- FFTW code optimized for AMD based processors☆54Updated 3 months ago
- BLAS++ is a C++ wrapper around CPU and GPU BLAS (basic linear algebra subroutines), developed as part of the SLATE project.☆82Updated last month
- SLATE is a distributed, GPU-accelerated, dense linear algebra library targeting current and upcoming high-performance computing (HPC) sy…☆125Updated 2 months ago
- Department of Energy Standard Utility Library☆32Updated 2 months ago
- RAJA Performance Suite☆120Updated this week
- Autonomic Performance Environment for eXascale (APEX)☆49Updated 3 weeks ago
- Sparse 3D FFT library with MPI, OpenMP, CUDA and ROCm support☆55Updated 2 weeks ago
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆68Updated this week
- CS infrastructure components for HPC applications☆173Updated this week
- Multiresolution Adaptive Numerical Environment for Scientific Simulation☆207Updated this week
- Distributed ranges is a generalization of C++ ranges for distributed data structures.☆51Updated this week
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆119Updated this week
- C++ Template Linear Algebra PACKage☆47Updated last week
- List all available information about all SYCL devices and platforms☆15Updated 4 years ago
- Next generation LAPACK implementation for ROCm platform☆108Updated this week
- Intel Data Parallel C++ (and SYCL 2020) Tutorial.☆94Updated 3 years ago
- Implementation of AMD HIP for CPUs☆22Updated 5 years ago
- Codeplay project for contributions to the LLVM SYCL implementation☆30Updated 4 years ago
- Portable HPC Containers (C++)☆48Updated last month
- Examples for using SYCL on CUDA☆62Updated last month
- A shared-memory FFT for the Kokkos ecosystem☆41Updated last week
- GTensor is a multi-dimensional array C++14 header-only library for hybrid GPU development.☆35Updated 4 months ago
- This aims to be a wrapper to C-MPI3 for C++, using the principles of simplicity, STL, RAII and Boost and enforcing type-safety. This i…☆23Updated 10 months ago
- A massively-parallel, block-sparse tensor framework written in C++☆305Updated this week
- YASK--Yet Another Stencil Kit: a domain-specific language and framework to create high-performance stencil code for implementing finite-d…☆108Updated 3 weeks ago