af-ayala / heffte
Highly Efficient FFT for Exascale
☆40 Updated last year
Alternatives and similar repositories for heffte
Users who are interested in heffte are comparing it to the libraries listed below.
- ☆105 Updated this week
- Training examples for SYCL ☆49 Updated last week
- An Adaptive Pencil Decomposition Library for NVIDIA GPUs ☆69 Updated 2 weeks ago
- Run a parallel command inside a split tmux window ☆151 Updated 3 years ago
- SLATE is a distributed, GPU-accelerated, dense linear algebra library targeting current and upcoming high-performance computing (HPC) sy… ☆125 Updated 3 months ago
- Comb is a communication performance benchmarking tool. ☆25 Updated 2 years ago
- RAJA Performance Suite ☆121 Updated last week
- Molecular dynamics proxy application based on Kokkos ☆34 Updated last year
- PaRSEC is a generic framework for architecture aware scheduling and management of micro-tasks on distributed, GPU accelerated, many-core … ☆69 Updated 2 weeks ago
- Sparse 3D FFT library with MPI, OpenMP, CUDA and ROCm support ☆55 Updated last month
- GEMMul8 (GEMMulate): GEMM emulation using int8 matrix engines based on the Ozaki Scheme II ☆25 Updated last week
- ☆32 Updated 2 weeks ago
- Intermediate MPI lesson ☆27 Updated 2 years ago
- CPE change log and release notes ☆26 Updated last year
- Distributed View Extension for Kokkos ☆47 Updated 9 months ago
- A proxy app for the Monte Carlo Transport Code, Mercury. LLNL-CODE-684037 ☆45 Updated last year
- High-performance, GPU-aware communication library ☆86 Updated 7 months ago
- ☆14 Updated 2 years ago
- Wrapper interface for MPI ☆94 Updated 3 months ago
- Distributed Communication-Optimal Matrix-Matrix Multiplication Algorithm ☆208 Updated 3 months ago
- OpenACC* to OpenMP* API assisting migration tool ☆37 Updated 10 months ago
- ☆74 Updated this week
- Pragmatic, Productive, and Portable Affinity for HPC ☆44 Updated this week
- GTensor is a multi-dimensional array C++14 header-only library for hybrid GPU development. ☆35 Updated 4 months ago
- YAKL is A Kokkos Layer: A simple C++ framework for performance portability and Fortran code porting ☆71 Updated 2 months ago
- MPI accelerator-integrated communication extensions ☆37 Updated 2 years ago
- Data parallel C++ mathematical object library ☆165 Updated last week
- DBCSR: Distributed Block Compressed Sparse Row matrix library ☆145 Updated last week
- Collective and Neighbor Collective Optimizations and Extensions ☆11 Updated last week
- Sources for the Oak Ridge Leadership Computing Facility User Documentation ☆66 Updated last week