NVIDIA / cuDecompLinks
An Adaptive Pencil Decomposition Library for NVIDIA GPUs
☆76Updated last week
Alternatives and similar repositories for cuDecomp
Users that are interested in cuDecomp are comparing it to the libraries listed below
Sorting:
- Distributed View Extension for Kokkos☆49Updated last year
- YAKL is A Kokkos Layer: A simple C++ framework for performance portability and Fortran code porting☆69Updated 4 months ago
- GTensor is a multi-dimensional array C++14 header-only library for hybrid GPU development.☆37Updated last week
- A shared-memory FFT for the Kokkos ecosystem☆46Updated this week
- MiniAMR Adaptive Mesh Refinement (AMR) Mini-App☆38Updated last year
- Experimental Explicit Communications API for Kokkos☆29Updated last week
- DDC is a discrete domain computation library.☆41Updated this week
- Comb is a communication performance benchmarking tool.☆26Updated 2 years ago
- Fortran interfaces for ROCm libraries☆83Updated last week
- The Kokkos Fortran Interop repository contains tools and interfaces which help interactions between Fortran portions of an applications a…☆38Updated 5 months ago
- SLATE is a distributed, GPU-accelerated, dense linear algebra library targetting current and upcoming high-performance computing (HPC) sy…☆131Updated 3 months ago
- SPH-EXA is a C++20 simulation code for performing hydrodynamics simulations (with gravity and other physics), parallelized with MPI, Open…☆101Updated last week
- Molecular dynamics proxy application based on Kokkos☆33Updated last year
- OPS is an API with associated libraries and preprocessors to generate parallel executables for applications on mulit-block structured mes…☆72Updated 3 weeks ago
- Tools and libraries for writing Kokkos-enabled HPC C++ in E3SM ecosystem☆19Updated last week
- GPU-Enabled, Zero-Copy AMReX Python Bindings including AI/ML☆52Updated last week
- ☆49Updated 2 months ago
- IPPL is a C++ library to develop performance portable code for fully Eulerian, Lagrangian or hybrid Eulerian-Lagrangian methods.☆52Updated last week
- Structured PIC proxy app based on Cabana☆15Updated 7 months ago
- P3DFFT++ (a.k.a. P3DFFT v. 3) is a new generation of P3DFFT library that aims to provide a comprehensive framework for simulating multis…☆22Updated 2 years ago
- A flyweight in situ visualization and analysis runtime for multi-physics HPC simulations☆233Updated this week
- Parthenon AMR infrastructure☆150Updated last week
- The sources for the OpenACC Programming and Best Practices Guide.☆40Updated last week
- A parallel programming training mini app simulating weather-like flows☆173Updated 5 months ago
- Collective and Neighbor Collective Optimizations and Extensions☆13Updated last week
- Library of GPU-resident linear solvers☆75Updated this week
- Training examples for SYCL☆49Updated 2 months ago
- Next generation library for iterative sparse solvers for ROCm platform☆94Updated last week
- Kokkos C++ Performance Portability Programming Ecosystem: Profiling and Debugging Tools☆138Updated 3 weeks ago
- ☆33Updated last year