NVIDIA / cuDecompLinks
An Adaptive Pencil Decomposition Library for NVIDIA GPUs
☆69Updated this week
Alternatives and similar repositories for cuDecomp
Users that are interested in cuDecomp are comparing it to the libraries listed below
Sorting:
- GTensor is a multi-dimensional array C++14 header-only library for hybrid GPU development.☆35Updated 6 months ago
- YAKL is A Kokkos Layer: A simple C++ framework for performance portability and Fortran code porting☆69Updated 3 weeks ago
- A shared-memory FFT for the Kokkos ecosystem☆44Updated this week
- MiniAMR Adaptive Mesh Refinement (AMR) Mini-App☆37Updated 10 months ago
- Distributed View Extension for Kokkos☆48Updated 10 months ago
- SLATE is a distributed, GPU-accelerated, dense linear algebra library targetting current and upcoming high-performance computing (HPC) sy…☆129Updated 4 months ago
- Molecular dynamics proxy application based on Kokkos☆33Updated last year
- Sparse 3D FFT library with MPI, OpenMP, CUDA and ROCm support☆55Updated 2 months ago
- Fortran interfaces for ROCm libraries☆81Updated this week
- OPS is an API with associated libraries and preprocessors to generate parallel executables for applications on mulit-block structured mes…☆70Updated last week
- The Kokkos Fortran Interop repository contains tools and interfaces which help interactions between Fortran portions of an applications a…☆36Updated last month
- Experimental MPI Wrapper for Kokkos☆23Updated 2 weeks ago
- GPU-Enabled, Zero-Copy AMReX Python Bindings including AI/ML☆49Updated this week
- Training examples for SYCL☆49Updated last month
- ☆31Updated last year
- SPH-EXA is a C++20 simulation code for performing hydrodynamics simulations (with gravity and other physics), parallelized with MPI, Open…☆97Updated this week
- Comb is a communication performance benchmarking tool.☆25Updated 2 years ago
- Kripke is a simple, scalable, 3D Sn deterministic particle transport code☆40Updated 3 months ago
- A parallel programming training mini app simulating weather-like flows☆167Updated last month
- Algebraic multigrid benchmark☆34Updated last year
- Parthenon AMR infrastructure☆144Updated this week
- CS infrastructure components for HPC applications☆175Updated this week
- Kokkos C++ Performance Portability Programming Ecosystem: Profiling and Debugging Tools☆134Updated 2 weeks ago
- A proxy app for the Monte Carlo Transport Code, Mercury. LLNL-CODE-684037☆46Updated last year
- A flyweight in situ visualization and analysis runtime for multi-physics HPC simulations☆219Updated 2 weeks ago
- DDC is a discrete domain computation library.☆40Updated this week
- OpenACC* to OpenMP* API assisting migration tool☆38Updated 3 weeks ago
- MiniFE Finite Element Mini-Application☆35Updated last year
- ☆113Updated this week
- AmgXWrapper: An interface between PETSc and the NVIDIA AmgX library☆47Updated 3 years ago