NVIDIA / cuDecomp
An Adaptive Pencil Decomposition Library for NVIDIA GPUs
☆60Updated this week
Alternatives and similar repositories for cuDecomp:
Users who are interested in cuDecomp are comparing it to the libraries listed below.
- YAKL is A Kokkos Layer: A simple C++ framework for performance portability and Fortran code porting☆64Updated 3 weeks ago
- A shared-memory FFT for the Kokkos ecosystem☆31Updated this week
- SLATE is a distributed, GPU-accelerated, dense linear algebra library targeting current and upcoming high-performance computing (HPC) sy…☆111Updated 3 months ago
- Distributed View Extension for Kokkos☆45Updated 4 months ago
- Experimental MPI Wrapper for Kokkos☆19Updated 3 weeks ago
- DDC is a discrete domain computation library.☆37Updated this week
- Fortran interfaces for ROCm libraries☆75Updated this week
- Comb is a communication performance benchmarking tool.☆24Updated 2 years ago
- IPPL is a C++ library to develop performance portable code for fully Eulerian, Lagrangian or hybrid Eulerian-Lagrangian methods.☆32Updated 2 weeks ago
- Training examples for SYCL☆40Updated last week
- ☆39Updated last month
- GPU-Enabled, Zero-Copy AMReX Python Bindings including AI/ML☆43Updated last week
- Molecular dynamics proxy application based on Kokkos☆32Updated 9 months ago
- Gyselalib++ is a collection of C++ components for writing gyrokinetic semi-Lagrangian codes and similar☆29Updated this week
- GTensor is a multi-dimensional array C++14 header-only library for hybrid GPU development.☆36Updated last week
- The Kokkos Fortran Interop repository contains tools and interfaces which help interactions between Fortran portions of an applications a…☆35Updated 5 months ago
- OPS is an API with associated libraries and preprocessors to generate parallel executables for applications on multi-block structured mes…☆66Updated this week
- Library of GPU-resident linear solvers☆61Updated last week
- Kokkos C++ Performance Portability Programming Ecosystem: Profiling and Debugging Tools☆121Updated 2 months ago
- ☆30Updated 10 months ago
- CS infrastructure components for HPC applications☆171Updated this week
- JIT Compilation for Multiple Architectures: C++, OpenMP, CUDA, HIP, OpenCL, Metal☆13Updated 4 months ago
- Kripke is a simple, scalable, 3D Sn deterministic particle transport code☆39Updated 3 months ago
- This aims to be a wrapper to C-MPI3 for C++, using the principles of simplicity, STL, RAII and Boost and enforcing type-safety. This i…☆22Updated 6 months ago
- A proxy app for the Monte Carlo Transport Code, Mercury. LLNL-CODE-684037☆41Updated last year
- Department of Energy Standard Utility Library☆31Updated last month
- AmgXWrapper: An interface between PETSc and the NVIDIA AmgX library☆47Updated 2 years ago
- SPH-EXA is a C++20 simulation code for performing hydrodynamics simulations (with gravity and other physics), parallelized with MPI, Open…☆84Updated this week
- Astrophysical multifluid radiation hydrodynamics code☆25Updated this week
- A collection of physics databases and implementation code for use with the Pele suite of codes☆65Updated last week