pkestene / MS-HPC-AI-GPU
resources pour le cours d'introduction à la programmation des GPUs du mastère spécialisé HPC-AI
☆22Updated 8 months ago
Related projects: ⓘ
- Kokkos Remote Spaces implements distributed Kokkos Views and related APIs for distributed parallel programming.☆42Updated 2 weeks ago
- GTensor is a multi-dimensional array C++14 header-only library for hybrid GPU development.☆34Updated 8 months ago
- Comb is a communication performance benchmarking tool.☆23Updated last year
- C++ HPC Tutorial materials☆46Updated 2 months ago
- Intermediate MPI lesson☆25Updated last year
- MiniFE Finite Element Mini-Application☆28Updated 4 months ago
- MiniAMR Adaptive Mesh Refinement (AMR) Mini-App☆31Updated 2 months ago
- Library of GPU-resident linear solvers☆51Updated this week
- Yet Another Kernel Launcher: A simple C++ framework for performance portability and Fortran code porting☆55Updated last month
- Highly Efficient FFT for Exascale☆35Updated 4 months ago
- Experimental MPI Wrapper for Kokkos☆12Updated last week
- Training examples for SYCL☆38Updated 6 months ago
- SLATE is a distributed, GPU-accelerated, dense linear algebra library targetting current and upcoming high-performance computing (HPC) sy…☆84Updated 2 months ago
- Matrix multiplication on GPUs for matrices stored on a CPU. Similar to cublasXt, but ported to both NVIDIA and AMD GPUs.☆21Updated last week
- Next generation library for iterative sparse solvers for ROCm platform☆74Updated this week
- AmgXWrapper: An interface between PETSc and the NVIDIA AmgX library☆42Updated 2 years ago
- ☆21Updated 3 weeks ago
- DDC is a discrete domain computation library.☆32Updated this week
- An Adaptive Pencil Decomposition Library for NVIDIA GPUs☆55Updated 2 months ago
- Software to support people learning OpenMP with our book ... The OpenMP Common Core: Making OpenMP Simple Again☆70Updated 10 months ago
- MagmaDNN: a simple deep learning framework in c++☆45Updated 4 years ago
- NVIDIA HPCG is based on the HPCG benchmark and optimized for performance on NVIDIA accelerated HPC systems.☆41Updated 3 weeks ago
- This aims to be an wrapper to C-MPI3 for C++, using the principles of simplicity, STL, RAII and Boost and enforcing type-safety. This i…☆20Updated 2 months ago
- Example codes demonstrating the use of various XSDK packages in combination.☆16Updated last year
- This tool serves as a test harness for different optimization techniques to improve stencil computations performance in shared and distri…☆20Updated last year
- Distributed Communication-Optimal LU-factorization Algorithm☆12Updated 3 years ago
- Molecular dynamics proxy application based on Kokkos☆30Updated 2 months ago
- Algebraic multigrid benchmark☆28Updated 2 months ago
- JIT Compilation for Multiple Architectures: C++, OpenMP, CUDA, HIP, OpenCL, Metal☆12Updated last month
- Flexible local Fourier analysis library.☆11Updated 3 years ago