DmitryLyakh / CUDA_Tutorial
☆23Updated 5 years ago
Alternatives and similar repositories for CUDA_Tutorial:
Users who are interested in CUDA_Tutorial are comparing it to the libraries listed below.
- MagmaDNN: a simple deep learning framework in C++☆49Updated 4 years ago
- QMCPACK miniapp: a simplified real space QMC code for algorithm development, performance portability testing, and computer science experi…☆27Updated 8 months ago
- MiniMD Molecular Dynamics Mini-App☆50Updated last month
- GPU Eigensolver for symmetric/hermitian matrices.☆65Updated 3 years ago
- CompPhys - a Computational Physics repository☆89Updated last year
- A C++ library for computing large scale tensor contractions.☆38Updated 6 years ago
- The fftMPI library performs 2d/3d FFTs in parallel for grids distributed across MPI processes.☆14Updated 2 years ago
- Sparse 3D FFT library with MPI, OpenMP, CUDA and ROCm support☆53Updated last month
- Specialized Parallel Linear Algebra, providing distributed GEMM functionality for specific matrix distributions with optional GPU acceler…☆29Updated 9 months ago
- Contains sources related to the lectures and labs for the NVIDIA OpenACC course.☆51Updated 5 years ago
- GTensor is a multi-dimensional array C++14 header-only library for hybrid GPU development.☆36Updated last week
- DBCSR: Distributed Block Compressed Sparse Row matrix library☆142Updated this week
- A Massively Parallel FFT Library for CPU/GPU☆56Updated 4 years ago
- Distributed Communication-Optimal LU-factorization Algorithm☆12Updated 3 years ago
- A proxy app for the Monte Carlo Transport Code, Mercury. LLNL-CODE-684037☆41Updated last year
- Example codes from the book Parallel Programming With OpenACC☆85Updated 8 years ago
- Tensor Algebra Library Routines for Shared Memory Systems☆38Updated last year
- Molecular dynamics proxy application based on Kokkos☆32Updated 9 months ago
- This repository mirrors the principal Gitlab repository of the Chebyshev Accelerated Subspace iteration Eigensolver. If you want to contr…☆17Updated last week
- Highly Efficient FFT for Exascale☆37Updated 11 months ago
- Implementation of MPI that supports large counts☆48Updated 4 months ago
- ☆12Updated last year
- Intermediate MPI lesson☆27Updated last year
- DLA-Future☆71Updated this week
- This repository contains application codes and solutions for the Book on "OpenACC for Programmers - Concept & Strategies".☆34Updated 6 years ago
- Solving Poisson equation using a spectral method, also introducing VTK which will probably be used for other projects☆16Updated last year
- SLATE is a distributed, GPU-accelerated, dense linear algebra library targeting current and upcoming high-performance computing (HPC) sy…☆112Updated 3 months ago
- Tools to run and parse MKL verbose mode☆17Updated 2 years ago
- CUDA Tensor Transpose (cuTT) library☆51Updated 7 years ago
- ☆71Updated 2 months ago