DmitryLyakh / CUDA_Tutorial
☆23 Updated 5 years ago
Alternatives and similar repositories for CUDA_Tutorial
Users who are interested in CUDA_Tutorial are comparing it to the repositories listed below
- CompPhys - a Computational Physics repository ☆90 Updated last year
- MiniMD Molecular Dynamics Mini-App ☆49 Updated 3 months ago
- MagmaDNN: a simple deep learning framework in c++ ☆49 Updated 4 years ago
- Specialized Parallel Linear Algebra, providing distributed GEMM functionality for specific matrix distributions with optional GPU acceler… ☆29 Updated 11 months ago
- Introduction to CUDA programming ☆118 Updated 8 years ago
- Example codes from the book Parallel Programming With OpenACC ☆85 Updated 8 years ago
- GPU Eigensolver for symmetric/hermitian matrices. ☆65 Updated 3 years ago
- Sparse 3D FFT library with MPI, OpenMP, CUDA and ROCm support ☆53 Updated 3 months ago
- A C++ library for computing large scale tensor contractions. ☆38 Updated 6 years ago
- QMCPACK miniapp: a simplified real space QMC code for algorithm development, performance portability testing, and computer science experi… ☆27 Updated 10 months ago
- ☆12 Updated last year
- Tensor Algebra Library Routines for Shared Memory Systems ☆38 Updated last year
- Fast and full-featured Matrix Market I/O library for C++, Python, and R ☆79 Updated 10 months ago
- DLA-Future ☆74 Updated 2 weeks ago
- Distributed Communication-Optimal Matrix-Matrix Multiplication Algorithm ☆206 Updated 3 weeks ago
- A Massively Parallel FFT Library for CPU/GPU ☆56 Updated 4 years ago
- This repository mirrors the principal Gitlab repository of the Chebyshev Accelerated Subspace iteration Eigensolver. If you want to contr… ☆18 Updated 3 weeks ago
- Contains sources related to the lectures and labs for the NVIDIA OpenACC course. ☆51 Updated 5 years ago
- ☆57 Updated 3 weeks ago
- Material for the SC21 Deep Learning at Scale Tutorial ☆25 Updated 2 years ago
- The fftMPI library performs 2d/3d FFTs in parallel for grids distributed across MPI processes. ☆14 Updated 3 years ago
- Qbox public repository ☆36 Updated last month
- Offload Eigen operations to GPUs ☆20 Updated 3 years ago
- Intermediate MPI lesson ☆28 Updated 2 years ago
- DBCSR: Distributed Block Compressed Sparse Row matrix library ☆142 Updated last week
- This repository contains application codes and solutions for the Book on "OpenACC for Programmers - Concept & Strategies". ☆34 Updated 6 years ago
- ALCF Computational Performance Workshop ☆37 Updated 2 years ago
- A scalable eigensolver for dense, symmetric (hermitian) matrices (fork of https://gitlab.mpcdf.mpg.de/elpa/elpa.git) ☆30 Updated 4 months ago
- ☆21 Updated 4 years ago
- Highly Efficient FFT for Exascale ☆38 Updated last year