MagmaDNN / magmadnn
MagmaDNN: a simple deep learning framework in c++
☆49Updated 4 years ago
Alternatives and similar repositories for magmadnn:
Users that are interested in magmadnn are comparing it to the libraries listed below
- Distributed Communication-Optimal LU-factorization Algorithm☆12Updated 3 years ago
- Subset of BLAS routines optimized for NVIDIA GPUs☆68Updated last year
- MPI accelerator-integrated communication extensions☆32Updated last year
- resources pour le cours d'introduction à la programmation des GPUs du mastère spécialisé HPC-AI☆22Updated last year
- Next generation library for iterative sparse solvers for ROCm platform☆78Updated last week
- MATLAB Code for Parameters of Floating-Point Arithmetics☆8Updated 2 years ago
- A proxy app for the Monte Carlo Transport Code, Mercury. LLNL-CODE-684037☆40Updated last year
- FP64 equivalent GEMM via Int8 Tensor Cores using the Ozaki scheme☆56Updated last month
- ☆23Updated last week
- MiniAMR Adaptive Mesh Refinement (AMR) Mini-App☆34Updated 4 months ago
- Specialized Parallel Linear Algebra, providing distributed GEMM functionality for specific matrix distributions with optional GPU acceler…☆28Updated 8 months ago
- cuASR: CUDA Algebra for Semirings☆35Updated 2 years ago
- Generate simple index ranges in C++ and CUDA C++☆39Updated last year
- SLATE is a distributed, GPU-accelerated, dense linear algebra library targetting current and upcoming high-performance computing (HPC) sy…☆110Updated 2 months ago
- High-performance Geometric Multigrid☆34Updated 5 years ago
- A unified framework across multiple programming platforms☆36Updated 9 months ago
- ☆17Updated last year
- Distributed View Extension for Kokkos☆45Updated 3 months ago
- RAJA Performance Suite☆118Updated last week
- GTensor is a multi-dimensional array C++14 header-only library for hybrid GPU development.☆36Updated 6 months ago
- ☆17Updated 5 years ago
- Matrix multiplication on GPUs for matrices stored on a CPU. Similar to cublasXt, but ported to both NVIDIA and AMD GPUs.☆30Updated 3 months ago
- HiCMA: Hierarchical Computations on Manycore Architectures☆30Updated 2 years ago
- AMD optimized Sparse Linear Algebra library☆27Updated last week
- ☆43Updated 4 years ago
- Error-Free Transformations as building blocks for compensated algorithms☆14Updated 2 years ago
- sparse matrix pre-processing library☆81Updated 10 months ago
- NUMA-aware multi-CPU multi-GPU data transfer benchmarks☆22Updated last year
- SYCL materials for ENCCS workshop☆25Updated last year
- PaRSEC is a generic framework for architecture aware scheduling and management of micro-tasks on distributed, GPU accelerated, many-core …☆55Updated last week